Advanced AI News
Google DeepMind

Google’s Banana Model Takes the Top Spot Overnight! Overthrows GPT-4o and FLUX, Solidifies Its Position as the King of AI Images

By Advanced AI Editor | August 27, 2025 | 7 Mins Read


On August 27, Smart Things reported that Google had launched Gemini 2.5 Flash Image, the company’s most advanced image generation and editing model.

The core highlight of this model is its image editing capability. Google claims that the model can blend multiple images into a single image, maintain high character consistency, and perform targeted modifications from natural-language instructions, drawing on Gemini’s world knowledge.

Nobel Prize winner and CEO of Google DeepMind, Demis Hassabis, promoted the new model using his own photo, demonstrating the character consistency of Gemini 2.5 Flash Image. He modified the background of his photo to a classical style, while his appearance remained unchanged.

This capability has unlocked many interesting use cases, such as designing “star player cards” based on specific visual templates, allowing ordinary people to experience the treatment usually reserved for top athletes with just one click.

This model pairs perfectly with Google Veo 3 and other video generation models, creating rich video effects when used together. The overseas AI creative platform Kera AI has already used a similar model to produce a major advertisement.

The model had in fact already appeared in the large model arena (LMArena) last week under the codename “nano-banana” and received over two million votes from users. Now officially revealed, Gemini 2.5 Flash Image ranks first globally in both text-to-image and image editing, scoring 1362 on the image editing leaderboard and leading the second-place model by nearly 15%.

In Google’s published benchmark tests, Gemini 2.5 Flash Image outperformed GPT-4o image generation, Flux.1 Kontext (max), Qwen Image Edit, and other models in user preference, character, creativity, infographic, object, and environment generation, though it still lags behind GPT-4o image generation in stylization capabilities.

Gemini 2.5 Flash Image is primarily aimed at developers and is currently available through the Gemini API, Google AI Studio, and the enterprise-focused Vertex AI.
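Since the model is exposed through the Gemini API, an image-editing request might be assembled roughly as follows. This is a minimal sketch, not Google's reference code: the model ID `gemini-2.5-flash-image-preview`, the endpoint URL, and the helper name `build_edit_request` are assumptions made for illustration, and the block only builds the JSON body without sending it.

```python
import base64
import json

# Assumption: model ID and endpoint as commonly documented at launch;
# check the current Gemini API documentation before relying on either.
MODEL_ID = "gemini-2.5-flash-image-preview"
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    f"{MODEL_ID}:generateContent"
)


def build_edit_request(prompt, images):
    """Build a generateContent-style JSON body (hypothetical helper).

    prompt: natural-language editing instruction.
    images: list of (raw_bytes, mime_type) tuples; passing several
            images corresponds to the multi-image blending use case.
    """
    parts = [{"text": prompt}]
    for data, mime_type in images:
        parts.append(
            {
                "inline_data": {
                    "mime_type": mime_type,
                    # Image bytes are carried as base64 text inside the JSON.
                    "data": base64.b64encode(data).decode("ascii"),
                }
            }
        )
    return {"contents": [{"parts": parts}]}


# Placeholder bytes stand in for a real photo in this sketch.
body = build_edit_request(
    "Change the background to a classical style; keep the person unchanged.",
    [(b"fake-image-bytes", "image/png")],
)
print(ENDPOINT)
print(json.dumps(body)[:80])
```

To actually send the request, the body would be POSTed to the endpoint with an API key; that step is left out here because it depends on account setup.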

The model is priced at $30 per 1 million output tokens. Each image consumes 1,290 output tokens, which works out to approximately $0.039 per image (about 0.28 RMB). All other input and output modalities follow Gemini 2.5 Flash pricing.
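The per-image figure follows directly from the two quoted numbers. The short sketch below reproduces the arithmetic and estimates how many images fit in a given budget; the function names are illustrative, and it counts only image output tokens, not input or text tokens.

```python
# Figures quoted in the article.
PRICE_PER_MILLION_OUTPUT_TOKENS_USD = 30.0
OUTPUT_TOKENS_PER_IMAGE = 1290


def image_cost_usd(num_images=1):
    """Output-token cost in USD for generating num_images images."""
    tokens = num_images * OUTPUT_TOKENS_PER_IMAGE
    return tokens * PRICE_PER_MILLION_OUTPUT_TOKENS_USD / 1_000_000


def images_within_budget(budget_usd):
    """Number of whole images that fit in budget_usd (output tokens only)."""
    return int(budget_usd // image_cost_usd(1))


print(f"Cost per image: ${image_cost_usd(1):.4f}")
print(f"Cost for 1,000 images: ${image_cost_usd(1000):.2f}")
print(f"Images for $1: {images_within_budget(1.0)}")
```

One image costs 1290 × 30 / 1,000,000 = $0.0387, which rounds to the $0.039 quoted above.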

To make it easier to build AI applications with Gemini 2.5 Flash Image, Google has also made significant updates to AI Studio’s “Build Mode”. Developers can use AI to create applications and quickly test the features of new models such as Gemini 2.5 Flash Image.

When an application is ready, developers can deploy it directly from Google AI Studio or save the code to GitHub. Google has also showcased several examples on its blog:

Superb Character Consistency Helps Altman Time Travel with One Click

Maintaining a consistent appearance for characters and objects across multi-turn dialogue and editing is a significant challenge in image generation and editing. Google’s Gemini 2.5 Flash Image allows users to place the same character in different environments, showcase a single product from multiple angles in a new environment, or generate consistent brand assets while retaining the subject.

In the example application below, users only need to upload a selfie to generate six portraits from the 1950s to the 2000s, each with the style of the respective era, with no significant deviation in the user’s appearance.

Smart Things also uploaded a photo of OpenAI co-founder and CEO Sam Altman, and Google’s new model let Altman travel back in time with one click. The resulting images were highly realistic and accurately reproduced the clothing styles of each era.

This consistency can also be applied in professional design scenarios. For example, users can provide the model with a specific texture and request a replacement. The model can complete the texture replacement without altering the shape and details.

Demo link:

https://aistudio.google.com/apps/bundled/past_forward?showPreview=true&showAssistant=true

Precise Image Editing with One Sentence, Customizable Light and Color

Gemini 2.5 Flash Image supports image transformation and editing using natural language. For example, the model can blur the background of an image, remove stains from a T-shirt, delete entire people from photos, change the pose of the photographed subject, and add color to black and white photos.

To showcase the practical applications of these features, Google built a photo editing template application in AI Studio. This photo editing application supports selecting and modifying specific areas or making broad adjustments and filter processing.

Smart Things uploaded a photo of Zuckerberg and asked the model to fine-tune it to make his teeth look whiter.

The final generated result is as follows, showing that Zuckerberg’s other facial features did not undergo significant changes after the modification.

Users can also customize the light, background, and more through preset prompts. In the image below, the lighting of the portrait has been adjusted to be warmer.

Demo link:

https://aistudio.google.com/apps/bundled/pixshop

Rich World Knowledge and Ability to Understand Hand-Drawn Illustrations

In the past, many image generation models could create beautiful visuals but lacked a deep semantic understanding of the real world. Google claims that Gemini 2.5 Flash Image possesses Gemini’s world knowledge. To demonstrate this, they created a template application that turns a simple canvas into an interactive educational mentor.

In the demonstration, Gemini 2.5 Flash Image can understand various hand-drawn images and answer a wide range of questions posed by users.

This world knowledge also enables the model to predict how a scene will change and gives it a certain level of image reasoning ability. For example, when shown a balloon flying next to a cactus, the model can generate an image of the balloon bursting in response to the user’s command to “predict the next possible scene.”

Demo link:

https://aistudio.google.com/apps/bundled/codrawing?showAssistant=true&showPreview=true

Outstanding Multi-Image Fusion Capabilities for Precise Product Display

Gemini 2.5 Flash Image can understand and merge multiple input images, which holds significant practical value in scenarios like e-commerce. For instance, merchants can use AI to generate promotional photos of different products in the same scene or provide customers with images of furniture and other products placed in real settings.

Below is an example provided by Google, in which users only need to drag the lamp from the left into the scene on the right and, after a short wait, can see how it looks in place. The model not only adds the lamp to the scene but also turns on the light. Note, however, that the generation process shown in the demonstration has been noticeably sped up.

The multi-image fusion capability can also be used to generate creative images. For example, merging photos of a whale and a mountain creates a visually striking effect.

Demo link:

https://aistudio.google.com/apps/bundled/home_canvas?showPreview=true&showAssistant=true

Since the launch of Gemini 2.5 Flash Image, overseas users have already started experimenting with it. One user created a mooncake advertisement using it, claiming that the same prompts would require ten times the adjustments and fine-tuning in Midjourney to achieve similar results.

Another user shared a video they created using Gemini 2.5 Flash Image in conjunction with Veo 3. During this process, Gemini 2.5 Flash Image generated many different angles of shots, while Veo 3 turned them into a video. The final effect was stunning.

However, some users have complained about the model’s strict censorship; for example, it cannot generate images of people holding knives or axes.

Conclusion: Image Editing Evolves Again, May Become an Important Productivity Tool

In a sense, accurate image editing is one of the most critical capabilities for bringing image generation into real production scenarios. In e-commerce and similar settings, it meets enterprise users’ demand for precise control, while in entertainment scenarios it can offer users rich experiences and new ways to play.

Several domestic and international large model developers have now launched image editing models, and the latest developments in this field are worth continued attention.


