Advanced AI News

Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

By Advanced AI Editor, August 26, 2025

Today, we’re excited to introduce Gemini 2.5 Flash Image (aka nano-banana), our state-of-the-art image generation and editing model. This update enables you to blend multiple images into a single image, maintain character consistency for rich storytelling, make targeted transformations using natural language, and use Gemini’s world knowledge to generate and edit images.

When we first launched native image generation in Gemini 2.0 Flash earlier this year, you told us you loved its low latency, cost-effectiveness, and ease of use. But you also gave us feedback that you needed higher-quality images and more powerful creative control.

This model is available right now via the Gemini API and Google AI Studio for developers and Vertex AI for enterprise. Gemini 2.5 Flash Image is priced at $30.00 per 1 million output tokens, and each image counts as 1,290 output tokens ($0.039 per image). All other modalities on input and output follow Gemini 2.5 Flash pricing.
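The per-image figure follows directly from the token price. Here is a quick check of that arithmetic, using only the two numbers quoted above; the function name and the batch of 1,000 images are illustrative assumptions, and input-token costs are not included.

```python
from decimal import Decimal

# Figures quoted in the announcement.
PRICE_PER_MILLION_OUTPUT_TOKENS = Decimal("30.00")  # USD
TOKENS_PER_IMAGE = 1290


def image_output_cost(num_images: int) -> Decimal:
    """Return the output-token cost in USD for generating `num_images` images."""
    tokens = TOKENS_PER_IMAGE * num_images
    return PRICE_PER_MILLION_OUTPUT_TOKENS * tokens / Decimal(1_000_000)


# One image: 1290 / 1,000,000 * 30 = 0.0387 USD, which the post rounds to $0.039.
print(image_output_cost(1))

# A hypothetical batch of 1,000 images (the batch size is an assumption).
print(image_output_cost(1000))
```

Using `Decimal` rather than floats keeps the currency arithmetic exact, so the result for one image is precisely 0.0387 USD before rounding.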

Gemini 2.5 Flash Image in action

To make building with Gemini 2.5 Flash Image even easier, we have made significant updates to Google AI Studio’s “build mode” (with more updates to come). In the examples below, you can quickly test the model’s capabilities with custom AI-powered apps, and you can also remix those apps or bring new ideas to life with just a single prompt. When you are ready to share an app you built, you can deploy right from Google AI Studio or save the code to GitHub.

Try a prompt like “Build me an image editing app that lets a user upload an image and apply different filters” or choose one of the preset templates and remix it, all for free!

Maintain character consistency

A fundamental challenge in image generation is maintaining the appearance of a character or object across multiple prompts and edits. You can now place the same character into different environments, showcase a single product from multiple angles in new settings, or generate consistent brand assets, all while preserving the subject.

We built a template app in Google AI Studio (that you can easily customize and vibe code on top of) to demonstrate the model’s character consistency capabilities.

[Video: sequence shortened]

Beyond character consistency, the model is also excellent at adhering to visual templates. We have already seen developers explore areas like real estate listing cards, uniform employee badges, or dynamic product mockups for an entire catalog—all from a single design template.

Prompt-based image editing

Gemini 2.5 Flash Image enables targeted transformations and precise local edits with natural language. For example, the model can blur the background of an image, remove a stain from a T-shirt, remove an entire person from a photo, alter a subject’s pose, add color to a black-and-white photo, or whatever else you can conjure up with a simple prompt.

To show these capabilities in action, we built a photo editing template app in AI Studio, with both UI and prompt-based controls.
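Outside the template app, a prompt-based edit can be sketched with the same `generate_content` call that appears in the quickstart sample later in this post. The helper below is a minimal sketch, not an official API: the name `edit_image` is hypothetical, `client` is assumed to be a `google.genai.Client`, and `image` is assumed to be a PIL image, as in that sample.

```python
from typing import Any, Optional

# Preview model name taken from the quickstart sample in this post.
MODEL_ID = "gemini-2.5-flash-image-preview"


def edit_image(client: Any, image: Any, instruction: str) -> Optional[bytes]:
    """Send one image plus a natural-language instruction.

    Returns the raw bytes of the first image part in the response,
    or None if the model replied with text only.
    """
    response = client.models.generate_content(
        model=MODEL_ID,
        contents=[instruction, image],
    )
    for part in response.candidates[0].content.parts:
        # Text parts carry commentary; image parts carry bytes in inline_data.
        if getattr(part, "inline_data", None) is not None:
            return part.inline_data.data
    return None


# Example usage (requires the google-genai and Pillow packages and an API key):
#   from google import genai
#   from PIL import Image
#   edited = edit_image(genai.Client(), Image.open("/path/to/image.png"),
#                       "Blur the background")
```

Returning bytes rather than saving a file leaves it to the caller to decode the result, for instance with `Image.open(BytesIO(edited))`.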

Native world knowledge

Historically, image generation models have excelled at aesthetic images, but lacked a deep, semantic understanding of the real world. With Gemini 2.5 Flash Image, the model benefits from Gemini’s world knowledge, which unlocks new use cases.

To demonstrate this, we built a template app in Google AI Studio that turns a simple canvas into an interactive education tutor. It showcases the model’s ability to read and understand hand-drawn diagrams, help with real-world questions, and follow complex editing instructions in a single step.

[Video: example prompts and model results]

Multi-image fusion

Gemini 2.5 Flash Image can understand and merge multiple input images. You can put an object into a scene, restyle a room with a color scheme or texture, and fuse images with a single prompt.

To showcase multi-image fusion, we built a template app in Google AI Studio which lets you drag products into a new scene to quickly create a new photorealistic fused image.
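In code, fusion amounts to passing several images alongside one prompt. The sketch below assumes that `contents` accepts more than one PIL image in a single request, extending the one-image quickstart sample in this post; the helper name `fuse_images` and the two-image minimum are illustrative choices, not part of the API.

```python
from typing import Any, List, Sequence

# Preview model name taken from the quickstart sample in this post.
MODEL_ID = "gemini-2.5-flash-image-preview"


def fuse_images(client: Any, prompt: str, images: Sequence[Any]) -> List[bytes]:
    """Send a prompt with several input images and collect every image returned.

    `client` is assumed to be a `google.genai.Client` and each entry of
    `images` a PIL image. Returns the raw bytes of each image part.
    """
    if len(images) < 2:
        raise ValueError("fusion needs at least two input images")
    response = client.models.generate_content(
        model=MODEL_ID,
        contents=[prompt, *images],  # the prompt first, then every input image
    )
    return [
        part.inline_data.data
        for part in response.candidates[0].content.parts
        if getattr(part, "inline_data", None) is not None
    ]


# Example usage (requires the google-genai and Pillow packages and an API key):
#   from google import genai
#   from PIL import Image
#   results = fuse_images(
#       genai.Client(),
#       "Place the lamp on the desk in the room photo",
#       [Image.open("/path/to/room.png"), Image.open("/path/to/lamp.png")],
#   )
```

The file paths and the prompt in the usage comment are placeholders.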

[Video: sequences shortened]

Get started building

Check out our developer docs to start building with Gemini 2.5 Flash Image. The model is in preview today via the Gemini API and Google AI Studio but will be stable in the coming weeks. All of the demo apps we highlighted here were vibe coded in Google AI Studio so they can be remixed and customized with just a prompt.

OpenRouter.ai has partnered with us to help bring Gemini 2.5 Flash Image to their 3M+ developers everywhere, today. Of the 480+ models live on OpenRouter today, this is the first that can generate images.

We’re also excited to partner with fal.ai, a leading developer platform for generative media, to make Gemini 2.5 Flash Image available to the broader developer community.

All images created or edited with Gemini 2.5 Flash Image will include an invisible SynthID digital watermark, so they can be identified as AI-generated or edited.

from io import BytesIO

from google import genai
from PIL import Image

# With no arguments, the client looks for an API key in the environment.
client = genai.Client()

prompt = "Create a picture of my cat eating a nano-banana in a fancy restaurant under the gemini constellation"

# Input image to edit; replace the path with a real file.
image = Image.open("/path/to/image.png")

response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=[prompt, image],
)

# The response can contain both text parts and image parts.
for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        # Image parts carry raw bytes; decode them and save to disk.
        generated = Image.open(BytesIO(part.inline_data.data))
        generated.save("generated_image.png")

We are actively working to improve long-form text rendering, make character consistency even more reliable, and improve the factual representation of fine details in images. Please continue to send us feedback in our developer forum or on X.

We can’t wait to see what you build with Gemini 2.5 Flash Image!


