Introducing Gemini 2.5 Flash Image, Our State-of-the-art Image Model

Today, we’re excited to introduce Gemini 2.5 Flash Image (aka nano-banana), our state-of-the-art image generation and editing model. This update enables you to blend multiple images into a single image, maintain character consistency for rich storytelling, make targeted transformations using natural language, and use Gemini’s world knowledge to generate and edit images.

When we first launched native image generation in Gemini 2.0 Flash earlier this year, you told us you loved its low latency, cost-effectiveness, and ease of use. But you also gave us feedback that you needed higher-quality images and more powerful creative control.

This model is available right now via the Gemini API and Google AI Studio for developers and Vertex AI for enterprise. Gemini 2.5 Flash Image is priced at $30.00 per 1 million output tokens with each image being 1290 output tokens ($0.039 per image). All other modalities on input and output follow Gemini 2.5 Flash pricing.

Gemini 2.5 Flash Image in action

To make building with Gemini 2.5 Flash Image even easier, we have made significant updates to Google AI Studio’s “build mode” (with more updates to come). In the examples below, not only can you quickly test the model’s capabilities with custom AI powered apps, but you can remix them or bring ideas to life with just a single prompt. When you are ready to share an app you built, you can deploy right from Google AI Studio or save the code to GitHub.

Try a prompt like “Build me an image editing app that lets a user upload an image and apply different filters” or choose one of the preset templates and remix it, all for free!

Maintain character consistency

A fundamental challenge in image generation is maintaining the appearance of a character or object across multiple prompts and edits. You can now place the same character into different environments, showcase a single product from multiple angles in new settings, or generate consistent brand assets, all while preserving the subject.

We built a template app in Google AI Studio (that you can easily customize and vibe code on top of) to demonstrate the model’s character consistency capabilities.

Sorry, your browser doesn’t support playback for this video

(sequence shortened)

Beyond character consistency, the model is also excellent at adhering to visual templates. We have already seen developers explore areas like real estate listing cards, uniform employee badges, or dynamic product mockups for an entire catalog—all from a single design template.

gemini-2-5-image-editing-character-consistency

Prompt based image editing

Gemini 2.5 Flash Image enables targeted transformation and precise local edits with natural language. For example, the model can blur the background of an image, remove a stain in a t-shirt, remove an entire person from a photo, alter a subject’s pose, add color to a black and white photo, or whatever else you can conjure up with a simple prompt.

To show these capabilities in action, we built a photo editing template app in AI Studio, with both UI and prompt-based controls.

gemini-2-5-flash-prompt-based-image-editing

Native world knowledge

Historically, image generation models have excelled at aesthetic images, but lacked a deep, semantic understanding of the real world. With Gemini 2.5 Flash Image, the model benefits from Gemini’s world knowledge, which unlocks new use cases.

To demonstrate this, we built a template app in Google AI Studio that turns a simple canvas into an interactive education tutor. It showcases the model’s ability to read and understand hand-drawn diagrams, help with real world questions, and follow complex editing instructions in a single step.

Sorry, your browser doesn’t support playback for this video

(Example prompts and model results)

Multi-image fusion

Gemini 2.5 Flash Image can understand and merge multiple input images. You can put an object into a scene, restyle a room with a color scheme or texture, and fuse images with a single prompt.

To showcase multi-image fusion, we built a template app in Google AI Studio which lets you drag products into a new scene to quickly create a new photorealistic fused image.

Sorry, your browser doesn’t support playback for this video

(Sequences shortened)

Get started building

Check out our developer docs to start building with Gemini 2.5 Flash Image. The model is in preview today via the Gemini API and Google AI Studio but will be stable in the coming weeks. All of the demo apps we highlighted here were vibe coded in Google AI Studio so they can be remixed and customized with just a prompt.

OpenRouter.ai has partnered with us to help bring Gemini 2.5 Flash Image to their 3M+ developers everywhere, today. This is the first model on OpenRouter – of the 480+ live today –that can generate images.

We’re also excited to partner with fal.ai, a leading developer platform for generative media, to make Gemini 2.5 Flash Image available to the broader developer community.

All images created or edited with Gemini 2.5 Flash Image will include an invisible SynthID digital watermark, so they can be identified as AI-generated or edited.

from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client()

prompt = “Create a picture of my cat eating a nano-banana in a fancy restaurant under the gemini constellation”

image = Image.open(‘/path/to/image.png’)

response = client.models.generate_content(
model=”gemini-2.5-flash-image-preview”,
contents=[prompt, image],
)

for part in response.candidates[0].content.parts:
if part.text is not None:
print(part.text)
elif part.inline_data is not None:
image = Image.open(BytesIO(part.inline_data.data))
image.save(“generated_image.png”)

Python

We are actively working to improve long-form text rendering, even more reliable character consistency, and factual representation like fine details in images. Please continue to send us feedback in our developer forum or on X.

We can’t wait to see what you build with Gemini 2.5 Flash Image!

Source link

What's Hot

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions – Takara TLDR

AI Systems Can Be Fooled by Fake Dates, Giving Newer Content Unfair Visibility

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety – Takara TLDR

Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

10 Nano Banana AI image styles you can create with Gemini, reveals Google

SeedDream 4.0: The viral AI image model rivaling Gemini Nano Banana

Nano Banana AI daily limit: How many images can you generate with Google Gemini; full guide to free, Pro and Ultra access |

The Rubin Names 2025 Art Prize, Research and Art Projects Grants

Kochi-Muziris Biennial Announces 66 Artists for December Exhibition

Instagram Launches ‘Rings’ Awards for Creators—With KAWS as a Judge

Museums Prepare to Close Their Doors as Government Shutdown Continues

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions – Takara TLDR

AI Systems Can Be Fooled by Fake Dates, Giving Newer Content Unfair Visibility

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety – Takara TLDR

What's Hot

Introducing Gemini 2.5 Flash Image, our state-of-the-art image model

Gemini 2.5 Flash Image in action

Maintain character consistency

Prompt based image editing

Native world knowledge

Multi-image fusion

Get started building

Related Posts

Subscribe to Updates