Google Ships Nano Banana 2 Lite and Gemini Omni Flash — Fastest AI Image + Video Pipeline Yet
On June 30, 2026, Google shipped two new generative-AI models simultaneously: Nano Banana 2 Lite, its fastest and cheapest image-generation model, and Gemini Omni Flash, a video-generation and conversational-editing model now open to developers for the first time. Together they form a text-to-video pipeline that runs entirely within the Gemini API — no third-party model hops required.
Nano Banana 2 Lite: The Price-Speed Leader
The original Nano Banana (gemini-2.5-flash-image) already undercut most alternatives on cost. Nano Banana 2 Lite cuts that further to $0.034 per 1,000-pixel image, generating results in roughly four seconds. At production scale — thousands of image calls per day — that pricing difference becomes significant. The model is available via Google AI Studio, the Gemini API, and the Gemini Enterprise Agent Platform, and is actively rolling out to consumer surfaces: AI Mode in Search, the Gemini app, NotebookLM, Google Photos, and Google Flow.
Every output automatically embeds SynthID watermarking, Google's invisible provenance signal for AI-generated imagery.
Gemini Omni Flash: Developer Video Generation, Finally
Until now, Google's Veo-based video generation was mostly locked to consumer products. Gemini Omni Flash is priced at $0.10 per second of video output, accepts text, image, and video inputs, and currently generates clips up to 10 seconds, with longer durations coming soon. Its standout feature is conversational editing: instead of regenerating from scratch when a result is close but not right, you can give follow-up instructions and the model refines the existing clip.
Google says Omni Flash matches Veo 3.1 Fast on price-performance, positioning it as the practical entry point before the heavier Veo 3.1 Standard tier.
The Combined Pipeline
Google is specifically promoting a chained workflow: generate a still with Nano Banana 2 Lite, then pass it as a reference frame to Gemini Omni Flash to animate it. The image feeds directly into the video model's context — the two integrate natively inside the Gemini API, making the end-to-end text → image → animated video flow a single-provider operation. For developers building marketing automation, social content pipelines, or interactive storytelling tools, this closes a gap that previously required stitching together separate vendors.
Both models are available today in Google AI Studio and the Gemini API. The combined release signals Google's intent to own the full generative-media stack at every price tier, from rapid prototyping to production-scale pipelines.