Introducing Gemini Omni

Gemini Omni: Speak it. See it. Share it.

Gemini Omni makes creating videos as easy as having a conversation. Think of it as Nano Banana for video. Built on the same intelligence that powers Gemini 3.5 and Gemini 3.5 Flash, it blends text, images, and video so your ideas come to life in motion.

Try Omni Videos

Gemini Omni demos

The main video plays directly on this page. To run the same prompts yourself, open Google Flow or the Gemini app and select Gemini Omni Flash. For the full reel, see the official Google blog.

Gemini Omni capability mock clips

The clips below are locally generated mock videos, not official Gemini Omni output. They illustrate three typical capabilities of Gemini Omni Flash. With a Google AI subscription that includes Gemini Omni Flash, you can replace the files in assets/videos/ with real outputs from Google Flow.

Create anything.
From everything.

Blend text, images, and video to bring your ideas to life in motion. The Gemini Omni model turns any reference — image, text, video, or audio — into a single, cohesive output, and works alongside Google Flow, Google Pics, and the experimental Google Omnibox integrations.

Describe a scene in natural language. Omni draws on Gemini's world knowledge to reason about what should happen next — far beyond pattern matching.

From concept to clip with Gemini Omni.

Gemini Omni is your creative partner for multimodal content creation, sharing the same foundation as Gemini Spark and Google Spark. It combines Gemini's core intelligence with advanced generative media — including image-to-video and video-to-video AI editing — and works in the same agentic surface as Google Antigravity.

Learn more
  • Image-to-video and video-to-video editing
  • Multimodal references into one cohesive clip
  • Iterative refinement via natural language

Keep the soul of the shot.

Swap backgrounds, change wardrobes, or transfer styles while preserving details. Tell Gemini Omni what to change in footage you've already shot: characters stay consistent, physics hold up, and the scene remembers what came before. The same reasoning that powers Gemini Omni Flash keeps every turn coherent.

Try it out
  • Swap the background, keep the subject
  • Transfer styles without losing composition
  • Multi-turn edits with scene memory

Easy editing with Gemini Omni.

Just tell Gemini Omni what to fix: swap characters, adjust lighting, stabilize the video, or modify the background. With a sharper intuition for gravity, kinetic energy, and fluid dynamics (the same physics work showcased at Google I/O 2026), the results feel more real than in earlier Omni Flash previews.

  • "Make the sculpture out of bubbles"
  • "Apartment lights turn on in sync with the music"
  • "Claymation explainer of protein folding"

Be the star of your own show with Gemini Omni.

Create videos that look and sound like you with an AI Avatar — no need to upload your photo every time. Avatars are optional, and only you can use your own avatar to generate content through the Gemini Omni API.

Every video the Google Omni Model generates is embedded with the SynthID watermark and can be verified in the Gemini app, Chrome (right from the Google Omnibox), and Google Search.

Say hello to Gemini Omni

We're constantly improving the model to make creating easier and more intuitive. Gemini Omni replaces Veo in the Gemini app, sits next to Gemini 3.5 and Gemini 3.5 Flash, and ships alongside the agentic Antigravity 2.0 update, covering all your AI video generation and editing needs.

Gemini Omni Flash

A multimodal AI video generation and editing model that replaces the previous Gemini Veo 3.1. The same Gemini Omni Flash engine also powers the Gemini Omni API for developers.

  • Create 10-second videos
  • Native audio generation
  • Turn photos into a video (up to 5)
  • Video-to-video editing (new)
  • Multi-turn editing (new)
  • Avatar (new)

Google AI subscription required. Features vary by tier and geography. 18+.

How to get Gemini Omni

  • Google AI Plus / Pro / Ultra subscribers: Gemini app, Google Flow
  • YouTube Shorts and YouTube Create: free starting this week
  • Developer and enterprise Gemini Omni API: rolling out in the coming weeks
  • Cross-product surfaces: Google Pics, Google Omnibox, and the agentic Google Antigravity platform

Frequently asked questions

What is Gemini Omni?

Gemini Omni is a model that understands the world around you, letting you animate photos or create video from any input. Built on Gemini's world understanding and native multimodality, it produces outputs that reflect real-world logic and can be shaped step by step through natural conversation. With a single prompt, you can be an AI video editor.

What is Gemini Omni Flash?

Gemini Omni Flash is the first model in the Gemini Omni family: a fast, multimodal video model that powers the video features inside the Gemini app, Google Flow, and the upcoming developer API. It's the engine behind the in-app "create a video" flow on Google AI plans.

Is there a Gemini Omni API?

Yes. The Gemini Omni API is rolling out for developers and enterprise customers in the coming weeks. It exposes the same Google Omni Model used in Google Flow and YouTube Shorts, with text-, image-, video-, and audio-reference inputs.
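
If you are planning an integration before the API arrives, here is a minimal Python sketch of a pre-flight check built only from limits quoted on this page (10-second clips, up to 5 reference photos, four reference kinds). The API's schema is not documented here, so `ClipRequest`, `Reference`, and every field name below are hypothetical placeholders, not real Gemini Omni API types, and the real limits may differ.

```python
from dataclasses import dataclass, field

# Limits quoted elsewhere on this page; the real API limits may differ.
MAX_CLIP_SECONDS = 10
MAX_REFERENCE_PHOTOS = 5
REFERENCE_KINDS = {"text", "image", "video", "audio"}


@dataclass
class Reference:
    """Hypothetical reference input (not an official type)."""
    kind: str    # expected to be one of REFERENCE_KINDS
    source: str  # prompt text or a local file path


@dataclass
class ClipRequest:
    """Hypothetical request shape, NOT the Gemini Omni API schema."""
    prompt: str
    seconds: int = MAX_CLIP_SECONDS
    references: list[Reference] = field(default_factory=list)

    def problems(self) -> list[str]:
        """Return readable issues; an empty list means nothing was flagged."""
        issues = []
        if not self.prompt.strip():
            issues.append("prompt is empty")
        if not 0 < self.seconds <= MAX_CLIP_SECONDS:
            issues.append(f"seconds must be between 1 and {MAX_CLIP_SECONDS}")
        unknown = sorted({r.kind for r in self.references} - REFERENCE_KINDS)
        if unknown:
            issues.append(f"unknown reference kinds: {', '.join(unknown)}")
        photos = sum(1 for r in self.references if r.kind == "image")
        if photos > MAX_REFERENCE_PHOTOS:
            issues.append(f"at most {MAX_REFERENCE_PHOTOS} reference photos")
        return issues


if __name__ == "__main__":
    request = ClipRequest(
        prompt="a violinist on a quiet rooftop at golden hour",
        references=[Reference("image", f"photo_{i}.jpg") for i in range(6)],
    )
    print(request.problems())
```

Checking locally like this keeps obviously invalid requests from ever leaving your code; swap in the official limits and field names once the API documentation is published.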

How does Gemini Omni compare to Gemini 3.5 and Gemini 3.5 Flash?

Gemini 3.5 and Gemini 3.5 Flash are the general-purpose reasoning models. Gemini Omni is a specialized video model built on top of that intelligence — think of it as the multimodal media counterpart, the same way Gemini Spark handles personal AI agents.

Who can access Gemini Omni?

Users 18+ with a Google AI Plus, Pro, or Ultra plan can use it in every language and market where the Gemini app is available. Some features such as avatars and video-to-video editing may be restricted in certain countries — see the help center for details.

What happened to Veo?

Gemini Omni is our latest video generation and editing model, replacing Veo in the Gemini app to make our tools more helpful and creative for users.

What is an AI avatar?

An avatar is a digital version of yourself that lets you safely generate videos that look and sound like you. It's completely optional, and only you can use your avatar to create videos.

Can I trigger Gemini Omni from the Google Omnibox?

On supported builds of Chrome you can launch Gemini directly from the Google Omnibox address bar and ask it to generate or verify a video. The same SynthID check runs across Chrome, Search, and the Gemini app.

Is Gemini Omni related to Google Antigravity or Antigravity 2.0?

They're complementary. Google Antigravity is Google's agentic development platform; Antigravity 2.0 includes higher rate limits for agent models. Gemini Omni can be invoked from those agents to generate and edit video as a step in a larger workflow.

Is Gemini Omni the same as Libernovo Omni Pro?

No — those are unrelated products. Libernovo Omni Pro is third-party hardware that happens to share the "Omni" name. Gemini Omni is Google's multimodal video model accessed through the Gemini app, Google Flow, and the Gemini Omni API.

How does Gemini approach safety?

Consistent with our AI principles, all videos generated in the Gemini app are embedded with SynthID. You can also upload a file and ask whether it was generated by Google AI — Gemini will check for SynthID and use its own reasoning to respond.

What is "Google Omni"? (sometimes typed Google Ommi, Gimini Omni, or Google Moni)

"Google Omni" is what people call Gemini Omni in shorthand — Google's video AI. The official name is Gemini Omni; the engine inside is the Google Omni Model; the version that does the work day-to-day inside the app and Flow is Gemini Omni Flash. Misspellings like gimini omni, google ommi, google moni, google omnia, and google ovni all point to the same product — same thing, just a few keys off.

How long is a Gemini Omni video, and what format?

About 10 seconds, roughly a TikTok intro — a normal MP4 with audio. There's an invisible marker called SynthID baked in, so the clip can be verified later as AI-made. For longer pieces, the simplest path is making a few 10-second clips in Google Flow and chaining them — Flow keeps characters and lighting consistent so the cuts don't feel jumpy.

What's the difference between using Gemini Omni in the app and the Gemini Omni API?

Same model, two doors. The app is the front door: open a chat, type, get a video. The Gemini Omni API (sometimes called the Google Omni API) is the back door, for developers who want Omni inside their own product; access is through Google AI Studio. If you're not writing code, you can skip this one, since the app does the same job.

Are "Google Flow", "google flow omni", and "google omni flow" the same thing?

Yes — three nicknames for one product. The real name is Google Flow, the place Gemini Omni lives when you want to see what you're making — storyboards, references, several clips on a timeline. The other two are just how people end up typing the combo into search.

Field notes

Short, practical reads on Gemini Omni — how to use it, where it lives in Google's stack, and how it stacks up against other AI video tools.

How to use Gemini Omni

Gemini Omni is dead simple to use — it's the videographer friend you can ask for things in plain English, and a short clip comes back. No timelines, no plug-ins, no render bars to stare at.

The easiest place to start is the Gemini app, on your phone or in your browser. Open a new chat, pick Gemini Omni Flash from the model dropdown, and type what you want to see — say, "a 10-second clip of a violinist on a quiet rooftop at golden hour." About a minute later, the video shows up in the chat. That's the whole thing.

If you'd rather work visually, open Google Flow at labs.google/flow. Flow is Omni's visual workspace — drop in up to five reference photos, line up shots like a storyboard, and tweak details by talking ("add rain, keep her expression"). Put another way: editing, without an editing app.

If you write code and want Omni inside your own product, that's the Gemini Omni API — find it in Google AI Studio. Same model, just plugged into your code instead of a chat window. Not a developer? Skip this paragraph.

  • Price. Any Google AI plan (Plus, Pro, Ultra) covers it. In YouTube Shorts and YouTube Create, Omni is free starting this week.
  • Length. Up to 10 seconds per clip — roughly a TikTok intro. For longer pieces, stack a few short clips in Google Flow.
  • Where it works. Anywhere the Gemini app is available. A handful of features (avatars, editing existing footage) open country by country.
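
To make the "stack a few short clips" advice concrete, here is a small Python sketch that splits a target running time into segments of at most 10 seconds, the per-clip limit quoted above. The function name `plan_clips` is made up for this example, and nothing here talks to Gemini Omni or Google Flow; it only does the arithmetic you would do before writing one prompt per segment.

```python
import math

MAX_CLIP_SECONDS = 10  # per-clip limit stated on this page


def plan_clips(total_seconds, max_clip=MAX_CLIP_SECONDS):
    """Split a target duration into (start, end) segments no longer than max_clip."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    count = math.ceil(total_seconds / max_clip)
    segments = []
    for i in range(count):
        start = i * max_clip
        # The last segment may be shorter than max_clip.
        end = min(start + max_clip, total_seconds)
        segments.append((start, end))
    return segments


if __name__ == "__main__":
    # A 45-second piece needs five clips: four full ones and a 5-second tail.
    for start, end in plan_clips(45):
        print(f"clip from {start}s to {end}s")
```

Each segment then gets its own prompt, and the clips are lined up in order on the Flow timeline.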

One last tip — worth more than any setting: talk to it the way you'd talk to a friend, not the way you'd type into a search box. "Show me a busy ramen shop at midnight, steam off the pots, neon outside the window" gets you somewhere much better than "ramen night neon video."

Where Gemini Omni shows up in Google's tools

Gemini Omni isn't tied to one app — it shows up across several Google products, each doing a different job. Knowing which one lives where saves a lot of trial and error.

  • Google Flow is the studio. If "make a video" is the goal, this is the room. Timeline, reference photos, prompt history: all in one place. People search for both "google flow omni" and "google omni flow"; same product, two folk names.
  • Google Vids is the office tool. It lives inside Google Workspace next to Docs, Slides, and Meet. More internal explainers and team how-tos than cinema — think PowerPoint that moves.
  • Google Pics is the image-side sibling, and home to Nano Banana, Google's image model. The handy bit: a still you generate in Pics can be sent straight into Omni and start moving — no re-upload, no style drift.
  • Google AI Studio and Google Labs are the workbenches. AI Studio is mostly for developers — Omni and Gemini 3.5 sit side-by-side for testing. Labs is where Google puts early experiments out for anyone to play with — Flow itself started life in Labs.
  • Google Spark and Gemini Spark are the personal assistants. Tell Spark "make a little birthday video for my dad" and Spark quietly calls Omni behind the scenes — the way you'd ring up a friend who happens to be good at editing.

What ties them together is the same brain. The video Flow renders, the clip Spark requests, the avatar in your meeting: all the same Gemini Omni model under the hood, just dressed for different work. The reference photo you saved in one app last week is usable in another app today, no setup required.

Gemini Omni next to Veo 3, Sora 2, Runway, HeyGen, Higgsfield, and Grok

Picking an AI video tool is a bit like picking a kitchen knife — they all cut, but each has something it's best at. Here's a quick map to save you a lot of exploring.

  • Veo 3 is Google's previous video model. Omni now handles the day-to-day slot in the Gemini app. Veo 3 isn't gone (enterprise customers still use it through Vertex AI); it just isn't the default anymore. If you were using Veo 3 before, opening the app today already puts you on Omni.
  • Sora 2 is OpenAI's closest match. Same broad idea — type, get a 10-to-20-second clip — and physics (water, hair, fabric) feels solid. Day to day, the difference is which world you live in: Sora 2 sits inside ChatGPT's world; Omni sits inside Google's (Docs, Photos, YouTube). Pick by where your stuff already lives.
  • Runway is the short-film veteran's pick. Think chef's knife: more knobs, more precision, more learning curve. If you already work in After Effects or Premiere, Runway feels at home. Otherwise, the button count can scare you off.
  • HeyGen does one thing very well — a person on camera reading a script in their own voice. Sales videos, corporate training, language tutorials. Omni's avatar handles the short version of this; for anything past a couple of minutes, HeyGen is the better fit.
  • Higgsfield is the cinematographer. If you can already see the shot in your head ("slow push-in, then crane up over her shoulder"), Higgsfield gives you the controls to spell that out. Most people don't need to go that deep; the ones who do, swear by it.
  • Grok is the model inside X (formerly Twitter). Its images and short clips are optimized for posts — paste straight into a tweet and go. Social-first, low-friction, not really the place for serious creative work.

In one line: Omni is for talking your way to a clip; Sora 2 is its twin in a different ecosystem; Runway is for precision editing; HeyGen is for on-camera scripts; Higgsfield is for cinematography; Grok is for posts. Most people end up using two — Omni for the first cut, plus a specialist tool for whatever specialty the project needs.

Create videos with Gemini Omni by having a conversation

Creating and editing video is now as easy as chatting — powered by Gemini Omni Flash, available in the Gemini app and Google Flow.
