Gemini Omni, the ‘create anything’ model, starts today with lifelike video

Google has unveiled Gemini Omni, a new family of generative models designed to “create anything,” and you can use it today to create surprisingly realistic videos.

In recent years, Google has been working on a “world model” that can maintain a cohesive, grounded world. The company explored the idea through its Genie model, which generates interactive, video-game-like experiences from user prompts. Google has also long offered the Veo and Nano Banana models, which provide capable video and image creation and editing from text and image inputs.

As part of I/O 2026, Google revealed Gemini Omni, a new model that leverages a similar level of multimodal understanding grounded in reality. While Omni currently generates only video, Google presents it as being designed to “create anything from any input.” In practice, that means combining text, images, video, and audio (initially limited to speech samples) into a single output video. After generation, you can further refine your video in subsequent turns.

Google’s initial demos for Omni are quite impressive, showing how Gemini understands each of the elements in the final video. The rolling marble video is a great example, with believable physics for the ball and convincing sound effects for each bounce and the bell ring.

Another demo presents a claymation-style video explainer of how protein folding works.

Unlike the Genie model, which is still available only to those paying for an AI Ultra subscription, Google is positioning the Gemini Omni series to be broadly accessible. The first model in the series, Gemini Omni Flash, is available now to all subscribers of AI Plus and higher. If you want to share your creations with the world, Gemini Omni will also be available for free through YouTube Shorts and YouTube Create later this week. A higher-level model, “Omni Pro,” was also teased, with details coming soon.

Given the degree of realism on display, the company is taking several measures to ensure videos are generated responsibly. Taking a cue from OpenAI’s recently discontinued Sora app, Gemini Omni will allow you to create a bespoke “Avatar” of yourself to be featured in the videos you create. Separately, Omni will not be able to edit audio and speech in videos until Google can “bring this capability to users responsibly.” As another safety measure, all videos created by Gemini Omni will be watermarked with SynthID so they can be readily identified as AI generated.

Author

Kyle Bradshaw

Kyle is an author and researcher for 9to5Google, with special interests in Made by Google products, Fuchsia, and uncovering new features.

Got a tip or want to chat? Email: Kyle@9to5mac.com