Google has unveiled Gemini Omni, a new family of generative models designed to “create anything,” and you can use it today to create surprisingly realistic videos.
Something Google has been working on in recent years is a “world model” that can maintain a cohesive, grounded world. The company explored the idea through its Genie model, which generates interactive video-game-esque experiences based on user prompts. Google has also long offered the Veo and Nano Banana models that bring capable video and image creation/editing via text and image inputs.
As part of I/O 2026, Google revealed Gemini Omni, a new model which leverages a similar level of multimodal understanding grounded in reality. While Omni is currently only designed to generate video content, it is presented as being designed to “create anything from any input.” This means bringing together text, images, video, and audio (initially limited to speech samples) to create a unified output video. After generation, you can further refine your video in subsequent turns.
Google’s initial demos for Omni are quite impressive, showing how Gemini understands each of the elements in the final video. The rolling marble video is a great example, with believable physics for the ball and convincing sound effects for each bounce and the bell ring.
Another demo presents a claymation-style video explainer of how protein folding works.
Unlike the Genie model, which is still exclusively available to those paying for an AI Ultra subscription, Google is positioning the Gemini Omni series to be broadly accessible. The first model in the series, Gemini Omni Flash, is available now to all subscribers of AI Plus and higher. Or if you want to share your creations with the world, Gemini Omni will be available for free through YouTube Shorts and YouTube Create later this week. A higher-level model, “Omni Pro,” was also teased, with details coming soon.
Given the significant sense of realism presented, the company is taking several measures to ensure videos are generated responsibly. Taking a cue from OpenAI’s recently discontinued Sora app, Gemini Omni will allow you to create a bespoke “Avatar” of yourself to be featured in the videos you create. Otherwise, Omni will not initially be able to edit audio and speech in videos until Google can “bring this capability to users responsibly.” As another safety measure, all videos created by Gemini Omni will be watermarked with SynthID to be readily identified as AI generated.
FTC: We use income earning auto affiliate links. More.
Comments