Google has unveiled Gemini Omni, a new artificial intelligence model capable of creating and editing content from almost any type of input, including text, images, audio and video, in a move that could position the company to fill the void left by the discontinuation of OpenAI’s Sora platform.
The company said the first model in the new Omni family, Gemini Omni Flash, is rolling out globally through the Gemini app, Google Flow, YouTube Shorts and YouTube Create, enabling users to generate and refine clips using natural language prompts rather than traditional editing software.
“Omni is our new model that can create anything from any input – starting with video,” wrote Koray Kavukcuoglu, CTO of Google DeepMind and Google’s Chief AI Architect, in a blog post announcing the launch.
Google described Omni as the next step in Gemini’s evolution from a natively multimodal AI system, combining its reasoning abilities with creative generation.
Users, who have been feeling the vacuum after Sora’s discontinuation, can now mix text, images, video and voice inputs to create cohesive outputs grounded in Gemini’s “real-world knowledge,” while also editing content conversationally over multiple prompts.
Sora’s Exit, Omni’s Entry
The launch arrives at a time when the AI-generated media market is shifting. OpenAI’s Sora generated significant attention for its ability to create photorealistic clips from text prompts, but its web and app experiences were discontinued in April, with API access scheduled to end in September this year.
That transition has left creators, developers and businesses looking for scalable alternatives for AI-assisted content production.
Google appears to be targeting that demand with a product focused not just on visual quality, but also practical usability.
Unlike experimental models designed mainly for demonstrations, Gemini Omni is built to support everyday creative and commercial workflows.
Google said users can transform static images into moving scenes, upload references such as sketches or character images and refine outputs through conversation while preserving scene continuity and realistic motion.
The company said Omni can also reason about “what should happen next” in a scene, combining Gemini’s broader understanding of science, history and culture with an “improved intuitive understanding of forces like gravity, kinetic energy and fluid dynamics”.
Digital Avatars and Safeguards
Google is also introducing Avatars, allowing users to create digital versions of themselves using their own voice and likeness for generated clips.
To address concerns around AI-generated media, all content created with Omni will include an imperceptible SynthID watermark, enabling verification through Gemini services and Google Search.
Gemini Omni Flash is now available to Google AI Plus, Pro and Ultra subscribers, while also launching free on YouTube Shorts and YouTube Create.
The tech giant said developers and enterprise customers will gain API access in the coming weeks.
However, Gemini Omni could gain an edge over rivals that drew attention following Sora’s exit. While competing platforms gained some popularity after Sora’s discontinuation, users have often reported inconsistent motion, unnatural backgrounds and heavier editing requirements.
Google is positioning Omni as a more reliable, conversational tool that can generate clips from mixed inputs, while maintaining scene continuity, realistic physics and faster workflows through direct integration with Google’s AI and creator platforms.
