Coming soon
Google positions Gemini Omni as a model family where reasoning meets generation—combining references like images, clips, text, and (over time) more audio so you can steer outputs from what you already have. The first variant highlighted in public materials is Gemini Omni Flash, with conversational editing and stronger scene coherence as headline themes.
When Omni lands in Studio AI, we expect a familiar loop: describe the shot or attach references, tune generator settings, then export—similar to our other video generators.

You would combine prompts with optional reference images, clips, or audio (as supported) so the model can lock onto characters, motion, or mood.

Fine-tune your output by adjusting the available settings to match your creative vision and project requirements

You would preview results, refine instructions across turns, then download or iterate—especially important for conversational edits.
These ideas follow how Google frames Omni today—your Studio AI workflow may differ once integration ships.
Public materials emphasize editing video with natural language where each instruction builds on the last, aiming for consistent characters, stable physics, and a scene that remembers prior steps—useful for iterative creative direction instead of one-shot prompts only.
Google highlights reasoning about what should happen next—pairing intuitive physics cues with broader knowledge so clips can feel less arbitrary when you need narrative or explanatory motion.
The roadmap described publicly points toward mixing ingredients—still frames, text, existing footage, and selective audio references—into a single render. Availability of each input mode can vary by surface and release phase.
We will announce availability on this page when Gemini Omni is enabled for Studio AI.
Gemini Omni is coming soon to Studio AI. Watch release notes for when Omni appears in the model picker.
Answers grounded in public Google materials; Studio AI availability will follow product launch.
In Google’s announcements, Gemini Omni is a multimodal creation line that starts with high-quality video, with Gemini Omni Flash as the first broadly referenced variant. It is described as combining Gemini-style reasoning with generation, including conversational video editing and richer reference inputs over time.
Studio AI has not shipped Gemini Omni in the generator yet. We publish this landing page early so teams can align on positioning and plan workflows before the integration is live.
Materials from Google and DeepMind describe multi-turn instructions that refine environment, camera, style, or specific details without losing the thread of the original scene—similar in spirit to iterative image editing, but applied to video.
Final Studio AI controls will depend on our integration and policies. Public Omni demos reference combinations of image, text, video, and evolving audio inputs; we will document exactly what is supported when the model is enabled for MotionElements customers.
See Google’s Keyword introduction and the DeepMind Gemini Omni overview for feature framing, roadmap context, and responsible-use notes such as SynthID-style transparency on supported surfaces.
Commercial terms for Omni are defined by Google for their consumer and cloud surfaces and will be defined by MotionElements for Studio AI once the model is offered here. Review your Studio AI agreement and model-specific terms at launch.
Use whichever Studio AI generators are already available in your workspace today. When Gemini Omni arrives, we expect it to appear in the model picker with its own settings and pricing.