Reference control
Keep the subject recognizable
A reference image gives Gemini Omni a stable product, character, layout, or brand frame before motion is added.
Image to video
Create Gemini Omni image-to-video drafts from product photos, character frames, layouts, or visual references with motion-focused prompts.

Best for
Product reveals and visual continuity
Input
One clear reference image plus a motion prompt
Output
Short video drafts for ads, pages, and social
Reference control
A reference image gives Gemini Omni a stable product, character, layout, or brand frame before motion is added.
Motion prompt
Good prompts separate subject motion, camera travel, background behavior, on-screen text, and the ending frame.
Commercial use
Image-to-video pages capture high purchase intent from users making product explainers, ecommerce reels, and ad concepts.
Image-to-video workflow
01
Choose an image with a single primary subject, enough lighting, and minimal unwanted background clutter.
02
Write exactly what should move: product rotation, character gesture, camera push-in, light sweep, or reveal.
03
Tell the model which details must stay stable, such as logo, material, face, packaging, UI layout, or color.
04
Iterate on the prompt if the clip loses framing, adds unwanted motion, or misses the intended final frame.
Use cases
Text-only prompts are flexible. Image-to-video is better when users need the generated clip to preserve a real object, character, interface, or visual style.
Animate a hero product shot into a short reveal for landing pages, ecommerce galleries, and paid social.
Use a character frame to keep appearance and style consistent while describing action and camera movement.
Start from a dashboard or app screenshot and turn it into a walkthrough concept for launches or onboarding.
FAQ
It is a workflow where a still image acts as a visual reference and the prompt describes the motion, camera move, mood, text, and final frame.
Clear product photos, character frames, interface screenshots, storyboard panels, and brand visuals work best because they give the model a stable starting point.
Start with one strong reference when consistency matters. Use additional references only when each one has a clear role in the scene.
Describe what should move, what should stay consistent, how the camera travels, whether text appears on screen, and how the clip should end.
Yes. It is especially useful for product reveals, ecommerce hero clips, landing page motion, feature demos, and ad variants.
Yes. Pick a vertical aspect ratio and write the prompt around framing, safe text zones, hook timing, and the final callout.
Image reference workflow