
Google unveils Gemini Omni: a multi-modal AI world model for video creation and editing
Google has introduced Gemini Omni, a new family of multi-modal AI world models led by Gemini Omni Flash. The models accept text, audio, images, and video as input, and generate and edit realistic videos with improved physics. They support avatars and interactive editing through conversation, and their outputs are watermarked with SynthID so they can be verified as AI-generated. Gemini Omni will roll out first to paid Google AI plans before expanding to YouTube Shorts and YouTube Create.

