Google Omni Is Nano Banana for Video

Google Omni is here, and I got a few days of early access before Google I/O. In this one, I test Google’s new Omni video model across text-to-video, image-to-video, video editing, style transfer, clip extension, avatar/cameo mode, lip-sync repair, camera-angle changes, POV shifts, and full location changes. Google is pitching Omni as the next step after Nano Banana: a multimodal Gemini model that can create video from images, text, video, audio references, and natural language instructions. The interesting part is not just generation. It is conversational video editing, multi-turn refinement, consistent characters, scene memory, style changes, and using Gemini’s world knowledge to make complex ideas visual. My early takeaway: Gemini Omni Flash is not really a Seedance killer yet. If anything, it feels more like the start of Nano Banana for video — less about one perfect text-to-video clip, and more about using AI to remix, repair, restyle, extend, and reimagine video through conversation. It is early. It is definitely not perfect. But if Omni develops the way Nano Banana did for image generation and editing, this could become a much bigger deal than a normal AI video model launch