At GTC 2026, Runway demonstrated an unnamed model that generates HD video in under 100 milliseconds on Nvidia's Vera Rubin architecture, behaving more like a game engine than a traditional diffusion model.
Pika's 2.5 engine introduced physics-based interaction modeling and integrated sound effects, positioning the platform as the fastest and most physically realistic option for short-form social video.
Kuaishou launched Kling 3.0 on February 4, featuring native 4K resolution at 60fps generated at the pixel level, multi-shot storyboarding with up to six camera cuts per generation, and native audio in five languages with dialect support.
Kuaishou announced Kling 3.0 on January 31, featuring native 4K 60fps output, up to six camera cuts per generation with visual consistency, and synchronized audio-visual output in a single pass.
Google DeepMind released Veo 3.1 on January 13, introducing professional 4K upscaling, native 9:16 vertical output, and Scene Extension technology for narratives exceeding 60 seconds.
ByteDance officially released Seedance 1.5 Pro on December 16, introducing joint audio-visual generation that creates video and audio simultaneously from text and image prompts.
Kuaishou unveiled Kling O1, positioning it as the industry's first unified multimodal creation tool that consolidates text, video, image, and subject inputs into a single generation and editing engine.