The AI Research Deep Dive

LongLive: Real-time Interactive Long Video Generation


Listen Later

Arxiv: https://arxiv.org/abs/2509.22622

This episode of "The AI Research Deep Dive" explores LongLive, a paper from NVIDIA and MIT that aims to transform video generation from a slow, offline process into a real-time, interactive creative tool. The host explains how LongLive allows a user to direct a video as it's being generated, seamlessly changing the prompt mid-scene without jarring jump-cuts. Listeners will learn about the paper's three key innovations: a "KV-recache" mechanism for smooth, instant reactions to new instructions; a "Streaming Long Tuning" method that teaches the model to maintain quality over minute-long videos; and a clever attention mechanism that delivers real-time speed. The episode covers the stunning results, where LongLive runs over 40 times faster than competing models while achieving state-of-the-art quality, offering a blueprint for the future of collaborative, live AI content creation.

...more
View all episodesView all episodes
Download on the App Store

The AI Research Deep DiveBy The AI Research Deep Dive