Intellectually Curious

Real-Time AI Video: The AAPT Breakthrough for Live, Interactive Worlds


Listen Later

We dive into ByteDance Seed's AAPT—autoregressive adversarial post-training—that promises fast, frame-by-frame AI video for interactive experiences. Learn how a pre-trained diffusion model is converted into a causal, one-pass-per-frame generator, how KV caching and a sliding 5-second window keep latency in check, and why a three-stage training pipeline (diffusion adaptation, consistency distillation, and adversarial training with a frame-level discriminator) matters. We'll unpack student forcing versus teacher forcing, what the results say about latency, throughput, and long-horizon coherence, and what this could mean for real-time virtual worlds.


Note:  This podcast was AI-generated, and sometimes AI can make mistakes.  Please double-check any critical information.

Sponsored by Embersilk LLC

...more
View all episodesView all episodes
Download on the App Store

Intellectually CuriousBy Mike Breault