
Sign up to save your podcasts
Or
August 24th, 2023 - Revolutionizing Pixels and Prose: Breakthroughs in Diffusion Models, Multimodal Language Learning, and Media Editing

- Scalable Diffusion Models with Transformers
- BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
- StableVideo: Text-driven Consistency-aware Diffusion Video Editing
- Exploiting Diffusion Prior for Real-World Image Super-Resolution
Support the show
...more