ML Cult

August 24th, 2023 - Revolutionizing Pixels and Prose: Breakthroughs in Diffusion Models, Multimodal Language Learning, and Media Editing


  • Scalable Diffusion Models with Transformers
  • BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
  • StableVideo: Text-driven Consistency-aware Diffusion Video Editing
  • Exploiting Diffusion Prior for Real-World Image Super-Resolution


ML Cult, by Marcus Edel