Super Data Science: ML & AI Podcast with Jon Krohn

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

06.30.2023 - By Jon Krohn

Join Jon as he walks listeners through SpQR, a lossless LLM weight compression technique built on quantization. In this week's episode, Jon covers the four steps behind the method.

Additional materials: www.superdatascience.com/692

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
