The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663



Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick off our coverage of NeurIPS 2023. In our conversation with Markus, we cover his accepted papers at the conference, along with other work presented by Qualcomm AI Research scientists. Markus’ first paper, “Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing,” examines the activation quantization problems introduced by the attention mechanism and how to solve them. We also discuss “Pruning vs Quantization: Which is Better?”, which compares how effective the two methods are at compressing model weights. The other papers we discuss cover using scalarization in multitask and multidomain learning to improve training and inference, using diffusion models to model sequences of states and actions, applying geometric algebra with equivariance to transformers, and deductively verifying the chain-of-thought reasoning performed by LLMs.


The complete show notes for this episode can be found at twimlai.com/go/663.


The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) By Sam Charrington

4.7

416 ratings

