Microsoft Research Podcast

Abstracts: NeurIPS 2024 with Weizhu Chen


Listen Later

Next-token prediction trains a language model on all tokens in a sequence. VP Weizhu Chen discusses his team’s 2024 NeurIPS paper on how distinguishing between useful and “noisy” tokens in pretraining can improve token efficiency and model performance.

Read the paper

Get the code

...more
View all episodesView all episodes
Download on the App Store

Microsoft Research PodcastBy Researchers across the Microsoft research community

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

80 ratings


More shows like Microsoft Research Podcast

View all
Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

145 Listeners

The Daily by The New York Times

The Daily

112,952 Listeners

Practical AI by Practical AI LLC

Practical AI

200 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

201 Listeners

Think Fast Talk Smart: Communication Techniques by Matt Abrahams, Think Fast Talk Smart

Think Fast Talk Smart: Communication Techniques

821 Listeners

Last Week in AI by Skynet Today

Last Week in AI

309 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

507 Listeners

The Rest Is History by Goalhanger

The Rest Is History

15,823 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

140 Listeners

Unsupervised Learning with Jacob Effron by by Redpoint Ventures

Unsupervised Learning with Jacob Effron

51 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

99 Listeners