Microsoft Research Podcast

Abstracts: NeurIPS 2024 with Weizhu Chen


Listen Later

Next-token prediction trains a language model on all tokens in a sequence. VP Weizhu Chen discusses his team’s 2024 NeurIPS paper on how distinguishing between useful and “noisy” tokens in pretraining can improve token efficiency and model performance.

Read the paper

Get the code

...more
View all episodesView all episodes
Download on the App Store

Microsoft Research PodcastBy Researchers across the Microsoft research community

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

80 ratings


More shows like Microsoft Research Podcast

View all
The Daily by The New York Times

The Daily

113,121 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

551 Listeners

Hard Fork by The New York Times

Hard Fork

5,576 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

150 Listeners