December 06, 2024

Abstracts: NeurIPS 2024 with Weizhu Chen

8 minutes

Next-token prediction trains a language model on all tokens in a sequence. VP Weizhu Chen discusses his team’s 2024 NeurIPS paper on how distinguishing between useful and “noisy” tokens in pretraining can improve token efficiency and model performance.

Read the paper

Get the code

Microsoft Research Podcast

By Researchers across the Microsoft research community

4.8

8080 ratings
