
The study identifies the Small Model Learnability Gap: smaller models benefit more from short, simple reasoning chains than from the long chain-of-thought traces produced by stronger models. Mix Distillation, which blends the two kinds of reasoning data, improves their performance by balancing reasoning complexity.
https://arxiv.org/abs/2502.12143
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
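Below is a minimal, illustrative sketch of the data-mixing idea summarized above: building a small model's fine-tuning set that combines a modest share of long chain-of-thought examples with mostly short ones. The ratio, field names, and helper function are assumptions for illustration, not the paper's actual recipe.

```python
import random

# Illustrative sketch of the Mix Distillation idea: blend a small fraction of
# long chain-of-thought (CoT) examples with mostly short ones when assembling
# a fine-tuning set for a small model. Ratios and field names are assumptions.

long_cot = [
    {"question": "What is 2 + 3 * 4?",
     "answer": "First evaluate 3 * 4 = 12. Then add 2 to get 14. The answer is 14."},
]
short_cot = [
    {"question": "What is 2 + 3 * 4?",
     "answer": "3 * 4 = 12; 2 + 12 = 14."},
]

def mix_distillation_set(long_cot, short_cot, long_ratio=0.2, n=1000, seed=0):
    """Sample n training examples with roughly `long_ratio` long-CoT items."""
    rng = random.Random(seed)
    n_long = int(n * long_ratio)
    mixed = ([rng.choice(long_cot) for _ in range(n_long)] +
             [rng.choice(short_cot) for _ in range(n - n_long)])
    rng.shuffle(mixed)
    return mixed

train_set = mix_distillation_set(long_cot, short_cot)
print(len(train_set), "examples,",
      sum("First" in ex["answer"] for ex in train_set), "long-CoT")
```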