Arxiv Papers

[QA] Small Models Struggle to Learn from Strong Reasoners





The study identifies the Small Model Learnability Gap: smaller models learn more effectively from shorter, simpler reasoning chains than from the long chains of thought produced by strong reasoners. The proposed Mix Distillation improves their performance by balancing reasoning complexity in the training data.


https://arxiv.org/abs/2502.12143


YouTube: https://www.youtube.com/@ArxivPapers


TikTok: https://www.tiktok.com/@arxiv_papers


Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016


Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers


Arxiv Papers by Igor Melnyk

Rated 5.0 (3 ratings)


More shows like Arxiv Papers

FT News Briefing by Financial Times (698 Listeners)

Google DeepMind: The Podcast by Hannah Fry (197 Listeners)

Last Week in AI by Skynet Today (288 Listeners)

Latent Space: The AI Engineer Podcast by swyx + Alessio (77 Listeners)

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore (448 Listeners)