Arxiv Papers

By Igor Melnyk

Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video... more

· Science

5

33 ratings

Download on the App Store

Download on the App Store

Get it on Google Play

FAQs about Arxiv Papers:

How many episodes does Arxiv Papers have?

The podcast currently has 2,489 episodes available.

Arxiv Papers episodes:

April 21, 2025 [QA] Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

PODS decouples reinforcement learning phases by parallelizing rollouts and selectively updating, using max-variance down-sampling to enhance performance on the GSM8K benchmark compared to standard GRPO.

https://arxiv.org/abs//2504.13818

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
8min
April 21, 2025 Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning

PODS decouples reinforcement learning phases by parallelizing rollouts and selectively updating, using max-variance down-sampling to enhance performance on the GSM8K benchmark compared to standard GRPO.

https://arxiv.org/abs//2504.13818

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
8min
April 21, 2025 [QA] Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

The paper presents a method to accelerate "grokking" in neural networks by using learned embeddings from a weaker model, enabling direct generalization without delay across various tasks.

https://arxiv.org/abs//2504.13292

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
8min
April 21, 2025 Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

The paper presents a method to accelerate "grokking" in neural networks by using learned embeddings from a weaker model, enabling direct generalization without delay across various tasks.

https://arxiv.org/abs//2504.13292

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
17min
April 20, 2025 [QA] Reasoning Models Can Be Effective Without Thinking

This paper challenges the necessity of lengthy reasoning processes in LLMs, showing that simple prompting (NoThinking) can outperform traditional methods in various reasoning tasks, especially in low-budget scenarios.

https://arxiv.org/abs//2504.09858

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
8min
April 20, 2025 Reasoning Models Can Be Effective Without Thinking

This paper challenges the necessity of lengthy reasoning processes in LLMs, showing that simple prompting (NoThinking) can outperform traditional methods in various reasoning tasks, especially in low-budget scenarios.

https://arxiv.org/abs//2504.09858

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
21min
April 20, 2025 [QA] A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

This paper analyzes GRPO in reinforcement learning for language models, revealing that a simple rejection sampling method, RAFT, performs competitively and suggesting improvements for future reward-based training approaches.

https://arxiv.org/abs//2504.11343

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
9min
April 20, 2025 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

This paper analyzes GRPO in reinforcement learning for language models, revealing that a simple rejection sampling method, RAFT, performs competitively and suggesting improvements for future reward-based training approaches.

https://arxiv.org/abs//2504.11343

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
15min
April 19, 2025 [QA] CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

https://arxiv.org/abs//2504.13161

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
8min
April 19, 2025 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

https://arxiv.org/abs//2504.13161

YouTube: https://www.youtube.com/@ArxivPapers

TikTok: https://www.tiktok.com/@arxiv_papers

Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016

Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers

...more
21min

FAQs about Arxiv Papers:

How many episodes does Arxiv Papers have?

The podcast currently has 2,489 episodes available.

More shows like Arxiv Papers

Exchanges by Goldman Sachs

Exchanges

970 Listeners

Odd Lots by Bloomberg

Odd Lots

1,967 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

436 Listeners

The Daily by The New York Times

The Daily

111,948 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,182 Listeners

Hard Fork by The New York Times

Hard Fork

5,530 Listeners

UnHerd with Freddie Sayers by UnHerd

UnHerd with Freddie Sayers

195 Listeners

Unsupervised Learning with Jacob Effron by by Redpoint Ventures

Unsupervised Learning with Jacob Effron

52 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

101 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

491 Listeners