Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video... more
FAQs about Arxiv Papers:How many episodes does Arxiv Papers have?The podcast currently has 2,267 episodes available.
May 07, 2025[QA] Absolute Zero: Reinforced Self-play Reasoning with Zero Datahttps://arxiv.org/abs//2505.03335YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more8minPlay
May 07, 2025Absolute Zero: Reinforced Self-play Reasoning with Zero Datahttps://arxiv.org/abs//2505.03335YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more28minPlay
May 07, 2025[QA] Teaching Models to Understand (but not Generate) High-risk DataThe lmssSLUNG paradigm allows language models to understand high-risk content without generating it, improving their ability to recognize harmful text while preventing toxic outputs.https://arxiv.org/abs//2505.03052YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more8minPlay
May 07, 2025Teaching Models to Understand (but not Generate) High-risk DataThe lmssSLUNG paradigm allows language models to understand high-risk content without generating it, improving their ability to recognize harmful text while preventing toxic outputs.https://arxiv.org/abs//2505.03052YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more17minPlay
May 06, 2025[QA] RM-R1: Reward Modeling as ReasoningThis paper introduces Reasoning Reward Models (REASRMS) to enhance interpretability and performance in reward modeling for large language models, achieving state-of-the-art results through innovative training methods.https://arxiv.org/abs//2505.02387YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more8minPlay
May 06, 2025RM-R1: Reward Modeling as ReasoningThis paper introduces Reasoning Reward Models (REASRMS) to enhance interpretability and performance in reward modeling for large language models, achieving state-of-the-art results through innovative training methods.https://arxiv.org/abs//2505.02387YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more26minPlay
May 06, 2025[QA] Practical Efficiency of Muon for PretrainingMuon outperforms AdamW in expanding the Pareto frontier for compute-time tradeoff, enhancing data efficiency at large batch sizes while enabling economical training through effective hyperparameter transfer.https://arxiv.org/abs//2505.02222YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more8minPlay
May 06, 2025Practical Efficiency of Muon for PretrainingMuon outperforms AdamW in expanding the Pareto frontier for compute-time tradeoff, enhancing data efficiency at large batch sizes while enabling economical training through effective hyperparameter transfer.https://arxiv.org/abs//2505.02222YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more24minPlay
May 05, 2025[QA] Llama-Nemotron: Efficient Reasoning ModelsThe Llama-Nemotron models offer advanced reasoning capabilities, efficient inference, and an open license, available in three sizes, with a unique dynamic reasoning toggle for enhanced user interaction.https://arxiv.org/abs//2505.00949YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more8minPlay
May 05, 2025Llama-Nemotron: Efficient Reasoning ModelsThe Llama-Nemotron models offer advanced reasoning capabilities, efficient inference, and an open license, available in three sizes, with a unique dynamic reasoning toggle for enhanced user interaction.https://arxiv.org/abs//2505.00949YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers...more27minPlay
FAQs about Arxiv Papers:How many episodes does Arxiv Papers have?The podcast currently has 2,267 episodes available.