Arxiv Papers

[QA] Position: The Most Expensive Part of an LLM should be its Training Data


Listen Later



This paper argues that compensating human labor for training data is the largest cost in developing Large Language Models, significantly exceeding model training expenses, and suggests fairer practices for the future.


https://arxiv.org/abs//2504.12427


YouTube: https://www.youtube.com/@ArxivPapers


TikTok: https://www.tiktok.com/@arxiv_papers


Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016


Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers


...more
View all episodesView all episodes
Download on the App Store

Arxiv PapersBy Igor Melnyk

  • 5
  • 5
  • 5
  • 5
  • 5

5

3 ratings


More shows like Arxiv Papers

View all
FT News Briefing by Financial Times

FT News Briefing

697 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

202 Listeners

Last Week in AI by Skynet Today

Last Week in AI

281 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

75 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

429 Listeners