


Exhausting the data on the Internet by 2026, the value or non-value of transcripts, training on other languages, and subsidizing additional dataset generation
By Pierce Freeman & Richard Diehl Martinez