


On the premiere episode of the AI Inside podcast, hosts Jeff Jarvis and Jason Howell talk with Rich Skrenta of the Common Crawl Foundation about AI copyright issues, in particular news outlets restricting access to content they publish publicly, which undermines the integrity of Common Crawl's internet archive. In recent years the archive has been used as training data for LLMs, and restricting what it can collect has a dramatic effect on the quality of the data that remains.
INTERVIEW
NEWS BITES
By Jason Howell and Jeff Jarvis
