
Sign up to save your podcasts
Or


On the premiere episode of the AI Inside podcast, hosts Jeff Jarvis and Jason Howell discuss AI copyright issues with Common Crawl Foundation's Rich Skrenta regarding news outlets limiting access to content they publish publicly, impacting the integrity of Common Crawl's internet archive. In recent years, the archive has been used by LLMs as AI training data, and the implications of restricting information have a dramatic impact on the data quality that survives.
INTERVIEW
NEWS BITES
Learn more about your ad choices. Visit megaphone.fm/adchoices
By Jason Howell and Jeff Jarvis4.9
4242 ratings
On the premiere episode of the AI Inside podcast, hosts Jeff Jarvis and Jason Howell discuss AI copyright issues with Common Crawl Foundation's Rich Skrenta regarding news outlets limiting access to content they publish publicly, impacting the integrity of Common Crawl's internet archive. In recent years, the archive has been used by LLMs as AI training data, and the implications of restricting information have a dramatic impact on the data quality that survives.
INTERVIEW
NEWS BITES
Learn more about your ad choices. Visit megaphone.fm/adchoices

836 Listeners

3,059 Listeners

1,964 Listeners

2,011 Listeners

2,013 Listeners

888 Listeners

1,072 Listeners

781 Listeners

3,722 Listeners

555 Listeners

3,141 Listeners

1,391 Listeners

532 Listeners

964 Listeners

512 Listeners