AI Inside

AI Positive



On the premiere episode of the AI Inside podcast, hosts Jeff Jarvis and Jason Howell talk with Rich Skrenta of the Common Crawl Foundation about AI and copyright. News outlets are limiting access to content they publish publicly, which undermines the integrity of Common Crawl's internet archive. In recent years the archive has been used as training data for LLMs, so restricting access has a dramatic effect on the quality of the data that survives.


INTERVIEW

  • Introduction and background on AI Inside podcast
  • Discussion of the recent Senate hearing on AI oversight at which Jeff testified
  • Introduction of guest Rich Skrenta from Common Crawl Foundation
  • Overview of Common Crawl and its goals to archive the open web
  • Discussion of how Common Crawl data is used to train AI models
  • News publishers wanting content removed from Common Crawl
  • Debate around copyright, fair use, and AI’s “right to read”
  • Mechanics of how Common Crawl works and what it archives
  • Concerns about restricting AI access to data for training
  • Risk of regulatory capture and only big companies being able to use AI
  • Discussion of recent court ruling related to web scraping
  • Hopes for Common Crawl's growth and evolution

NEWS BITES

  • Interesting device announcement from CES - Rabbit R1 with Perplexity AI integration
  • Study on actual risk of AI automating jobs away in the near future

AI Inside, by Yellowgold Studios
