Practical AI

Speech tech and Common Voice at Mozilla



Many people are excited about creating usable speech technology. However, most of the audio data used by large companies isn’t available to the majority of people, and that data is often biased in terms of language, accent, and gender. Jenny, Josh, and Remy from Mozilla join us to discuss how Mozilla is building Common Voice, an open-source voice database that anyone can use to make innovative apps for devices and the web. They also discuss efforts through Mozilla’s fellowship program to develop speech tech for African languages and to understand bias in data sets.


Changelog++ members get a bonus 2 minutes at the end of this episode and zero ads. Join today!

Sponsors:

  • Linode – Our cloud of choice and the home of Changelog.com. Deploy a fast, efficient, native SSD cloud server for only $5/month. Get 4 months free using the code changelog2019 OR changelog2020. To learn more and get started, head to linode.com/changelog
  • Pace.dev – Minimalist web-based management tool for your teams. Async-by-default communication and simplistic task management give you everything you need to build your next thing. Brought to you by Go Time panelist Mat Ryer. Try it out today!
  • Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
  • Rollbar – We move fast and fix things because of Rollbar. Resolve errors in minutes. Deploy with confidence. Learn more at rollbar.com/changelog

Featuring:

  • Jenny Zhang – Website, X
  • Remy Muhire – GitHub, X
  • Josh Meyer – GitHub, X
  • Chris Benson – Website, GitHub, LinkedIn, X
  • Daniel Whitenack – Website, GitHub, X

Show Notes:

  • Mozilla Common Voice
  • Announcement of Josh and Remy’s fellowship work on speech tech for African languages
  • Artie Bias Corpus
  • Readings on Demographic Bias in ASR: 
    • Voice recognition still has significant race and gender biases
    • Gender and Dialect Bias in YouTube’s Automatic Captions
    • Racial disparities in automated speech recognition
  • Common Voice LREC Paper
  • Common Voice + DeepSpeech collaborators for Low-resource languages: 
    • Digital Umuganda
    • AI Lab, Makerere University
    • Language Technologies Unit, Bangor University
    • Linguistics Department, Indiana University Bloomington
  • “Under-sampled majority” is a quote from Joy Buolamwini (see this article)


Practical AI by Practical AI LLC


