The Python Podcast.__init__

Exploring The SpeechBrain Toolkit For Speech Processing


Listen Later

Summary

With the rising availability of computation in everyday devices, there has been a corresponding increase in the appetite for voice as the primary interface. To accomodate this desire it is necessary for us to have high quality libraries for being able to process and generate audio data that can make sense of human speech. To facilitate research and industry applications for speech data Mirco Ravanelli and Peter Plantinga are building SpeechBrain. In this episode they explain how it works under the hood, the projects that they are using it for, and how you can get started with it today.

Announcements
  • Hello and welcome to Podcast.__init__, the podcast about Python’s role in data and science.
  • When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it’s easy to get started with the next generation of deployment and scaling, powered by the battle tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to pythonpodcast.com/linode and get a $100 credit to try out a Kubernetes cluster of your own. And don’t forget to thank them for their continued support of this show!
  • Your host as usual is Tobias Macey and today I’m interviewing Mirco Ravanelli and Peter Plantinga about SpeechBrain, an open-source and all-in-one speech toolkit powered by PyTorch
  • Interview
    • Introductions
    • How did you get introduced to Python?
    • Can you describe what SpeechBrain is and the story behind it?
    • What are the goals and target use cases of the SpeechBrain project?
    • What are some of the ways that processing audio with a focus on speech differs from more general audio processing?
    • What are some of the other libraries/frameworks/services that are available to work with speech data and what are the unique capabilities that SpeechBrain offers?
    • How is SpeechBrain implemented?
      • What was your decision process for determining which framework to build on top of?
      • What are some of the original ideas and assumptions that you had for SpeechBrain which have been changed or invalidated as you worked through implementing it?
      • Can you talk through the workflow of using SpeechBrain?
        • What would be involved in developing a system to automate transcription with speaker recognition and diarization?
        • In the documentation it mentions that SpeechBrain is built to be used for research purposes. What are some of the kinds of research that it is being used for?
        • What are some of the features or capabilities of SpeechBrain which might be non-obvious that you would like to highlight?
        • What are the most interesting, innovative, or unexpected ways that you have seen SpeechBrain used?
        • What are the most interesting, unexpected, or challenging lessons that you have learned while working on SpeechBrain?
        • When is SpeechBrain the wrong choice?
        • What do you have planned for the future of SpeechBrain?
        • Keep In Touch
          • Mirco
            • mravanelli on GitHub
            • LinkedIn
            • @mirco_ravanelli on Twitter
            • Peter
              • pplantinga on GitHub
              • @ComPeterScience on Twitter
              • Website
              • LinkedIn
              • Picks
                • Tobias
                  • x.ai
                  • Closing Announcements
                    • Thank you for listening! Don’t forget to check out our other show, the Data Engineering Podcast for the latest on modern data management.
                    • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
                    • If you’ve learned something or tried out a project from the show then tell us about it! Email [email protected]) with your story.
                    • To help other people find the show please leave a review on iTunes and tell your friends and co-workers
                    • Join the community in the new Zulip chat workspace at pythonpodcast.com/chat
                    • Links
                      • SpeechBrain
                      • Mila
                      • Speech Processing
                      • Speech Enhancement
                      • NumPy
                      • SciPy
                      • Theano
                      • PyTorch
                        • Podcast Episode
                        • Speech Recognition
                        • NeMo
                        • ESPNet
                        • Sequence to Sequence (Seq2Seq)
                        • HyperParameters
                        • TorchAudio
                        • PyTorch Lightning
                        • Keras
                        • HuggingFace
                        • Generative Adversarial Network
                        • Snorkel
                          • Data Engineering Podcast Episode
                          • The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA

                            ...more
                            View all episodesView all episodes
                            Download on the App Store

                            The Python Podcast.__init__By Tobias Macey

                            • 4.4
                            • 4.4
                            • 4.4
                            • 4.4
                            • 4.4

                            4.4

                            100 ratings


                            More shows like The Python Podcast.__init__

                            View all
                            TED Talks Daily by TED

                            TED Talks Daily

                            11,280 Listeners

                            6 Minute English by BBC Radio

                            6 Minute English

                            1,779 Listeners

                            The Changelog: Software Development, Open Source by Changelog Media

                            The Changelog: Software Development, Open Source

                            285 Listeners

                            Data Skeptic by Kyle Polich

                            Data Skeptic

                            474 Listeners

                            Talk Python To Me by Michael Kennedy

                            Talk Python To Me

                            585 Listeners

                            Software Engineering Daily by Software Engineering Daily

                            Software Engineering Daily

                            630 Listeners

                            The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

                            The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

                            429 Listeners

                            Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

                            Super Data Science: ML & AI Podcast with Jon Krohn

                            295 Listeners

                            Python Bytes by Michael Kennedy and Brian Okken

                            Python Bytes

                            212 Listeners

                            Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

                            Syntax - Tasty Web Development Treats

                            984 Listeners

                            DataFramed by DataCamp

                            DataFramed

                            267 Listeners

                            Practical AI by Practical AI LLC

                            Practical AI

                            196 Listeners

                            The Real Python Podcast by Real Python

                            The Real Python Podcast

                            136 Listeners

                            Last Week in AI by Skynet Today

                            Last Week in AI

                            275 Listeners

                            Latent Space: The AI Engineer Podcast by swyx + Alessio

                            Latent Space: The AI Engineer Podcast

                            64 Listeners