
This episode breaks down the academic paper 'Deep Speech 2: End-to-End Speech Recognition in English and Mandarin', which describes Deep Speech 2, a speech recognition system developed by Baidu Research. The researchers detail how they built the system, which uses a recurrent neural network to convert audio spectrograms directly into text. Deep Speech 2 was designed to be scalable and efficient: it can train on large amounts of data, process audio in real time, and reach accuracy competitive with human transcribers on several benchmarks. The authors achieved this with a range of techniques, including convolutional layers, batch normalization, and a novel optimization curriculum called SortaGrad. The paper concludes by highlighting the potential of Deep Speech 2 to transform speech recognition technology.
Audio (Spotify): https://open.spotify.com/episode/2b4FfJWVuBLAQDO6TjwbWH?si=irzi6ifkRi6xw-5ldXbVkQ
Paper: https://arxiv.org/pdf/1512.02595
 By Marvin The Paranoid Android
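
For listeners curious about SortaGrad, here is a minimal sketch of the idea as the paper describes it: in the first epoch, minibatches are visited from shortest to longest utterances, and later epochs visit minibatches in random order. The function name `sortagrad_batches`, the `(length, transcript)` tuple layout, and the seeding scheme are assumptions made for illustration and are not taken from the paper or from any Baidu code.

```python
import random


def sortagrad_batches(utterances, batch_size, epoch, seed=0):
    """Return a list of minibatches for one epoch.

    utterances: list of (length, transcript) pairs (hypothetical layout).
    Epoch 0 visits minibatches in order of increasing utterance length;
    later epochs visit the same minibatches in a random order.
    """
    # Group utterances of similar length by sorting before batching.
    ordered = sorted(utterances, key=lambda u: u[0])
    batches = [
        ordered[i:i + batch_size]
        for i in range(0, len(ordered), batch_size)
    ]
    if epoch > 0:
        # Assumption: a per-epoch seed keeps the shuffle reproducible.
        random.Random(seed + epoch).shuffle(batches)
    return batches


if __name__ == "__main__":
    data = [(5, "e"), (1, "a"), (3, "c"), (2, "b"), (4, "d")]
    for epoch in range(2):
        print(epoch, sortagrad_batches(data, batch_size=2, epoch=epoch))
```

Starting with short utterances is a curriculum choice: early in training, long sequences tend to produce less stable gradients, so easing in with short ones can help. Whether later epochs keep the length-grouped minibatches, as this sketch does, is an assumption.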