In this episode, Chris is harassed by quite a few artificial nuisance callers, among
drug lords, Irish nurses and some random Linux Inlaws Chief Financial Officer. Based
on these examples, our two heroes discuss the history and current state of text-to-
speech (TTS) and voice recognition. We attempted to use voice recognition software in order
to produce a transcript of the show.
Shownotes:
Wavenet: https://deepmind.com/blog/article/wavenet-generative-model-raw-audioTacotron: https://ai.googleblog.com/2017/12/tacotron-2-generating-human-like-speech.htmlDeepSpeech: https://github.com/mozilla/DeepSpeechLyrebird / Welcome.AI: https://www.welcome.ai/lyrebirdNvidia Tacotron 2: https://github.com/NVIDIA/tacotron2Tensorflow: https://www.tensorflow.orgPyTorch: https://pytorch.orgMelspectrograms: https://medium.com/analytics-vidhya/understanding-the-mel-spectrogram-fca2afa2ce53GRAPHCORE: https://www.graphcore.aiFGPA: https://en.wikipedia.org/wiki/Field-programmable_gate_arrayIBM ROMP: https://en.wikipedia.org/wiki/IBM_ROMPGoogle's TTS: https://cloud.google.com/text-to-speechApple M1: https://www.gsmarena.com/the_apple_m1_is_the_first_armbased_chipset_for_macs_with_the_fastest_cpu_cores_and_top_igpu-news-46222.phpSecure Enclaves: https://support.apple.com/guide/security/secure-enclave-overview-sec59b0b31ff/webOSDU: https://www.opengroup.org/osdu/forum-homepageJack Kerouac's On the Road: https://en.wikipedia.org/wiki/On_the_Road