VUX World

What is text-to-speech and how does it work with Niclas Bergström


Listen Later

Every voice assistant needs three core components: Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Text-to-Speech (TTS). We've already covered what Automatic Speech Recognition is and how it works with Catherine Breslin and in this episode, we're covering the latter, text-to-speech.


To guide us through the ins and outs of TTS, we're joined by Niclas Bergström, a TTS veteran and co-founder of one of the largest TTS companies on the planet, Readspeaker.


Text-to-speech is the technology that gives voice assistants a voice. It's the thing that produces the synthetic vocal sound that's played from your smart speaker or phone whenever Alexa or Siri speaks. It's the only part of a voice assistant that you'd recognise. The other core components, ASR and NLU, are silent.


And, given how we're hard wired for speech - a baby can recognise its mother's voice from the womb - how your voice assistant or voice user interface (VUI) sounds is one of the most important parts of it.


A voice communicates so much information without us necessarily being aware. Just from the sound of someone's voice, you can infer gender, age, mood, education, place of birth and social status. From the sound of someone's voice, you can decide whether you trust them.


With voice assistants, voice user interfaces, or any hardware or software that speaks, choosing the right voice is imperative.


Some companies decide on a stock voice. One of Readspeaker's 90 voices or perhaps Amazon Polly. Others create their own bespoke voice that's fit for their brand.


We see examples of Lyrbird's voice cloning and we hear Alexa speak every day, so it's easy to take talking computers for granted. Because speaking is natural and easy for us, we assume that it's natural and easy for machines to talk. But it isn't.


So in this episode, we're going to lift the curtain on text-to-speech and find out just exactly how it works. We'll look at what's happening under the hood when voice assistants talk and see what goes into creating a TTS system.


Readspeaker is a pioneering voice technology company that provides lifelike Text to Speech (TTS) services for IVR systems, voice applications, automobiles, robots, public service announcement systems, websites or anywhere else. It's been in the TTS game for over 20 years and has in-depth knowledge and experience in AI and Deep Neural Networks, which they put to work in creating custom TTS voices for the world's biggest brands.


Links

Visit Readspeaker.com to find out more about TTS services

And Readspeaker.ai for more information on TTS research and samples

Hosted on Acast. See acast.com/privacy for more information.

...more
View all episodesView all episodes
Download on the App Store

VUX WorldBy Kane Simms

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

8 ratings


More shows like VUX World

View all
Accidental Tech Podcast by Marco Arment, Casey Liss, John Siracusa

Accidental Tech Podcast

2,092 Listeners

Rahapodi by Nordnet

Rahapodi

5 Listeners

Pivot by New York Magazine

Pivot

9,111 Listeners

The Official SaaStr Podcast: SaaS | Founders | Investors by SaaStr

The Official SaaStr Podcast: SaaS | Founders | Investors

174 Listeners

Azeem Azhar's Exponential View by Azeem Azhar

Azeem Azhar's Exponential View

611 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

323 Listeners

The Diary Of A CEO with Steven Bartlett by DOAC

The Diary Of A CEO with Steven Bartlett

6,939 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,053 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

421 Listeners

NN/g UX Podcast by Nielsen Norman Group

NN/g UX Podcast

106 Listeners

Hard Fork by The New York Times

Hard Fork

5,420 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

15,229 Listeners

ZOE Science & Nutrition by ZOE

ZOE Science & Nutrition

1,993 Listeners

The Rest Is Politics by Goalhanger

The Rest Is Politics

3,131 Listeners

The Rest Is Entertainment by Goalhanger

The Rest Is Entertainment

803 Listeners