February 26, 2026

Building a Universal Speech Model: Native Accuracy Across 60+ Languages

49 minutes

In this episode of the Convo AI World Podcast, Hermes Frangoudis interviews Klemen Simonic, founder and CEO of Soniox, who discusses how his team is achieving native speaker accuracy across 60+ languages. Klemen explains how Soniox leverages unsupervised learning and a universal model architecture to handle seamless language switching and real-time, mid-sentence translation with minimal latency. By prioritizing robustness and low-latency performance over traditional cascading models, Soniox enables high-fidelity voice interfaces for healthcare, wearables, and voice agents, while also breaking down significant accessibility barriers for the hearing-impaired community

Check out video episodes and subscribe to the Convo AI Newsletter at convoai.world

...more

View all episodes

By Agora

February 26, 2026

Building a Universal Speech Model: Native Accuracy Across 60+ Languages

49 minutes

Check out video episodes and subscribe to the Convo AI Newsletter at convoai.world

...more

Share Building a Universal Speech Model: Native Accuracy Across 60+ Languages

Sign up to save your podcasts

Building a Universal Speech Model: Native Accuracy Across 60+ Languages

Building a Universal Speech Model: Native Accuracy Across 60+ Languages