November 01, 2024

Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model

3 minutes

Zhipu AI has released GLM-4-Voice, an open-source speech large language model that combines speech recognition, text generation, and speech synthesis into a single system.

This model can translate speech to text, text to speech, and even speech to speech. GLM-4-Voice is built upon the GLM-4 language model and supports both English and Chinese.

This open-source release, like others such as LG's EXAONE 3.0 and Google's Gemma, provides researchers and developers with the tools to further explore and advance the field of speech artificial intelligence.

...more

View all episodes

By Michael Iversen

November 01, 2024

Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model

3 minutes

Zhipu AI has released GLM-4-Voice, an open-source speech large language model that combines speech recognition, text generation, and speech synthesis into a single system.

This model can translate speech to text, text to speech, and even speech to speech. GLM-4-Voice is built upon the GLM-4 language model and supports both English and Chinese.

...more

Share Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model

Sign up to save your podcasts

Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model

Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model