Kyutai TTS

Kyutai TTS

The voice for your real-time AI applications

183 followers

Kyutai TTS is a new open-source text-to-speech model optimized for real-time use. It's the first TTS that streams text in as it streams audio out, enabling ultra-low latency for LLM applications.
Kyutai TTS gallery image
Kyutai TTS gallery image
Kyutai TTS gallery image
Kyutai TTS gallery image
Free
Launch Team

What do you think? …

Zac Zuo
Hunter
📌

Hi everyone!

I was literally blown away by the quality of this new open-source text-to-speech model from Kyutai 🤯 The voices are incredibly natural, and the response time is impressively fast.

It's the first TTS that can stream text in while it streams audio out. Ultra speed. This is a huge deal for real-time conversational AI, as it can start talking almost instantly as an LLM generates text, without waiting for the full response.

It's amazing to see a model this good, with such a smart technical approach, be so generously open-sourced.

If you want to support their work, you can even help expand the voice library by donating your own voice here: https://unmute.sh/voice-donation

Animesh

Awesome work @vvolhejn and @eugene_kharitonov

Just tried all the voices, I am definitely going to use the Calming (US, m) & Calming (US, f) for a custom guided meditation session

Congrats on the launch!

Isabella Song

The real - time streaming and natural voices of Kyutai TTS are game - changers for conversational AI! For developers integrating it into multilingual chatbots, how does the model handle accurate pronunciation and tone consistency across different languages?