
Launched on February 22nd, 2025
Hi Everyone!
Solving AI audio end-to-end means tackling both generation and understanding - from text-to-speech to speech-to-text and everything in between. At ElevenLabs, we're working on breakthroughs in AI audio that bridge research and real-world use.
Ask me anything about what we're building, the challenges of scaling AI speech models, and where this space is headed. Also keen to hear what you've built with ElevenLabs!
Talk to your dog with our new AI Pawdio engine. Simply type a message, choose your breed, and our models will convert it into fluent barking. 😀
Our speech-to-text model outperforms Gemini 2.0 and Whisper v3, setting new benchmarks in accuracy. Leading in English, Spanish, Italian, and more, it supports 99 languages, speaker diarization, character-level timestamps, and non-speech events like laughter.
I've tried it, and the quality of the voices is extremely good. I have a problem, though, and would love to get some help from the ElevenLabs team. In my use case (Touring, you can find it here on PH) I will almost certainly have to deal with numbers, dates, etc. I've played around on your landing page and tried sentences like "I have 200000 apples", or - in Italian - "Ho 1234 case in giro per il mondo" ("I have 1234 houses around the world"), and the quality of the voice goes down dramatically. In the 200000 apples case, I even got "I have twenty thousand thousand" instead of "I have two hundred thousand". Can you help? Perhaps this is something the team should focus on improving? In my case, unfortunately, it makes the tool unusable even though I love the quality of your voices. Thanks!
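A common workaround for number problems like the one described above is to spell out digits before the text reaches the TTS engine. Below is a minimal Python sketch of that idea. It is not ElevenLabs' method or API: the names `int_to_words` and `spell_out_numbers` are hypothetical, and it only covers plain English integers (no decimals, dates, currencies, or Italian, which would need its own word tables).

```python
import re

# Word tables for English integers (assumption: English-only input).
ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]
SCALES = [(1_000_000_000, "billion"), (1_000_000, "million"), (1_000, "thousand")]


def int_to_words(n: int) -> str:
    """Spell out an integer in English words (hypothetical helper)."""
    if n < 0:
        return "minus " + int_to_words(-n)
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        return ONES[hundreds] + " hundred" + (" " + int_to_words(rest) if rest else "")
    # Largest scale first, so 200000 becomes "two hundred" + "thousand".
    for value, name in SCALES:
        if n >= value:
            head, rest = divmod(n, value)
            return int_to_words(head) + " " + name + (" " + int_to_words(rest) if rest else "")


def spell_out_numbers(text: str) -> str:
    """Replace standalone digit runs with words before sending text to TTS."""
    return re.sub(r"\b\d+\b", lambda m: int_to_words(int(m.group())), text)


print(spell_out_numbers("I have 200000 apples"))
# prints "I have two hundred thousand apples"
```

The preprocessed string can then be passed to whichever TTS call you already use. The `\b` word boundaries leave identifiers such as "v3" untouched, so only standalone numbers are rewritten.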
I love ElevenLabs' ability to transform written text into lifelike speech across multiple languages, making content creation more engaging and accessible. Its user-friendly interface and extensive voice library offer a variety of accents and tones, allowing for seamless customization to suit any project's needs. The platform's advanced features, such as voice cloning and real-time editing tools, provide unparalleled flexibility and control over audio outputs. Additionally, ElevenLabs' competitive pricing and free version make it an accessible choice for both beginners and professionals seeking high-quality AI voice generation.
I strongly believe ElevenLabs' solutions are the best products among voice-to-text technologies. I can feel the quality when using them. I'll continue following their solutions in the future.
I just tried ElevenLabs Reader and loved it! Great library of voices, and the links upload quickly. I also appreciated the minimalistic and easy-to-use design. Great job!
hands down the best TTS voice generator, voice changer, voice cloner on the market... and now they also do sounds as well. imho currently no competitor comes close in terms of authenticity of the voices.