Fish Speech

Fish Speech

Few-shot Voice Cloning and Text-to-Speech

4.8
4 reviews

78 followers

With just 15 seconds of any voice, Fish Speech can reliably synthesize natural and fluent speech while maintaining the given timbre, style, and accent. Our open-source team, creators of So-VITS-SVC and Bert-VITS2, proudly introduces Fish Speech.
This is the 2nd launch from Fish Speech. View more
Fish Speech 1.4

Fish Speech 1.4

Open-Source Multilingual Text-to-Speech with Voice Cloning
Your Voice, Your Way: Open-Source TTS Powerful, fast, and natural speech in any language. Clone voices instantly. Self-host or use our service. Lightning-fast, affordable pricing.
Fish Speech 1.4 gallery image
Fish Speech 1.4 gallery image
Fish Speech 1.4 gallery image
Fish Speech 1.4 gallery image
Free
Launch tags:
APIOpen SourceGitHub
Launch Team / Built With

What do you think? …

Yue Leng
Maker
📌
Excited to introduce Fish Audio 1.4 - now open-source and more powerful than ever! 🎉 What's new: - Trained on 700k hours of multilingual data (up from 200k) - Now supports 8 languages: English, Chinese, German, Japanese, French, Spanish, Korean, and Arabic - Fully open-source, empowering developers and researchers worldwide Our mission: Make cutting-edge voice tech accessible to everyone. Key features: - Lightning-fast TTS with ultra-low latency - Instant voice cloning - Self-host or use our cloud service - Simple, flat-rate pricing Try it out: - Playground: https://fish.audio - GitHub: https://github.com/fishaudio/fis... - HuggingFace Model: https://huggingface.co/fishaudio... - Demo: https://huggingface.co/spaces/fi... We can't wait to see what you'll create with Fish Audio. Happy voice building! 🎧🐠
Pradhumn Vijayvargiya
@lengyue This is amazing, are you planning to add Hindi to the languages list? there's a huge market with hindi audio, pls do explore
Yue Leng
@owenfar We have a demo on hf space :) https://huggingface.co/spaces/fi...
Jatin Kaurani
@lengyue Congratulations on the launch of Fish Audio 1.4! 🎉 It's incredible to see the platform grow with 700k hours of multilingual data and support for 8 languages—this is a huge step forward! Making it open-source will truly empower developers and researchers across the globe. Excited to see the innovations that come from this. Keep up the amazing work!
Allen
Congratulations on launching such an innovative product! I'm really intrigued by the idea of having powerful, fast, and natural speech synthesis available in any language—it's a game changer for accessibility and creativity. The feature that stands out to me is the ability to clone voices instantly. This opens up so many possibilities for content creators and developers alike. Additionally, the option to self-host or use your service provides flexibility that many users will appreciate. I’m curious about how you handle voice cloning from an ethical standpoint. Also, are there plans to integrate more languages or dialects in the future? Excited to see where this goes—keep up the great work!
Yue Leng
@allen_xu1130 Yes, we are adding more languages and improving our repo to make it easy to use :)
Liam Patrick O'Connor
Sounds really powerful! 🎧 The fact that it's now trained on 700k hours and supports 8 languages is a huge plus. Congrats on the open-source release! 🚀