3mo ago
What sets MiniMax apart:
Zero-shot speaker cloning using raw audio (no transcripts required)
Flow-VAE model: no spectrograms needed, enabling faster and more natural speech
Multilingual and cross-lingual synthesis (supports Thai, Vietnamese, Cantonese, etc.)
MiniMax Audio's Speech-02-HD is now ranked #1 globally on Artificial Analysis Arena!