MiniMax Speech-02-HD: Zero-Shot, Multilingual, Cost-Effective TTS

What sets MiniMax apart:

Zero-shot speaker cloning using raw audio (no transcripts required)

Flow-VAE model: no spectrograms needed, enabling faster and more natural speech

Multilingual and cross-lingual synthesis (supports Thai, Vietnamese, Cantonese, etc.)

MiniMax Audio

1mo ago

MiniMax Audio New Milestone!

MiniMax Audio's Speech-02-HD is now ranked #1 globally on Artificial Analysis Arena!

MiniMax Audio

3mo ago

MiniMax Audio - Level Up Your Audio with Realistic AI Voices

MiniMax Audio just leveled up with the new Speech-02 model! Get ultra-realistic Al voices (30+ langs, 99% similarity). Read Files/URLs & handle long text (200k chars). API available at: api@minimax.io.