MiniMax API Text-to-Speech (TTS): Voices & Samples

3 min read • December 30, 2024 (June 15, 2026)

Table of contents

  1. Introduction
  2. Cloned Voices Samples
  3. Standard Voices Samples
  4. Current API: Mureka speech
  5. Conclusion

MiniMax’s text-to-speech endpoints have since been retired. The voices and samples below remain as a showcase of what the original MiniMax TTS model produced. For current text-to-speech and voice cloning, use the Mureka API — see POST speech, GET speech/voices and POST speech/voice (voice cloning).

Introduction

The MiniMax API v1 was a third-party API for the MiniMax speech AI model, deployed at www.minimax.io/audio.

At the time these samples were produced, the MiniMax TTS API provided the following features:

  • Up to 20 parallel TTS jobs per single www.minimax.io/audio account.
    You can connect as many accounts as you need.
  • Average response time for live streaming is 3 seconds.
  • Average time to create an MP3 from text is under 10 seconds.
  • Over 300 pre-built voices available.
  • Ability to clone voices.
  • Supported Languages: English (US, UK, Australia, India), Chinese (Mandarin and Cantonese), Japanese, Korean, French, German, Spanish, Portuguese (including Brazilian), Italian, Arabic, Russian, Turkish, Dutch, Ukrainian, Vietnamese, and Indonesian.
    The list is constantly updated to include more languages!
  • Supported Emotions: happy, sad, angry, fearful, disgusted, surprised, and neutral.
  • Supported Accents: US (General), British English, and Indian English.
  • Supported Ages: Young Adult, Adult, Middle-Aged, and Senior.
  • Supported Genders: Male and Female.

Cloned Voices Samples

Examples below were created using the MiniMax API endpoint POST audio/create-mp3 using voices cloned via POST audio/clone-voice:

  • Donald Trump on AI moderation issues (audio clip used for voice cloning)
  • Scarlett Johansson (audio clip used for voice cloning)
  • Arnold Schwarzenegger as T1000 (audio clip used for voice cloning)
  • Morgan Freeman (audio clip used for voice cloning)

Standard Voices Samples

Examples below were created using the MiniMax API endpoint POST audio/create-mp3:

  • Dr. Evil: Sharks with laser beams attached to their heads reference

Current API: Mureka speech

MiniMax’s audio endpoints are retired. To generate text-to-speech and clone voices today, use the Mureka API:

Conclusion

Visit our Discord Server or Telegram Channel for any support questions and concerns.

We regularly post guides and tutorials on the YouTube Channel.

Check our GitHub repo with code examples.