Base Model
The Base model is the perfect choice for a wide range of applications. It is designed for reliability, speed, and high-quality audio output across a large and diverse voice library.
54 Voices
Extensive library spanning various accents, genders, and styles.
High Quality
Clear, natural-sounding speech for narration and content creation.
Simple
Easy to use with straightforward API parameters.
Advanced Model
The Advanced model is built for users who require the highest level of customization, expressiveness, and unique vocal identity. While it has a smaller library of preset voices, its power lies in its advanced features.
Voice Cloning
Create a digital replica of any voice from a short audio sample.
Custom Voices
Design unique, high-quality synthetic voices tailored to your brand.
Superior Expressiveness
Fine-tuned for more nuanced and emotionally resonant speech.
Greater Customization
Detailed control over expressiveness settings such as cfg_weight and exaggeration.
Side-by-Side Comparison
| Feature | Base Model | Advanced Model |
|---|---|---|
| Preset Voices | ✅ 54 Voices | ⚪️ Fewer (Focus on custom) |
| Voice Cloning | ❌ Not Supported | ✅ Supported |
| Custom Voices | ❌ Not Supported | ✅ Supported |
| Expressiveness Control | ❌ Not Supported | ✅ Supported |
| Quality | High Quality & Natural | Highest Quality & Nuanced |
| Best For | General applications | Branding, custom experiences |
| API Parameter | "base" | "advanced" |
| Cost | See Pricing | See Pricing |
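For client code that needs to check a feature before sending a request, the table can be transcribed into a small lookup. This is only a sketch: the dictionary layout, the feature key names, and the supports helper are illustrative assumptions, not part of the API. Only the "base" and "advanced" identifiers and the supported/unsupported values come from the table.

```python
# Capability matrix transcribed from the comparison table.
# Key names such as "voice_cloning" are hypothetical labels chosen for this sketch.
MODEL_FEATURES = {
    "base": {
        "voice_cloning": False,
        "custom_voices": False,
        "expressiveness_control": False,
    },
    "advanced": {
        "voice_cloning": True,
        "custom_voices": True,
        "expressiveness_control": True,
    },
}


def supports(model: str, feature: str) -> bool:
    """Return True if the table lists `feature` as supported for `model`."""
    if model not in MODEL_FEATURES:
        raise ValueError(f"unknown model: {model!r}")
    # Features not listed in the table are treated as unsupported.
    return MODEL_FEATURES[model].get(feature, False)
```

For example, supports("base", "voice_cloning") is False, while supports("advanced", "voice_cloning") is True.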
Specifying a Model in Your API Request
To use a specific model, set the model parameter in your API request:
- Base Model
- Advanced Model
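As a rough sketch of what setting the model parameter could look like, the helper below builds a JSON request body. Only the model parameter, its values "base" and "advanced", and the setting names cfg_weight and exaggeration come from this page. The text field, the top-level placement of the expressiveness settings, and the build_request_body helper itself are assumptions made for illustration; the endpoint and authentication are deliberately left out because they are not described here.

```python
import json

VALID_MODELS = ("base", "advanced")


def build_request_body(text, model="base", cfg_weight=None, exaggeration=None):
    """Build a JSON request body that selects a model (hypothetical schema)."""
    if model not in VALID_MODELS:
        raise ValueError(f"model must be one of {VALID_MODELS}, got {model!r}")

    # "text" is an assumed field name; "model" is the documented parameter.
    body = {"text": text, "model": model}

    # Keep only the expressiveness settings the caller actually provided.
    settings = {"cfg_weight": cfg_weight, "exaggeration": exaggeration}
    provided = {name: value for name, value in settings.items() if value is not None}

    # The comparison table lists expressiveness control as Advanced-only.
    if provided and model != "advanced":
        raise ValueError("expressiveness settings require model='advanced'")

    body.update(provided)
    return json.dumps(body)


print(build_request_body("Hello there"))
print(build_request_body("Hello there", model="advanced", exaggeration=0.5))
```

The value 0.5 is a placeholder; the valid ranges for cfg_weight and exaggeration are not stated on this page.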
SDK Examples
Which Model Should I Use?
Choose Base If...
- You need a wide variety of preset voices
- Cost efficiency is important
- You are producing standard narration or content
- You want a quick implementation without customization
Choose Advanced If...
- You need voice cloning capabilities
- You are creating a unique brand voice
- Your content is emotional and expressive
- You want maximum control over output
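The two checklists can be condensed into a simple rule of thumb: pick Advanced when any Advanced-only capability is required, otherwise default to Base. The function below is a minimal sketch of that rule. The choose_model name and its flags are hypothetical, and the sketch ignores softer factors such as cost and the size of the preset voice library, which favor Base.

```python
def choose_model(needs_cloning=False, needs_custom_voice=False,
                 needs_expressiveness_control=False):
    """Return the model parameter value suggested by the checklists.

    Any requirement that only the Advanced model supports selects "advanced";
    with none of them, the Base model is the default.
    """
    if needs_cloning or needs_custom_voice or needs_expressiveness_control:
        return "advanced"
    return "base"


# Standard narration with a preset voice.
print(choose_model())
# A cloned brand voice.
print(choose_model(needs_cloning=True))
```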