Skip to main content
Audixa AI offers two powerful text-to-speech models to suit different needs: Base and Advanced. Each model is optimized for a different set of use cases, from general-purpose narration to highly expressive, custom voice generation. Understanding the differences will help you choose the best model for your application.

Base Model

The Base model is the perfect choice for a wide range of applications. It is designed for reliability, speed, and high-quality audio output across a large and diverse voice library.

54 Voices

Extensive library spanning various accents, genders, and styles.

High Quality

Clear, natural-sounding speech for narration and content creation.

Simple

Easy to use with straightforward API parameters.
For the majority of users, the Base model provides more than enough variety and quality to build incredible audio experiences.

Advanced Model

The Advanced model is built for users who require the highest level of customization, expressiveness, and unique vocal identity. While it has a smaller library of preset voices, its power lies in its advanced features.

Voice Cloning

Create a digital replica of any voice from a short audio sample.

Custom Voices

Design unique, high-quality synthetic voices tailored to your brand.

Superior Expressiveness

Fine-tuned for more nuanced and emotionally resonant speech.

Greater Customization

Detailed control over expressiveness settings like cfg_weight and exaggeration.
The Advanced model is ideal for enterprises, creators, and developers who need a signature voice or the flexibility of voice cloning.

Side-by-Side Comparison

FeatureBase ModelAdvanced Model
Preset Voices✅ 54 Voices⚪️ Fewer (Focus on custom)
Voice Cloning❌ Not Supported✅ Supported
Custom Voices❌ Not Supported✅ Supported
Expressiveness Control❌ Not Supported✅ Supported
QualityHigh Quality & NaturalHighest Quality & Nuanced
Best ForGeneral applicationsBranding, custom experiences
API Parameter"base""advanced"
CostSee PricingSee Pricing

Specifying a Model in Your API Request

To use a specific model, set the model parameter in your API request:
{
  "text": "Hello, this is a voice from the Base model.",
  "voice_id": "am_ethan",
  "model": "base"
}

SDK Examples

import audixa

audixa.set_api_key("YOUR_API_KEY")

# Using Base model
audio_url = audixa.tts_and_wait(
    "Hello from the Base model.",
    voice_id="am_ethan",
    model="base",
)

# Using Advanced model with expressiveness settings
audio_url = audixa.tts_and_wait(
    "Hello from the Advanced model!",
    voice_id="am_ethan",
    model="advanced",
    cfg_weight=3.5,
    exaggeration=0.7,
)

Which Model Should I Use?

Choose Base If...

  • You need a wide variety of preset voices
  • Cost efficiency is important
  • Standard narration and content creation
  • Quick implementation without customization

Choose Advanced If...

  • You need voice cloning capabilities
  • Creating a unique brand voice
  • Emotional and expressive content
  • Maximum control over output