Skip to main content
Audixa AI offers two powerful text-to-speech models to suit different needs: Base and Advance. Each model is optimized for a different set of use cases, from general-purpose narration to highly expressive, custom voice generation. Understanding the differences will help you choose the best model for your application.

Base Model

The Base model is the perfect choice for a wide range of applications. It is designed for reliability, speed, and high-quality audio output across a large and diverse voice library.

54 Voices

Extensive library spanning various accents, genders, and styles.

High Quality

Clear, natural-sounding speech for narration and content creation.

Simple

Easy to use with straightforward API parameters.
For the majority of users, the Base model provides more than enough variety and quality to build incredible audio experiences.

Advance Model

The Advance model is built for users who require the highest level of customization, expressiveness, and unique vocal identity. While it has a smaller library of preset voices, its power lies in its advanced features.

Voice Cloning

Create a digital replica of any voice from a short audio sample.

Custom Voices

Design unique, high-quality synthetic voices tailored to your brand.

Superior Expressiveness

Fine-tuned for more nuanced and emotionally resonant speech.

Greater Customization

Detailed control over emotion, temperature, and sampling.
The Advance model is ideal for enterprises, creators, and developers who need a signature voice or the flexibility of voice cloning.

Side-by-Side Comparison

FeatureBase ModelAdvance Model
Preset Voices✅ 54 Voices⚪️ Fewer (Focus on custom)
Voice Cloning❌ Not Supported✅ Supported
Custom Voices❌ Not Supported✅ Supported
Emotion Control❌ Not Supported✅ Supported
ExpressivenessHigh Quality & NaturalHighest Quality & Nuanced
Best ForGeneral applicationsBranding, custom experiences
API Parameter"base""advance"
Credit Cost1 credit/token4 credits/token

Specifying a Model in Your API Request

To use a specific model, set the model parameter in your API request:
{
  "text": "Hello, this is a voice from the Base model.",
  "voice": "am_ethan",
  "model": "base"
}

SDK Examples

import audixa

audixa.set_api_key("YOUR_API_KEY")

# Using Base model
audio_url = audixa.tts_and_wait(
    "Hello from the Base model.",
    voice="am_ethan",
    model="base",
)

# Using Advance model with emotion
audio_url = audixa.tts_and_wait(
    "Hello from the Advance model!",
    voice="am_ethan",
    model="advance",
    emotion="happy",
)

Which Model Should I Use?

Choose Base If...

  • You need a wide variety of preset voices
  • Cost efficiency is important
  • Standard narration and content creation
  • Quick implementation without customization

Choose Advance If...

  • You need voice cloning capabilities
  • Creating a unique brand voice
  • Emotional and expressive content
  • Maximum control over output