Skip to main content
POST
/
v3
/
tts
curl -X POST "https://api.audixa.ai/v3/tts" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "text": "Hello world from Audixa!",
    "voice_id": "am_ethan",
    "model": "base",
    "speed": 1.0,
    "audio_format": "wav"
  }'
{
  "generation_id": "gen_abc123xyz",
  "status": "IN_QUEUE",
  "input_text": "Hello world from Audixa!",
  "voice_id": "am_ethan",
  "voice_name": "Ethan",
  "model": "base",
  "tokens": 6,
  "dollar_cost": 0.000006,
  "method": null,
  "created_at": "2025-02-05T12:00:00Z",
  "started_at": null,
  "completed_at": null
}
Submits a text-to-speech task and returns a generation_id for tracking. The audio is generated asynchronously—use the Get Generation endpoint to check when it’s ready.
Endpoint
POST https://api.audixa.ai/v3/tts

Request

text
string
required
The text to convert to speech. Maximum 50,000 characters (varies by plan).
voice_id
string
required
The voice ID to use. Get available voices from /voices.
model
string
default:"base"
The model tier to use.
  • base: Standard high-quality voices. Lowest cost/latency.
  • advanced: Premium voices. Supports cloning and higher expressiveness.
speed
number
default:"1.0"
Playback speed multiplier. Range: 0.5 to 2.0.

Advanced Model Settings

cfg_weight
number
default:"2.5"
Controls how strictly the model follows the text/style. Range: 1.0 - 5.0.
exaggeration
number
default:"0.5"
Controls emotional fluctuation/expressiveness. Range: 0.0 - 1.0.
audio_format
string
default:"wav"
Output format: wav or mp3.

Response

generation_id
string
Unique identifier for this generation job. Use this to poll /tts.
status
string
Initial status: IN_QUEUE.
input_text
string
The input text that was submitted.
voice_id
string
Voice ID used for generation.
voice_name
string
Human-readable voice name.
model
string
TTS model used (base or advanced).
tokens
integer
Number of tokens consumed.
dollar_cost
number
Cost in USD (set when generation completes).
method
string
Payment method used: API_WALLET or CREDITS_BALANCE.
audio_url
string
URL to download the generated audio (when status=COMPLETED).
error_message
string
Error details if status=FAILED.
created_at
string
ISO 8601 timestamp of creation.
started_at
string
ISO 8601 timestamp when processing started.
completed_at
string
ISO 8601 timestamp when processing completed.
curl -X POST "https://api.audixa.ai/v3/tts" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "text": "Hello world from Audixa!",
    "voice_id": "am_ethan",
    "model": "base",
    "speed": 1.0,
    "audio_format": "wav"
  }'
{
  "generation_id": "gen_abc123xyz",
  "status": "IN_QUEUE",
  "input_text": "Hello world from Audixa!",
  "voice_id": "am_ethan",
  "voice_name": "Ethan",
  "model": "base",
  "tokens": 6,
  "dollar_cost": 0.000006,
  "method": null,
  "created_at": "2025-02-05T12:00:00Z",
  "started_at": null,
  "completed_at": null
}

Error Responses

Invalid parameters (e.g. invalid model, voice not found).
Insufficient balance.
Rate limit exceeded.