Submits a text-to-speech task and returns a generation_id for tracking. The audio is generated asynchronously—use the Get Generation endpoint to check when it’s ready.
POST https://api.audixa.ai/v3/tts
Request
The text to convert to speech. Maximum 50,000 characters (varies by plan).
The voice ID to use. Get available voices from /voices .
The model tier to use.
base: Standard high-quality voices. Lowest cost/latency.
advanced: Premium voices. Supports cloning and higher expressiveness.
Playback speed multiplier. Range: 0.5 to 2.0.
Advanced Model Settings
Controls how strictly the model follows the text/style. Range: 1.0 - 5.0.
Controls emotional fluctuation/expressiveness. Range: 0.0 - 1.0.
Output format: wav or mp3.
Response
Unique identifier for this generation job. Use this to poll /tts .
Initial status: IN_QUEUE.
The input text that was submitted.
Voice ID used for generation.
Human-readable voice name.
TTS model used (base or advanced).
Number of tokens consumed.
Cost in USD (set when generation completes).
Payment method used: API_WALLET or CREDITS_BALANCE.
URL to download the generated audio (when status=COMPLETED).
Error details if status=FAILED.
ISO 8601 timestamp of creation.
ISO 8601 timestamp when processing started.
ISO 8601 timestamp when processing completed.
cURL
Python
TypeScript
JavaScript
curl -X POST "https://api.audixa.ai/v3/tts" \
-H "Content-Type: application/json" \
-H "x-api-key: YOUR_API_KEY" \
-d '{
"text": "Hello world from Audixa!",
"voice_id": "am_ethan",
"model": "base",
"speed": 1.0,
"audio_format": "wav"
}'
{
"generation_id" : "gen_abc123xyz" ,
"status" : "IN_QUEUE" ,
"input_text" : "Hello world from Audixa!" ,
"voice_id" : "am_ethan" ,
"voice_name" : "Ethan" ,
"model" : "base" ,
"tokens" : 6 ,
"dollar_cost" : 0.000006 ,
"method" : null ,
"created_at" : "2025-02-05T12:00:00Z" ,
"started_at" : null ,
"completed_at" : null
}
Error Responses
Invalid parameters (e.g. invalid model, voice not found).
Insufficient balance.
Rate limit exceeded.