Create Generation

curl -X POST "https://api.audixa.ai/v3/tts" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "text": "Hello world from Audixa!",
    "voice_id": "am_ethan",
    "model": "base",
    "speed": 1.0,
    "audio_format": "wav"
  }'

{
  "generation_id": "gen_abc123xyz",
  "status": "IN_QUEUE",
  "input_text": "Hello world from Audixa!",
  "voice_id": "am_ethan",
  "voice_name": "Ethan",
  "model": "base",
  "tokens": 6,
  "dollar_cost": 0.000006,
  "method": null,
  "created_at": "2025-02-05T12:00:00Z",
  "started_at": null,
  "completed_at": null
}

POST

tts

curl -X POST "https://api.audixa.ai/v3/tts" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "text": "Hello world from Audixa!",
    "voice_id": "am_ethan",
    "model": "base",
    "speed": 1.0,
    "audio_format": "wav"
  }'

{
  "generation_id": "gen_abc123xyz",
  "status": "IN_QUEUE",
  "input_text": "Hello world from Audixa!",
  "voice_id": "am_ethan",
  "voice_name": "Ethan",
  "model": "base",
  "tokens": 6,
  "dollar_cost": 0.000006,
  "method": null,
  "created_at": "2025-02-05T12:00:00Z",
  "started_at": null,
  "completed_at": null
}

Submits a text-to-speech task and returns a generation_id for tracking. The audio is generated asynchronously—use the Get Generation endpoint to check when it’s ready.

Endpoint

POST https://api.audixa.ai/v3/tts

Request

text

string

required

The text to convert to speech. Maximum 50,000 characters (varies by plan).

voice_id

string

required

The voice ID to use. Get available voices from /voices.

model

string

default:"base"

The model tier to use.

base: Standard high-quality voices. Lowest cost/latency.
advanced: Premium voices. Supports cloning and higher expressiveness.

speed

number

default:"1.0"

Playback speed multiplier. Range: 0.5 to 2.0.

Advanced Model Settings

cfg_weight

number

default:"2.5"

Controls how strictly the model follows the text/style. Range: 1.0 - 5.0.

exaggeration

number

default:"0.5"

Controls emotional fluctuation/expressiveness. Range: 0.0 - 1.0.

audio_format

string

default:"wav"

Output format: wav or mp3.

Response

generation_id

string

Unique identifier for this generation job. Use this to poll /tts.

status

string

Initial status: IN_QUEUE.

input_text

string

The input text that was submitted.

voice_id

string

Voice ID used for generation.

voice_name

string

Human-readable voice name.

model

string

TTS model used (base or advanced).

tokens

integer

Number of tokens consumed.

dollar_cost

number

Cost in USD (set when generation completes).

method

string

Payment method used: API_WALLET or CREDITS_BALANCE.

audio_url

string

URL to download the generated audio (when status=COMPLETED).

error_message

string

Error details if status=FAILED.

created_at

string

ISO 8601 timestamp of creation.

started_at

string

ISO 8601 timestamp when processing started.

completed_at

string

ISO 8601 timestamp when processing completed.

curl -X POST "https://api.audixa.ai/v3/tts" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "text": "Hello world from Audixa!",
    "voice_id": "am_ethan",
    "model": "base",
    "speed": 1.0,
    "audio_format": "wav"
  }'

{
  "generation_id": "gen_abc123xyz",
  "status": "IN_QUEUE",
  "input_text": "Hello world from Audixa!",
  "voice_id": "am_ethan",
  "voice_name": "Ethan",
  "model": "base",
  "tokens": 6,
  "dollar_cost": 0.000006,
  "method": null,
  "created_at": "2025-02-05T12:00:00Z",
  "started_at": null,
  "completed_at": null
}

Error Responses

400 Bad Request

Invalid parameters (e.g. invalid model, voice not found).

402 Payment Required

Insufficient balance.

429 Too Many Requests

Rate limit exceeded.

API Reference Get Generation

Overview

Endpoints

Streaming

Request

Advanced Model Settings

Response

Error Responses

Overview

Endpoints

Streaming

​Request

​Advanced Model Settings

​Response

​Error Responses

Request

Advanced Model Settings

Response

Error Responses