# Gladia

Gladia provides state-of-the-art audio transcription and intelligence services through a simple API, enabling real-time and asynchronous transcription, translation, and audio analysis.

- **Category:** transcription
- **Auth:** API_KEY
- **Composio Managed App Available?** N/A
- **Tools:** 10
- **Triggers:** 0
- **Slug:** `GLADIA`
- **Version:** 20260312_00

## Tools

### Delete Live Session

**Slug:** `GLADIA_DELETE_LIVE_SESSION`

Tool to delete a live transcription session and all its associated data (audio file, transcription). Use when permanently removing a live session from Gladia.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | UUID of the live transcription session to delete |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Delete Pre-recorded Job

**Slug:** `GLADIA_DELETE_PRE_RECORDED_JOB`

Tool to delete a pre-recorded transcription job and all its associated data (audio file, transcription). Use when permanently removing a transcription job from Gladia.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | UUID of the pre-recorded transcription job to delete |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get live transcription result

**Slug:** `GLADIA_GET_LIVE_TRANSCRIPTION_RESULT`

Tool to retrieve metadata and results of a live transcription job. Use when you need detailed status or results for a specific live transcription session.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | Unique identifier of the live transcription job |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get Pre-recorded Job

**Slug:** `GLADIA_GET_PRE_RECORDED_JOB`

Tool to retrieve a pre-recorded transcription job's metadata, status, and results by ID. Use when checking the status or retrieving results of a specific pre-recorded job.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | Id of the pre recorded job |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get Transcription Audio File (Deprecated)

**Slug:** `GLADIA_GET_TRANSCRIPTION_AUDIO_FILE_DEPRECATED`

DEPRECATED - use GET /v2/pre-recorded/{id}/file instead. Download the audio file used for a transcription job. Use when you need to retrieve the original audio file associated with a specific transcription job ID.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | Id of the transcription job |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Initiate Live Transcription Session

**Slug:** `GLADIA_INITIATE_LIVE_SESSION`

Initiates a live (real-time) transcription session with Gladia's API. Returns a WebSocket URL that you can connect to for streaming audio data in real-time. The WebSocket URL includes an embedded authentication token and should be used promptly. After initiating, connect to the returned WebSocket URL and send audio chunks. Use GET /v2/live/{id} to retrieve the full transcription results after the session ends.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `model` | string ("solaria-1" | "solaria-2") | No | Transcription model to use. 'solaria-1' is the default model. 'solaria-2' may require a higher subscription plan. |
| `region` | string ("us-west" | "eu-west") | No | Processing region for the session. Use 'us-west' for US West or 'eu-west' for EU West. If not specified, the default region is used. |
| `callback` | boolean | No | Enable callback notifications. When true, configure callback_config to specify the callback URL. |
| `channels` | integer | No | Number of audio channels. Use 1 for mono (most speech), 2 for stereo. Maximum 8 channels supported. |
| `encoding` | string ("wav/pcm" | "wav/alaw" | "wav/ulaw") | No | Audio encoding format for the stream. Use 'wav/pcm' for raw PCM, 'wav/alaw' for A-law, or 'wav/ulaw' for µ-law encoding. |
| `bit_depth` | integer ("8" | "16" | "24" | "32") | No | Audio sample bit depth. Common values: 8-bit for telephony, 16-bit for standard audio, 24/32-bit for high-quality audio. |
| `endpointing` | number | No | Duration of silence in seconds that triggers end of utterance. Lower values (0.01-0.1) for responsive transcription, higher values (0.5-2.0) for longer pauses. |
| `sample_rate` | integer ("8000" | "16000" | "32000" | "44100" | "48000") | No | Audio sample rate in Hz. Use 8000 for telephony, 16000 for speech, 44100/48000 for high-fidelity audio. |
| `pre_processing` | object | No | Pre-processing settings for audio enhancement and speech activity detection thresholds. |
| `callback_config` | object | No | Callback delivery configuration. Required if callback is true. Specify callback URL and which events to send. |
| `custom_metadata` | object | No | Custom key/value metadata to attach to the session for tracking or identification purposes. |
| `language_config` | object | No | Language configuration object. Example: {'languages': ['en']} for English, or enable code-switching for multilingual audio. |
| `messages_config` | object | No | Configuration for which message types to receive over WebSocket (transcripts, partial results, etc.). |
| `post_processing` | object | No | Post-processing settings for summarization and chapterization of the transcript after the session ends. |
| `realtime_processing` | object | No | Realtime processing options including custom vocabulary, translation, named entity recognition (NER), and sentiment analysis. |
| `maximum_duration_without_endpointing` | number | No | Maximum duration in seconds before forcing an utterance end, even without detecting silence. Range: 5-60 seconds. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Initiate Pre-Recorded Transcription

**Slug:** `GLADIA_INITIATE_PRE_RECORDED_TRANSCRIPTION`

Tool to initiate a pre-recorded transcription job. Use when you have an audio URL and need asynchronous transcription results.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `model` | string ("solaria-1" | "solaria-2") | No | Transcription model. 'solaria-1' (default) or 'solaria-2' (may require higher tier plan). |
| `callback` | boolean | No | Enable webhook callback when transcription completes. |
| `audio_url` | string | Yes | URL to an audio or video file. Can be a public URL or a Gladia file URL from the upload endpoint (format: https://api.gladia.io/file/<id>). |
| `sentences` | boolean | No | Enable sentence segmentation for better readability. |
| `subtitles` | boolean | No | Enable subtitle generation. Set to true to generate SRT/VTT files. |
| `diarization` | boolean | No | Enable speaker diarization to identify who said what. Set to true to detect speakers. |
| `translation` | boolean | No | Enable translation of transcription. Set to true to translate to other languages. |
| `summarization` | boolean | No | Enable summarization of transcription. Set to true to get a summary. |
| `callback_config` | object | No | Configuration for webhook callback when transcription completes. |
| `custom_metadata` | object | No | Arbitrary metadata to attach to the job for tracking and organization. |
| `language_config` | object | No | Configuration for preferred language handling. |
| `subtitles_config` | object | No | Configuration for subtitle generation. |
| `diarization_config` | object | No | Configuration for speaker diarization. |
| `translation_config` | object | No | Configuration for translation of transcription. |
| `punctuation_enhanced` | boolean | No | Enable enhanced punctuation and casing for cleaner transcripts. |
| `summarization_config` | object | No | Configuration for summarization of transcription. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### List live transcription jobs

**Slug:** `GLADIA_LIST_LIVE_TRANSCRIPTIONS`

Tool to list live transcription jobs. Use when you need an overview of live transcription sessions with optional filtering and pagination. Use after setting up live transcription.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `date` | string | No | Filter items relevant to a specific date in ISO format (YYYY-MM-DD). |
| `limit` | integer | No | Max number of items to return. |
| `offset` | integer | No | The starting point for pagination; 0 starts from the first item. |
| `status` | array | No | Filter by item status; accepts multiple values. Valid statuses: 'queued', 'processing', 'done', 'error'. |
| `after_date` | string | No | Filter for items after the specified ISO date-time (use with before_date for a range). |
| `before_date` | string | No | Include items that occurred before the specified ISO date-time. |
| `custom_metadata` | object | No | Filter by custom metadata object (e.g., {'user': 'John Doe'}). |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### List Pre-Recorded Jobs

**Slug:** `GLADIA_LIST_PRE_RECORDED_JOBS`

Tool to list all pre-recorded transcription jobs matching the query parameters. Use when you need to retrieve a paginated list of transcription jobs with optional filtering by date, status, or custom metadata.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `date` | string | No | Filter items relevant to a specific date in ISO format (YYYY-MM-DD). |
| `limit` | integer | No | The maximum number of items to return. Useful for pagination and controlling data payload size. |
| `offset` | integer | No | The starting point for pagination. A value of 0 starts from the first item. |
| `status` | array | No | Filter the list based on item status. Accepts multiple values from the predefined list. |
| `after_date` | string | No | Filter for items after the specified date. Use with `before_date` for a range. Date in ISO format. |
| `before_date` | string | No | Include items that occurred before the specified date in ISO format. |
| `custom_metadata` | object | No | Custom metadata filter to match jobs, e.g., {"user": "John Doe"}. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Upload Audio/Video File

**Slug:** `GLADIA_UPLOAD_AUDIO_VIDEO_FILE`

Upload an audio or video file to Gladia for speech-to-text transcription. Returns a URL that can be used with the pre-recorded transcription endpoint. Supports common audio formats (WAV, MP3, FLAC, OGG) and video formats (MP4, MKV, etc.).

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `file_path` | string | Yes | Path to the audio or video file to upload. Supports common formats like WAV, MP3, FLAC, MP4, etc. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |