Skip to main content

Overview

The Transcript API converts audio into text. Provide an audio URL or upload a file, and receive a full transcription.

Typical Flow

Submit a transcription request with an audio URL or file upload
Receive a request ID to track progress
Poll for results until the transcription completes
Retrieve the transcribed text

Features

URL or file upload: Provide a link to hosted audio, or upload a file directly (up to 100MB)
Asynchronous processing: Transcription runs in the background; poll for results
Language support: Specify a language code or enable automatic language detection
Filler words: Optionally include filler words (um, uh) in the output
Keyword boosting: Improve accuracy for domain-specific terms

Important Considerations

Supported formats: mp3, wav, ogg, flac, m4a, mp4, webm
File size limit: 100MB for direct uploads
Rate limits: Requests are rate-limited per API key

Typical Flow
Features