Overview

The Transcript API converts audio into text. Provide an audio URL or upload a file, and receive a full transcription.

Typical Flow

  1. Submit a transcription request with an audio URL or file upload
  2. Receive a request ID to track progress
  3. Poll for results until the transcription completes
  4. Retrieve the transcribed text
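
The flow above can be sketched as a submit-then-poll loop. This is a minimal sketch under stated assumptions, not the API's documented client: the `submit` and `fetch` callables stand in for the real HTTP calls, and the `status` and `text` fields, along with the `completed` and `failed` status values, are hypothetical names chosen for illustration.

```python
import time
from typing import Callable, Dict


def transcribe(
    submit: Callable[[], str],
    fetch: Callable[[str], Dict[str, str]],
    interval: float = 2.0,
    timeout: float = 300.0,
) -> str:
    """Submit a job, then poll until it finishes or the timeout elapses.

    `submit` returns a request ID; `fetch` returns the job state for that ID.
    Both are stand-ins for real HTTP calls, and the "status"/"text" field
    names are assumptions, not documented response fields.
    """
    request_id = submit()                  # steps 1-2: submit, receive a request ID
    deadline = time.monotonic() + timeout
    while True:
        result = fetch(request_id)         # step 3: poll for results
        status = result.get("status")
        if status == "completed":
            return result["text"]          # step 4: retrieve the transcribed text
        if status == "failed":
            raise RuntimeError(f"transcription {request_id} failed")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"transcription {request_id} still pending")
        time.sleep(interval)
```

Passing the two network operations in as callables keeps the loop independent of any particular HTTP library, and the explicit timeout prevents polling forever if a job never completes.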

Features

  • URL or file upload: Provide a link to hosted audio, or upload a file directly (up to 100MB)
  • Asynchronous processing: Transcription runs in the background; poll for results
  • Language support: Specify a language code or enable automatic language detection
  • Filler words: Optionally include filler words (um, uh) in the output
  • Keyword boosting: Improve accuracy for domain-specific terms
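
To show how these options might combine in a single request, here is a rough sketch of a request-body builder. Every key name in it (`audio_url`, `language_code`, `detect_language`, `filler_words`, `keywords`) is a hypothetical placeholder; the actual parameter names are whatever the endpoint reference defines.

```python
from typing import Any, Dict, List, Optional


def build_request(
    audio_url: str,
    language_code: Optional[str] = None,
    include_filler_words: bool = False,
    keywords: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Assemble a request body. All key names are illustrative assumptions."""
    body: Dict[str, Any] = {"audio_url": audio_url}
    if language_code is not None:
        body["language_code"] = language_code   # explicit language
    else:
        body["detect_language"] = True          # fall back to automatic detection
    if include_filler_words:
        body["filler_words"] = True             # keep "um", "uh" in the output
    if keywords:
        body["keywords"] = list(keywords)       # domain-specific terms to boost
    return body
```

The sketch treats an explicit language code and automatic detection as alternatives, following the "specify a language code or enable automatic language detection" wording above.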

Important Considerations

  • Supported formats: mp3, wav, ogg, flac, m4a, mp4, webm
  • File size limit: 100MB for direct uploads
  • Rate limits: Requests are rate-limited per API key
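
The format and size constraints can be checked on the client before uploading. The following is a minimal sketch, not part of the API: `check_upload` is a hypothetical helper, it judges format by file extension only (the service may inspect the actual content), and it assumes "100MB" means 100,000,000 bytes, the stricter of the two common readings.

```python
import os
from typing import List

SUPPORTED_EXTENSIONS = {"mp3", "wav", "ogg", "flac", "m4a", "mp4", "webm"}
MAX_UPLOAD_BYTES = 100 * 1000 * 1000  # assumption: decimal megabytes


def check_upload(filename: str, size_bytes: int) -> List[str]:
    """Return a list of problems; an empty list means the file passes both checks."""
    problems: List[str] = []
    ext = os.path.splitext(filename)[1].lstrip(".").lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'none'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append(
            f"file is {size_bytes} bytes, over the {MAX_UPLOAD_BYTES}-byte limit"
        )
    return problems
```

For a file on disk, the size can be obtained with `os.path.getsize(path)`. A file that exceeds the upload limit could still be submitted by URL if it is hosted elsewhere, since the limit above is stated for direct uploads.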