LEAPERone Docs

Audio Transcription

Transcribe audio files to text.

The rapid model is temporarily unavailable. Please use whisper-1 instead.

The Audio Transcription endpoint converts speech to text. Upload an audio file and choose a model.

Models

ModelPricingBest for
rapid (default)0.006 credits/minFast, general-purpose transcription
whisper-10.006 credits/minHigh accuracy with prompt support

Quick Start

POST /v1/audio/transcriptions
curl -X POST https://api.leaper.one/v1/audio/transcriptions \
  -H "Authorization: Bearer sk-your-api-key" \
  -F file=@meeting.mp3
Response
{
  "text": "Welcome to today's meeting. Let's start with the agenda..."
}

Choosing a Model

rapid (default)

Best for quick transcription without extra configuration. No model parameter needed.

Using rapid
curl -X POST https://api.leaper.one/v1/audio/transcriptions \
  -H "Authorization: Bearer sk-your-api-key" \
  -F file=@meeting.mp3 \
  -F response_format=json

whisper-1

OpenAI's Whisper model. Supports prompt to improve recognition of specific terms and verbose_json for word-level timestamps.

Using whisper-1 with prompt
curl -X POST https://api.leaper.one/v1/audio/transcriptions \
  -H "Authorization: Bearer sk-your-api-key" \
  -F file=@meeting.mp3 \
  -F model=whisper-1 \
  -F response_format=verbose_json \
  -F prompt="LEAPERone, API, transcription"

Supported Formats

FormatExtension
MP3.mp3
MP4.mp4
MPEG.mpeg, .mpga
M4A.m4a
WAV.wav
WebM.webm

Response Formats

Set response_format to control the output:

ValueDescription
textPlain text transcription.
jsonJSON object with a text field (default).
verbose_jsonJSON with timestamps, segments, and metadata.

Billing is based on audio duration. See the API Reference for per-model pricing.

Language is detected automatically. You can specify language to improve accuracy.