Skip to main content
POST
/
v1
/
audio
/
transcriptions
curl https://qingbo.dev/v1/audio/transcriptions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F file="@audio.mp3" \
  -F model="whisper-1"
{
  "text": "欢迎使用 WaveAPI 语音识别服务。"
}
Synchronous endpoint — returns the recognition result directly upon completion.
curl https://qingbo.dev/v1/audio/transcriptions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F file="@audio.mp3" \
  -F model="whisper-1"
{
  "text": "欢迎使用 WaveAPI 语音识别服务。"
}

Available Models

Model IDDescription
whisper-1OpenAI Whisper, supports multi-language recognition

Two Endpoints

Speech-to-Text

POST /v1/audio/transcriptions
Transcribes audio into text in its original language.

Speech Translation

POST /v1/audio/translations
Translates audio into English text. Parameters are the same as the transcription endpoint.

Request Parameters

Uses multipart/form-data format:
file
file
required
Audio file, supports mp3, mp4, mpeg, mpga, m4a, wav, webm formats
model
string
required
Model ID: whisper-1

Response

text
string
Recognized or translated text content