Synchronous endpoint — returns the recognition result directly upon completion.
curl https://qingbo.dev/v1/audio/transcriptions \
-H "Authorization: Bearer YOUR_API_KEY" \
-F file="@audio.mp3" \
-F model="whisper-1"
{
"text": "欢迎使用 WaveAPI 语音识别服务。"
}
Available Models
| Model ID | Description |
|---|---|
| whisper-1 | OpenAI Whisper, supports multi-language recognition |
Two Endpoints
Speech-to-Text
POST /v1/audio/transcriptions
Transcribes audio into text in its original language.
Speech Translation
POST /v1/audio/translations
Translates audio into English text. Parameters are the same as the transcription endpoint.
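Because the two endpoints take the same parameters, a client only has to switch the URL path. The following is a minimal sketch of that idea, not an official client. The helper name endpoint_url and the task labels are hypothetical, and it assumes the translation endpoint is served from the same host as the transcription URL shown in the curl example.

```python
# Host taken from the curl example; assumed to serve both endpoints.
BASE_URL = "https://qingbo.dev"

# Paths as listed in this section. The task labels are illustrative only.
ENDPOINTS = {
    "transcribe": "/v1/audio/transcriptions",  # text in the original language
    "translate": "/v1/audio/translations",     # English text
}


def endpoint_url(task: str, base_url: str = BASE_URL) -> str:
    """Return the full URL for a task label ('transcribe' or 'translate')."""
    if task not in ENDPOINTS:
        raise ValueError(f"unknown task: {task!r}")
    return base_url.rstrip("/") + ENDPOINTS[task]
```

Keeping the paths in one mapping means the rest of the request code stays identical for both tasks.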
Request Parameters
Requests use multipart/form-data encoding:
file: Audio file. Supported formats: mp3, mp4, mpeg, mpga, m4a, wav, webm.
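For readers not using curl, the sketch below shows one way to assemble the same multipart/form-data request with Python's standard library. It only builds the request object and does not send it. The function name build_request is hypothetical, the field names file and model are taken from the curl example, and the extension check is a client-side convenience that assumes the format can be judged from the file name.

```python
import os
import uuid
import urllib.request

# Formats listed in this section.
SUPPORTED_FORMATS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}


def build_request(url, api_key, filename, audio_bytes, model="whisper-1"):
    """Build (but do not send) a multipart/form-data POST request."""
    name = os.path.basename(filename)
    ext = os.path.splitext(name)[1].lstrip(".").lower()
    if ext not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported audio format: {ext!r}")

    # A random boundary separates the form fields in the body.
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")

    return urllib.request.Request(
        url,
        data=head + audio_bytes + tail,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the returned object to urllib.request.urlopen would perform the upload; that step is left out here so the sketch can be run without network access or a real API key.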
Response
text: The recognized or translated text.
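Reading the result out of the response can be sketched as follows. This assumes the body is a JSON object with a text field, as in the example response near the top of this section; the helper name extract_text is hypothetical, and error responses are not covered because their shape is not documented here.

```python
import json


def extract_text(response_body):
    """Return the 'text' field from a JSON response body (str or bytes)."""
    payload = json.loads(response_body)
    if not isinstance(payload, dict) or not isinstance(payload.get("text"), str):
        raise ValueError("response does not contain a string 'text' field")
    return payload["text"]
```

Raising on a missing field, rather than returning an empty string, keeps a malformed response from being mistaken for silent audio.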