Model · OpenAI · Speech-To-Text
GPT-4o Transcribe
OpenAI's hosted speech-to-text model, built on GPT-4o. The API-recommended transcription model, with lower word-error rate and better language recognition than the original Whisper API.
- Modality
- Speech-To-Text
- License
- Proprietary (Proprietary)
- Context window
- 16,000 tokens
- Released
- March 20, 2025
- Last verified
- June 8, 2026
- Runs locally
- No
Strengths
- Lower word-error rate than the legacy whisper-1 API
- The default transcription model in OpenAI's API docs
- Shares the modern GPT-4o API surface
Weaknesses
- Hosted API only — not open weights (use Whisper large-v3 to self-host)
- A newer gpt-4o-mini-transcribe snapshot (2025-12-15) reduces hallucinations further
Try it
| Where | Type | Notes |
|---|---|---|
| OpenAI API | hosted-api | API key required |
Official sources
- Model docs docs
- Audio models update announcement
Change log
- — Initial entry (2026-06-08 catalog refresh, Tier 3). OpenAI's hosted-API STT model; complements the open-weights Whisper entries.
Esc