Model · OpenAI · Speech-To-Text

GPT-4o Transcribe

OpenAI's hosted speech-to-text model, built on GPT-4o. The API-recommended transcription model, with lower word-error rate and better language recognition than the original Whisper API.

Modality: Speech-To-Text
License: Proprietary (Proprietary)
Context window: 16,000 tokens
Released: March 20, 2025
Last verified: June 8, 2026
Runs locally: No

Strengths

Lower word-error rate than the legacy whisper-1 API
The default transcription model in OpenAI's API docs
Shares the modern GPT-4o API surface

Weaknesses

Hosted API only — not open weights (use Whisper large-v3 to self-host)
A newer gpt-4o-mini-transcribe snapshot (2025-12-15) reduces hallucinations further

Try it

Where	Type	Notes
OpenAI API	hosted-api	API key required

Official sources

Model docs docs
Audio models update announcement

Change log

Jun 8, 2026 — Initial entry (2026-06-08 catalog refresh, Tier 3). OpenAI's hosted-API STT model; complements the open-weights Whisper entries.

Strengths

Weaknesses

Try it

Official sources

Change log

Related models

NVIDIA Parakeet

Whisper large-v3 Turbo

Whisper large-v3

Distil-Whisper

Vosk