Cyberax AI Playbook
cyberax.com
Model · OpenAI · Speech-To-Text

GPT-4o Transcribe

OpenAI's hosted speech-to-text model, built on GPT-4o. The API-recommended transcription model, with lower word-error rate and better language recognition than the original Whisper API.

Modality
Speech-To-Text
License
Proprietary (Proprietary)
Context window
16,000 tokens
Released
March 20, 2025
Last verified
June 8, 2026
Runs locally
No

Strengths

  • Lower word-error rate than the legacy whisper-1 API
  • The default transcription model in OpenAI's API docs
  • Shares the modern GPT-4o API surface

Weaknesses

  • Hosted API only — not open weights (use Whisper large-v3 to self-host)
  • A newer gpt-4o-mini-transcribe snapshot (2025-12-15) reduces hallucinations further

Try it

WhereTypeNotes
OpenAI API hosted-api API key required

Change log

  • — Initial entry (2026-06-08 catalog refresh, Tier 3). OpenAI's hosted-API STT model; complements the open-weights Whisper entries.