Cyberax AI Playbook
cyberax.com
Model · OpenAI · Speech-To-Text

Whisper large-v3

High-accuracy multilingual speech-to-text. Best-in-class for non-English audio; the de-facto open baseline.

Modality
Speech-To-Text
License
MIT (Open)
Parameter size
1.55B
Released
November 6, 2023
Last verified
May 10, 2026
Runs locally
Yes

Strengths

  • Multilingual (99 languages)
  • Robust to noise and accents
  • Open weights — runs offline

Weaknesses

  • Slow on CPU; needs a GPU for real-time
  • No speaker diarization built in
  • Hallucinates on long silence

Try it

WhereTypeNotes
Hugging Face weights Open weights, MIT
Replicate hosted API key required
Groq hosted-api Free tier
whisper.cpp local C/C++ port; runs on CPU and Apple Silicon

Used in solutions

Version history

  1. Whisper large-v3 Turbo Oct 2024
  2. Whisper large-v3 Nov 2023 Current
  3. Whisper large-v2 Dec 2022 Deprecated

Change log

  • — Initial entry.