Model · OpenAI · Speech-To-Text
Whisper large-v3
High-accuracy multilingual speech-to-text. Best-in-class for non-English audio; the de-facto open baseline.
- Modality
- Speech-To-Text
- License
- MIT (Open)
- Parameter size
- 1.55B
- Released
- November 6, 2023
- Last verified
- May 10, 2026
- Runs locally
- Yes
Strengths
- Multilingual (99 languages)
- Robust to noise and accents
- Open weights — runs offline
Weaknesses
- Slow on CPU; needs a GPU for real-time
- No speaker diarization built in
- Hallucinates on long silence
Try it
| Where | Type | Notes |
|---|---|---|
| Hugging Face | weights | Open weights, MIT |
| Replicate | hosted | API key required |
| Groq | hosted-api | Free tier |
| whisper.cpp | local | C/C++ port; runs on CPU and Apple Silicon |
Used in solutions
Version history
- Whisper large-v3 Turbo Oct 2024
- Whisper large-v3 Nov 2023 Current
- Whisper large-v2 Dec 2022 Deprecated
Official sources
- Model card model
- Whisper paper paper
- GitHub github
Change log
- — Initial entry.
Esc