Model · NVIDIA · Speech-To-Text
NVIDIA Parakeet
NVIDIA's speech-to-text (STT) model family. parakeet-tdt-0.6b-v2 ranks at the top of the Hugging Face Open ASR Leaderboard for English; parakeet-tdt-0.6b-v3 adds multilingual support for 25 languages. Both run very fast on NVIDIA hardware via NeMo.
- Modality: Speech-To-Text
- License: CC BY 4.0 (Open weights)
- Parameter size: 0.6B
- Released: May 1, 2025
- Last verified: June 8, 2026
- Runs locally: Yes
Strengths
- Top-of-leaderboard English accuracy
- Fast inference on NVIDIA GPUs via NeMo
- Permissive CC BY 4.0 license
Weaknesses
- Peak accuracy comes from the English v2 model; multilingual transcription requires the separate v3 model
- Best performance is tied to NVIDIA's NeMo runtime
Try it
| Where | Type | Notes |
|---|---|---|
| Hugging Face (English) | weights | CC BY 4.0 — English flagship |
| Hugging Face (multilingual) | weights | 25-language multilingual |
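
To make the v2/v3 split above concrete, here is a minimal Python sketch of loading either checkpoint through NeMo. It assumes the `nemo_toolkit[asr]` package is installed and that the weights are published under the Hugging Face ids `nvidia/parakeet-tdt-0.6b-v2` and `nvidia/parakeet-tdt-0.6b-v3`; the helper names `pick_model` and `transcribe` are hypothetical and exist only for this illustration. Treat it as a starting point under those assumptions, not as the official usage.

```python
# Assumed Hugging Face repo ids for the two checkpoints named in this entry.
ENGLISH_MODEL = "nvidia/parakeet-tdt-0.6b-v2"
MULTILINGUAL_MODEL = "nvidia/parakeet-tdt-0.6b-v3"


def pick_model(language: str) -> str:
    """Return the English v2 checkpoint for English audio, otherwise v3.

    Hypothetical helper: it does not check whether the language is one of
    the 25 that v3 supports; it only separates English from everything else.
    """
    code = language.strip().lower().replace("_", "-").split("-")[0]
    return ENGLISH_MODEL if code in {"en", "english"} else MULTILINGUAL_MODEL


def transcribe(paths: list[str], language: str = "en") -> list[str]:
    """Transcribe audio files with NeMo; weights are downloaded on first use."""
    # Imported lazily so pick_model() works even without NeMo installed.
    import nemo.collections.asr as nemo_asr

    model = nemo_asr.models.ASRModel.from_pretrained(
        model_name=pick_model(language)
    )
    results = model.transcribe(paths)
    # Recent NeMo releases return hypothesis objects with a .text field;
    # older ones may return plain strings, so handle both.
    return [getattr(r, "text", r) for r in results]
```

Calling `transcribe(["sample.wav"])` would load the English model, while `transcribe(["sample.wav"], language="de")` would load the multilingual one; `sample.wav` is a placeholder for your own audio file. Routing every non-English language to v3 is a simplification, so check the v3 model card for the supported language list before relying on it.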
Official sources
- Model card
Change log
- — Repointed from parakeet-tdt-1.1b to current flagship parakeet-tdt-0.6b-v2 (English, 2025-05-01); added parakeet-tdt-0.6b-v3 (multilingual, 2025-08-14).
- — Initial entry.