Cyberax AI Playbook
cyberax.com
Model · NVIDIA · Speech-To-Text

NVIDIA Parakeet

NVIDIA's STT family. parakeet-tdt-0.6b-v2 tops the HF Open ASR leaderboard for English; parakeet-tdt-0.6b-v3 adds 25-language multilingual support. Very fast on NVIDIA hardware via NeMo.

Modality
Speech-To-Text
License
CC BY 4.0 (Open weights)
Parameter size
0.6B
Released
May 1, 2025
Last verified
June 8, 2026
Runs locally
Yes

Strengths

  • Top-of-leaderboard English accuracy
  • Fast inference on NVIDIA GPUs via NeMo
  • Permissive CC BY 4.0 license

Weaknesses

  • Peak accuracy is the English v2 model; multilingual needs the separate v3 model
  • Tightest performance is in NVIDIA's NeMo runtime

Try it

WhereTypeNotes
Hugging Face (English) weights CC BY 4.0 — English flagship
Hugging Face (multilingual) weights 25-language multilingual

Change log

  • — Repointed from parakeet-tdt-1.1b to current flagship parakeet-tdt-0.6b-v2 (English, 2025-05-01); added parakeet-tdt-0.6b-v3 (multilingual, 2025-08-14).
  • — Initial entry.