Cyberax AI Playbook
cyberax.com
Model · Alpha Cephei · Speech-To-Text

Vosk

An offline, lightweight speech recognition toolkit. Runs on phones, Raspberry Pi, and embedded devices — the right choice when Whisper is too heavy.

Modality
Speech-To-Text
License
Apache 2.0 (Open)
Released
January 1, 2020
Last verified
May 10, 2026
Runs locally
Yes

Strengths

  • Runs on truly constrained hardware (mobile, Raspberry Pi)
  • Streaming/realtime capable
  • Apache 2.0 with mature SDK bindings

Weaknesses

  • Accuracy below Whisper on hard audio
  • Languages are downloaded as separate models — bigger languages mean bigger downloads

Try it

WhereTypeNotes
Vosk models weights Apache 2.0 — many sizes available
GitHub local Bindings for Python, Java, Node, Swift, etc.

Change log

  • — Initial entry.