Model · Alpha Cephei · Speech-To-Text
Vosk
An offline, lightweight speech recognition toolkit. Runs on phones, Raspberry Pi, and embedded devices — the right choice when Whisper is too heavy.
- Modality
- Speech-To-Text
- License
- Apache 2.0 (Open)
- Released
- January 1, 2020
- Last verified
- May 10, 2026
- Runs locally
- Yes
Strengths
- Runs on truly constrained hardware (mobile, Raspberry Pi)
- Streaming/realtime capable
- Apache 2.0 with mature SDK bindings
Weaknesses
- Accuracy below Whisper on hard audio
- Languages are downloaded as separate models — bigger languages mean bigger downloads
Try it
| Where | Type | Notes |
|---|---|---|
| Vosk models | weights | Apache 2.0 — many sizes available |
| GitHub | local | Bindings for Python, Java, Node, Swift, etc. |
Official sources
- Project site vendor
- GitHub github
Change log
- — Initial entry.
Esc