Vosk
Latest revision as of 20:31, 3 December 2021
Vosk (https://github.com/alphacep/vosk-api) is an open-source speech recognition toolkit by Alpha Cephei (https://alphacephei.com/vosk/). Key features are:
- Supports 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish. More to come.
- Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
- Installs with a single command: pip3 install vosk
- Portable per-language models are only about 50 MB each; much larger server models are also available.
- Provides a streaming API for the best user experience (unlike popular speech-recognition Python packages).
- Offers bindings for other programming languages as well, such as Java, C#, and JavaScript.
- Allows quick reconfiguration of vocabulary for best accuracy.
- Supports speaker identification in addition to plain speech recognition.
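The streaming API and vocabulary reconfiguration mentioned above can be illustrated with a minimal Python sketch. It is not taken from this page: the helper names stream_transcribe and make_recognizer, the chunk size, and the model directory argument are assumptions made for illustration. The Vosk calls used (Model, KaldiRecognizer, AcceptWaveform, PartialResult, Result, FinalResult) are from the Python bindings, and the input is assumed to be a mono 16-bit PCM WAV file.

```python
import json
import wave


def stream_transcribe(recognizer, wav_path, chunk_frames=4000):
    """Feed a WAV file to a Vosk-style recognizer chunk by chunk.

    Yields ("partial", text) while an utterance is still in progress and
    ("final", text) whenever the recognizer reports a finished utterance.
    """
    with wave.open(wav_path, "rb") as wf:
        while True:
            data = wf.readframes(chunk_frames)
            if not data:
                break
            if recognizer.AcceptWaveform(data):
                # An utterance boundary was detected: Result() holds its text.
                yield "final", json.loads(recognizer.Result()).get("text", "")
            else:
                # Still mid-utterance: PartialResult() holds the current guess.
                yield "partial", json.loads(recognizer.PartialResult()).get("partial", "")
    # Flush whatever audio is still buffered once the file ends.
    yield "final", json.loads(recognizer.FinalResult()).get("text", "")


def make_recognizer(model_dir, wav_path, phrases=None):
    """Build a recognizer for the sample rate of wav_path (hypothetical helper).

    model_dir is assumed to be an unpacked Vosk model directory. If phrases
    is given, recognition is restricted to that vocabulary, which may not be
    supported by every model.
    """
    # Imported lazily so stream_transcribe can be used without Vosk installed.
    from vosk import Model, KaldiRecognizer

    with wave.open(wav_path, "rb") as wf:
        rate = wf.getframerate()
    model = Model(model_dir)
    if phrases:
        # The grammar is a JSON list of phrases; "[unk]" absorbs other words.
        grammar = json.dumps(list(phrases) + ["[unk]"])
        return KaldiRecognizer(model, rate, grammar)
    return KaldiRecognizer(model, rate)


# Possible usage, assuming a model unpacked in ./model and a file speech.wav:
#   rec = make_recognizer("model", "speech.wav", phrases=["yes", "no"])
#   for kind, text in stream_transcribe(rec, "speech.wav"):
#       print(kind, text)
```

Because results are yielded as soon as each chunk has been processed, a caller can display partial text while audio is still arriving instead of waiting for the whole file.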
Related projects
- voskjs (https://github.com/solyarisoftware/voskjs): Vosk ASR offline engine API for Node.js developers, with a simple HTTP ASR server.