Vosk is an offline, open source speech recognition toolkit. It supports speech recognition for 20+ languages and dialects: English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech, and Polish, with more to come. Vosk models are small (50 MB) yet provide continuous large-vocabulary transcription, low-latency responses through a streaming API, a reconfigurable vocabulary, and speaker identification.

Bindings are available for many programming languages, including Python, Java, Node.js, C#, C++, Rust, and Go. Vosk provides speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies and transcripts of lectures and interviews. Vosk scales from small devices such as a Raspberry Pi or an Android smartphone to large clusters.

Features

  • Supports 20+ languages and dialects
  • Works offline, even on lightweight devices
  • Installs with a simple pip3 install vosk
  • Portable per-language models are only 50 MB each
  • Provides streaming API for the best user experience
  • Allows quick reconfiguration of vocabulary for best accuracy
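
The streaming API and vocabulary reconfiguration listed above can be sketched in Python roughly as follows. This is a minimal sketch, not the project's reference example: the helper names build_grammar and transcribe_wav, the chunk size, and the file paths are assumptions made for illustration. It also assumes a 16-bit mono PCM WAV file and a model directory that has already been downloaded and unpacked. Restricting the vocabulary with a phrase list may only work with models that support it.

```python
import json
import wave


def build_grammar(phrases):
    """Return a JSON list of allowed phrases plus "[unk]" for everything else.

    Hypothetical helper: the recognizer accepts such a JSON string as an
    optional third argument to restrict its vocabulary.
    """
    return json.dumps(list(phrases) + ["[unk]"])


def transcribe_wav(wav_path, model_path, phrases=None, chunk_frames=4000):
    """Feed a WAV file to the recognizer chunk by chunk and return the text.

    Hypothetical helper; wav_path and model_path are supplied by the caller.
    """
    # Imported here so build_grammar stays usable without vosk installed.
    from vosk import Model, KaldiRecognizer

    with wave.open(wav_path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM WAV audio")

        model = Model(model_path)
        if phrases:
            rec = KaldiRecognizer(model, wf.getframerate(), build_grammar(phrases))
        else:
            rec = KaldiRecognizer(model, wf.getframerate())

        pieces = []
        while True:
            data = wf.readframes(chunk_frames)
            if not data:
                break
            # AcceptWaveform returns True when an utterance has been completed.
            if rec.AcceptWaveform(data):
                pieces.append(json.loads(rec.Result()).get("text", ""))
        # Flush whatever audio is still buffered in the recognizer.
        pieces.append(json.loads(rec.FinalResult()).get("text", ""))

    return " ".join(p for p in pieces if p)
```

A call such as transcribe_wav("test.wav", "model", phrases=["yes", "no"]) would then limit recognition to those two words, where both paths are placeholders. For live feedback while audio is still arriving, the recognizer's PartialResult() method can be polled between chunks.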

License

Apache License V2.0

Additional Project Details

Operating Systems

Android, Apple iPhone

Programming Language

C++

Related Categories

C++ Machine Learning Software, C++ Speech Recognition Software, C++ Raspberry Pi Software

Registered

2022-08-03