DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Features

Uses a model trained by machine learning techniques
Based on Baidu's Deep Speech research paper
Uses Google's TensorFlow to make the implementation easier
A pre-trained English model is available for use
Download important inference material from the DeepSpeech releases page
Run in real time on all devices

Project Samples

Project Activity

See All Activity >

License

Mozilla Public License 2.0 (MPL 2.0)

Follow DeepSpeech

DeepSpeech Web Site

Other Useful Business Software

Gemini 3 and 200+ AI Models on One Platform

Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free

Rate This Project

User Reviews

Be the first to post a review of DeepSpeech!

Additional Project Details

Programming Language

C++, Python

Related Categories

Python Word Processors, Python Speech Software, Python Speech to Text Software, Python Raspberry Pi Software, C++ Word Processors, C++ Speech Software, C++ Speech to Text Software, C++ Raspberry Pi Software

Registered

2021-03-16

Similar Business Software

Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
ONLYOFFICE Docs

ONLYOFFICE is an open-source project that offers cloud-based and self-hosted solutions for business of all sizes. The key product is ONLYOFFICE Docs, a secure office suite that seamlessly integrates into the most popular platforms, e.g. Odoo, Alfresco, Confluence, Pipedrive, Redmine, SuiteCRM...

See Software
talvala surveillance

Talvala is a speech analytics company. We use Baidu’s Deep Speech technology and machine learning for compliance surveillance and human/machine interfaces. We develop speech-based monitoring applications and human machine interfaces (“HMI”) for a wide variety of clients. We believe that the...

See Software
Comet Backup

Start running backups and restores in less than 15 minutes! Fast, secure backup software for businesses and IT providers. Comet is a flexible, all-in-one backup platform available in 13 languages. You choose your backup destination, server location, configuration and setup. Backup to your...

See Software
Kasm Workspaces

Kasm Workspaces streams your workplace environment directly to your web browser…on any device and from any location. Kasm uses our high-performance streaming and secure isolation technology to provide web-native Desktop as a Service (DaaS), application streaming, and secure/private web...

See Software
Wave Browser

Wave Browser is an efficient, eco-conscious browser that creates a cleaner, more organized, and more meaningful online experience while helping remove ocean plastic through its partnership with 4ocean. Built on the trusted Chromium foundation, Wave Browser brings essential tools directly into...

See Software

Report inappropriate content

DeepSpeech

Open source embedded speech-to-text engine

Get an email when there's a new version of DeepSpeech

Features

Project Samples

Project Activity

Categories

License

Follow DeepSpeech

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered