DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Features

Uses a model trained by machine learning techniques
Based on Baidu's Deep Speech research paper
Uses Google's TensorFlow to make the implementation easier
A pre-trained English model is available for use
Download important inference material from the DeepSpeech releases page
Run in real time on all devices

Project Samples

Project Activity

See All Activity >

License

Mozilla Public License 2.0 (MPL 2.0)

Follow DeepSpeech

DeepSpeech Web Site

Other Useful Business Software

$300 Free Credits to Build on Google Cloud

New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free

Rate This Project

User Reviews

Be the first to post a review of DeepSpeech!

Additional Project Details

Programming Language

C++, Python

Related Categories

Python Word Processors, Python Speech Software, Python Speech to Text Software, Python Raspberry Pi Software, C++ Word Processors, C++ Speech Software, C++ Speech to Text Software, C++ Raspberry Pi Software

Registered

2021-03-16

Similar Business Software

Google Cloud Speech-to-Text

Google Cloud’s Speech API processes more than 1 billion voice minutes per month with close to human levels of understanding for many commonly spoken languages. Powered by the best of Google's AI research and technology, Google Cloud's Speech-to-Text API helps you accurately transcribe speech...

See Software
ONLYOFFICE Docs

ONLYOFFICE is an open-source project that offers cloud-based and self-hosted solutions for business of all sizes. The key product is ONLYOFFICE Docs, a secure office suite that seamlessly integrates into the most popular platforms, e.g. Odoo, Alfresco, Confluence, Pipedrive, Redmine, SuiteCRM...

See Software
talvala surveillance

Talvala is a speech analytics company. We use Baidu’s Deep Speech technology and machine learning for compliance surveillance and human/machine interfaces. We develop speech-based monitoring applications and human machine interfaces (“HMI”) for a wide variety of clients. We believe that the...

See Software
Safetica

Safetica’s Intelligent Data Security protects sensitive data where teams work, using powerful AI to deliver contextual awareness, reduce false positives, and stop real threats without disrupting productivity. With Safetica, security teams can maintain visibility and control over sensitive data,...

See Software
Comet Backup

Start running backups and restores in less than 15 minutes! Fast, secure backup software for businesses and IT providers. Comet is a flexible, all-in-one backup platform available in 13 languages. You choose your backup destination, server location, configuration and setup. Backup to your...

See Software
Kasm Workspaces

Kasm Workspaces streams your workplace environment directly to your web browser…on any device and from any location. Kasm uses our high-performance streaming and secure isolation technology to provide web-native Desktop as a Service (DaaS), application streaming, and secure/private web...

See Software

Report inappropriate content

DeepSpeech

Open source embedded speech-to-text engine

Get an email when there's a new version of DeepSpeech

Features

Project Samples

Project Activity

Categories

License

Follow DeepSpeech

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered