model-builder free download

Moonshine Voice

Fast and accurate automatic speech recognition (ASR) for edge devices

...Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. The toolkit also includes specialized model variants, including monolingual options that improve accuracy for specific languages. Overall, Moonshine serves developers building privacy-conscious, on-device voice interfaces that demand high performance with minimal resource overhead.

Downloads: 3 This Week

Last Update: 2026-06-02

PersonaPlex

PersonaPlex code

PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. ...

Downloads: 1 This Week

Last Update: 2026-03-02

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs such as SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps).
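The figures quoted for Mimi are related by simple arithmetic, which the short sketch below works through. It is not Moshi or Mimi code: the function name `frame_stats` and its parameters are hypothetical, and the only inputs are the numbers stated in the description above (24 kHz sample rate, 12.5 Hz frame rate, 1.1 kbps bitrate).

```python
def frame_stats(sample_rate_hz: float, frame_rate_hz: float, bitrate_bps: float) -> dict:
    """Derive per-frame quantities from a codec's headline figures.

    Hypothetical helper for illustration only; not part of any Moshi/Mimi API.
    """
    return {
        # Duration of one frame in milliseconds (also the streaming latency floor).
        "frame_ms": 1000.0 / frame_rate_hz,
        # Number of input audio samples summarized by one frame.
        "samples_per_frame": sample_rate_hz / frame_rate_hz,
        # Bit budget available to encode one frame at the given bitrate.
        "bits_per_frame": bitrate_bps / frame_rate_hz,
    }


# Figures taken from the description: 24 kHz audio, 12.5 Hz frames, 1.1 kbps.
stats = frame_stats(24_000, 12.5, 1_100)
print(stats)
```

Under these figures a 12.5 Hz frame rate corresponds to the 80 ms frame size quoted above, with each frame covering 1920 input samples and carrying 88 bits.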

Downloads: 0 This Week

Last Update: 2024-11-05

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 34 This Week

Last Update: 2026-03-23

Coqui STT

The deep learning toolkit for speech-to-text

Coqui STT is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. Coqui STT is battle-tested in both production and research. It can return multiple possible transcripts, each with an associated confidence score. Experience the immediacy of script-to-performance. With Coqui text-to-speech, production times go from months to minutes. With Coqui, the post is a pleasure. Effortlessly clone the voices of your talent and have the clone handle the problems...

Downloads: 3 This Week

Last Update: 2022-09-03

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and relies on Google's TensorFlow to make the implementation easier. A pre-trained English model is available and can be downloaded, along with other important inference material, from the DeepSpeech releases page by following the instructions in the usage docs.

Downloads: 3 This Week

Last Update: 2021-04-08

XZVoice

Free and open source text-to-speech software

...Technically, it models multi-level rhythmic pauses to achieve a natural synthesis rhythm, and combines acoustic and linguistic parameters to build multiple automatic prediction models based on deep learning. The pronunciation model is trained on massive audio data, so the synthetic voice sounds real, full, cadenced, and expressive, and its MOS score has reached a professional level for the industry.

Downloads: 0 This Week

Last Update: 2022-10-04

TTS

Deep learning for text to speech

...TTS comes with pre-trained models and tools for measuring dataset quality, and is already used in 20+ languages for products and research projects. Released models in PyTorch, TensorFlow and TFLite. Tools to curate Text2Speech datasets under dataset_analysis. Demo server for model testing. Notebooks for extensive model benchmarking. Modular (but not too much) code base enabling easy testing for new ideas. Text2Spec models (Tacotron, Tacotron2, Glow-TTS, SpeedySpeech). Speaker Encoder to compute speaker embeddings efficiently. Vocoder models (MelGAN, Multiband-MelGAN, GAN-TTS, ParallelWaveGAN, WaveGrad, WaveRNN). ...

Downloads: 2 This Week

Last Update: 2021-10-18

High-order HMM in Matlab

Implementation of duration high-order hidden Markov model in Matlab.

Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.

2 Reviews

Downloads: 0 This Week

Last Update: 2015-02-15

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26

Austrian German Voices for Festival

Austrian voices for the Festival speech synthesis system

Hidden Markov Model based voice models of Austrian German for the Festival speech synthesis system.

Downloads: 0 This Week

Last Update: 2015-08-16

HMM Speech Recognition in Matlab

A speech recognition system using Matlab/Simulink/Stateflow.

This project provides a hidden Markov model speech recognition system built with Matlab/Simulink/Stateflow.

4 Reviews

Downloads: 0 This Week

Last Update: 2016-07-25

ASR-Builder

ASR-Builder provides an easy-to-use interface to the HTK toolkit that allows users to build ASR systems. It also provides a platform that performs housekeeping tasks when using HTK, along with default training, testing, and recognition scripts.

Downloads: 0 This Week

Last Update: 2013-04-26

M68331 Voice Recognition System

This project shows how to implement hidden Markov model approximations for voice recognition on embedded and low-power systems.

Downloads: 0 This Week

Last Update: 2013-02-21

Search Results for "model-builder"

Showing 14 open source projects for "model-builder"
