encoding free download

WavTokenizer

SOTA discrete acoustic codec models with 40/75 tokens per second

WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...

Downloads: 1 This Week

Last Update: 2025-11-28

See Project

Bert-VITS2

VITS2 backbone with multilingual-bert

Bert-VITS2 is a neural text-to-speech project that combines a VITS2 backbone with a multilingual BERT front-end to produce high-quality speech in multiple languages. The core idea is to use BERT-style contextual embeddings for text encoding while relying on a refined VITS2 architecture for acoustic generation and vocoding. The repository includes everything needed to train, fine-tune, and run the model, from configuration files to preprocessing scripts, spectrogram utilities, and training entrypoints for multi-GPU and multi-node setups. It provides emotional modeling through “emo embeddings,” allowing voices to be conditioned on different affective states during synthesis. ...

Downloads: 0 This Week

Last Update: 2025-11-28

See Project

Concrete Voice

Concrete Voice is a text to speech program. It can read the time, anounce weather, read text file, save text files to audio files, open any text file (supports all text encoding formats) and many more advance stuff!

Downloads: 0 This Week

Last Update: 2016-01-31

See Project

Search Results for "encoding"

Showing 3 open source projects for "encoding"

WavTokenizer

Bert-VITS2

Concrete Voice

Search Results for "encoding"

Showing 3 open source projects for "encoding"

WavTokenizer

Bert-VITS2

Concrete Voice

Related Searches

Related Categories