MARS5 speech model (TTS) from CAMB.AI
Towards Human-Level Text-to-Speech through Style Diffusion
SpeeD ReaD is a little program to help you read faster.
A deep learning toolkit for Text-to-Speech, battle-tested in research
Audio Transcription software for Linux (Vlc) with a foot pedal
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chinese voice dialogue robot/smart speaker project
Singing voice change based on whisper, lora for singing voice clone
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Audio Transcription software for Linux (Gstreamer) with a foot pedal
[WIP] VoiceSmith makes training text to speech models easy
We provide a PyTorch implementation of the paper Voice Separation
Conditional Variational Autoencoder with Adversarial Learning
An implementation of Tacotron 2 that supports multilingual experiments
Automated Plant Environment Growing System using Raspberry Pi
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Cross Audio-Visual Recognition using 3D Architectures
Beamforming and Speech Recognition Toolkit
(audio, video, image) Multimedia Multimodal Information Retrieval
The all in one solution for collaborative academic writing.
Open source platform to write and publish print and digital books