Audiocraft is a library for audio processing and generation
Multimodal Diffusion with Representation Alignment
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Official repository for LTX-Video
Workflow and speech recognition app
Python inference and LoRA trainer package for the LTX-2 audio–video
Common Resource Grep
AlphaPlayer is a video animation engine
Music research software
ILA is a fully customizable and teachable voice assistant for Java
Speech recognition application builder and library
This project includes basic NLP and DSP techniques for Text-to-Speech
An Incremental Spoken Dialogue Processing Toolkit
HMM Speech Recognition in Java
A duration high-order hidden Markov model (DHO-HMM) in Java.
Provides speech to text gui to sphinx4
Artificial intelligence evolves musical instruments played with mouse
simple algorithm for a realtime interactive visual cortex for painting