Audiocraft is a library for audio processing and generation
Multimodal Diffusion with Representation Alignment
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Official repository for LTX-Video
Workflow and speech recognition app
Python inference and LoRA trainer package for the LTX-2 audio–video
AlphaPlayer is a video animation engine
Music research software
ILA is a fully customizable and teachable voice assistant for Java
An Incremental Spoken Dialogue Processing Toolkit
HMM Speech Recognition in Java
Provides speech to text gui to sphinx4
Artificial intelligence evolves musical instruments played with mouse
simple algorithm for a realtime interactive visual cortex for painting
Speedy Composer – Artificial Neural Network Melody Composer.