A simple, high-quality voice conversion tool focused on ease of use
TTS with Kokoro and ONNX Runtime
Public repository for Agent Skills
Open-source infrastructure for Computer-Use Agents. Sandboxes
Focused on creating classic small Python examples and cases
Let's make video diffusion practical
The Python code to reproduce illustrations from Machine Learning Book
A TTS that fits in your CPU (and pocket)
Open source healthcare AI
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The official Python library for the Fish Audio API
A simple native web interface that uses ChatTTS to synthesize text
The SOTA Open-Source Browser Agent
Controllable and fast Text-to-Speech for over 7000 languages
PyTorch code and models for VJEPA2 self-supervised learning from video
Master the fundamentals of machine learning, deep learning
HY-Motion model for 3D character animation generation
Python library and CLI tool to interface with Google Translate
Building an Intelligent Agent from Scratch
Open-source AI marketing skills for Claude Code
Containerized automation engine for programmable CI/CD workflows
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
On the Structural Pruning of Large Language Models
A PyTorch library for implementing flow matching algorithms
Audiocraft is a library for audio processing and generation