Generate audiobooks from e-books
A simple, high-quality voice conversion tool focused on ease of use
A python library that makes AMR parsing, generation and visualization
Offline Text To Speech synthesis for python
The Python code to reproduce illustrations from Machine Learning Book
Code and models for ICML 2024 paper, NExT-GPT
Accurate × Fast × Comprehensive
A Pythonic framework to simplify AI service building
The simplest, fastest repository for training/finetuning models
A simple, secure MCP-to-OpenAPI proxy server
Stable Diffusion built-in to Blender
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A sound cloning tool with a web interface, using your voice
MiniSom is a minimalistic implementation of the Self Organizing Maps
Open Source Document Management System for Digital Archives
State-of-the-art diffusion models for image and audio generation
Concatenate a directory full of files into a single prompt
PPTAgent: Generating and Evaluating Presentations
An advanced paper search agent powered by large language models
Audiocraft is a library for audio processing and generation
Two Integrated Text To Speech Engines uses MMS & Silero
Img2Txt - Extract Text From Images using AI
Official PyTorch Implementation of "Scalable Diffusion Models"
A minimal implementation of diffusion models for text generation
Singing Voice Synthesis via Shallow Diffusion Mechanism