Open-source machine learning framework to automate text conversations
Foundational model for human-like, expressive TTS
World's first open-source, agentic video production system
Free, high-quality text-to-speech API endpoint to replace OpenAI
Automagically synchronize subtitles with video
C++ inference library for multiple SVC/TTS
Bailing is a voice dialogue robot similar to GPT-4o
Open-source abilities for OpenHome agents
PersonaPlex code
AI framework for automated short video creation and editing tools
Open-source personal AI assistant for Linux, Windows, and Mac
From Images to High-Fidelity 3D Assets
Open-source model for program synthesis
Free, open-source speech synthesizer for Russian and other languages
The most powerful local music generation model
Use Microsoft Edge's online text-to-speech service from Python
Fast multimodal LLM for real-time voice interaction and AI apps
Berkeley Quantum Synthesis Toolkit
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Automatic Speech Recognition with Word-level Timestamps
Open-source framework for conversational voice AI agents
Chat with it via text and voice
Towards Human-Sounding Speech
A Model Context Protocol Server for Home Assistant
Wan2.1: Open and Advanced Large-Scale Video Generative Model