On-device Speech-to-Intent engine powered by deep learning
Open source machine learning framework to automate text conversations
Foundational model for human-like, expressive TTS
Bailing is a voice dialogue robot similar to GPT-4o
C++ inference library for multiple SVC/TTS
Free, high-quality text-to-speech API endpoint to replace OpenAI
Open-source abilities for OpenHome agents
From Images to High-Fidelity 3D Assets
Open source personal AI Assistant for Linux, Windows and Mac
Fast multimodal LLM for real-time voice interaction and AI apps
Open-source model for program synthesis
The most powerful local music generation model
AI framework for automated short video creation and editing tools
Use Microsoft Edge's online text-to-speech service from Python
Multi-modal large language model designed for audio understanding
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Chat with it via text and voice
Automatic Speech Recognition with Word-level Timestamps
The python library for real-time communication
Open-source framework for conversational voice AI agents
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Towards Human-Sounding Speech
A Model Context Protocol Server for Home Assistant
Interface for OuteTTS models
High-quality multi-lingual text-to-speech library by MyShell.ai