Deploy and share agents with open infrastructure
🐈 nanobot: The Ultra-Lightweight Clawdbot / OpenClaw
Interface for OuteTTS models
Build cross-modal and multimodal applications on the cloud
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
A text-to-speech, speech-to-text and speech-to-speech library
Repo of Qwen2-Audio chat & pretrained large audio language model
The common language for platforms, agents and businesses.
Qwen3-omni is a natively end-to-end, omni-modal LLM
Speech-AI-Forge is a project developed around TTS generation models
On-device Speech-to-Intent engine powered by deep learning
A TTS that fits in your CPU (and pocket)
Deep Research framework, combining language models with tools
Open platform for training, serving, and evaluating language models
Capable of understanding text, audio, vision, video
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Stable Diffusion web UI
Multi-Voice and Prompt-Controlled TTS Engine
Embed images and sentences into fixed-length vectors
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environment
Shinkai allows you to create advanced AI (local) agents effortlessly
Serve machine learning models within a Docker container
Code repo for WebArena to build Autonomous Agents
Leading free and open-source liveness check & face recognition system