Instant voice cloning by MIT and MyShell. Audio foundation model
Toolkit to help you get started with Spec-Driven Development
A simple native web interface that uses ChatTTS to synthesize text
Enable AI to control your desktop, mobile and HMI devices
Parallax is a distributed model serving framework
All-in-one WebUI for AI generative image and video creation
Vibe-Trading: Your Personal Trading Agent
Enterprise platform for building and orchestrating AI agent workflows
Generate audiobooks from e-books
Open-source platform for evaluating, observing, and improving LLM
Low-latency AI inference engine optimized for mobile devices
SOTA Open Source TTS
A self-hosted open source photo management service
Production-grade platform for building agentic IM bots
Control Any Computer Using LLMs
UI-TARS-desktop version that can operate on your local personal device
Pushing the Frontier of Long Audio-Visual Generation
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Open source platform for the machine learning lifecycle
TFX is an end-to-end platform for deploying production ML pipelines
Open-Sora: Democratizing Efficient Video Production for All
Open-source platform for building AI agents and serverless automation
A framework to enable multimodal models to operate a computer
Persistent memory for AI agent fleets (OSS)
CogView4, CogView3-Plus and CogView3(ECCV 2024)