A single Gradio + React WebUI with extensions for ACE-Step
A nearly-live implementation of OpenAI's Whisper
Foundational model for human-like, expressive TTS
Olares: An Open-Source Sovereign Cloud OS for Local AI
Deep Research framework, combining language models with tools
Instant voice cloning by MIT and MyShell. Audio foundation model
The open big data serving engine
An Open Source implementation of Notebook LM with more flexibility
Knowledge Graph Generation from Any Text
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
The python library for real-time communication
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Unified web UI for training and running open models locally
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Create videos with Stable Diffusion
Like the macOS say command, but with a modern voice
A minimal LLM chat app that runs entirely in your browser
Spark-TTS Inference Code
Multimodal embedding and reranking models built on Qwen3-VL
Free, high-quality text-to-speech API endpoint to replace OpenAI
Capable of understanding text, audio, vision, video
NLP Cloud serves high performance pre-trained or custom models for NER
Flowly is 100x faster than OpenClaw
Open Source Document Management System for Digital Archives