Open Source Speech Language Model
AI-powered bridge connecting LLMs and advanced AI agents
A sound cloning tool with a web interface, using your voice
Open source visual editor for building React drag-and-drop pages
Easy-to-use and powerful NLP library with an awesome model zoo
Easily compute CLIP embeddings and build a CLIP retrieval system
Collection of Gemma 3 variants that are trained for performance
An open-source text-to-speech system built by inverting Whisper
Image generation model with single-stream diffusion transformer
Foundation model for image generation
Stable Diffusion web UI
A very simple framework for state-of-the-art NLP
Automated translation solution for visual novels
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
StreamSpeech is a seamless model for offline speech recognition
An MCP server that autonomously evaluates web applications
Toolkit for conversational AI
Qwen2.5-VL is a multimodal large language model series
NLP Cloud serves high-performance pre-trained or custom models
Framework for building real-time voice and multimodal AI agents
A fast, helpful, and open-source document parser
A high-quality PDF to Markdown tool based on a large language model
Voice Recognition to Text Tool
Official Python inference and LoRA trainer package
Moonshot's most powerful AI model