Open Source Speech Language Model
AI-powered bridge connecting LLMs and advanced AI agents
Open source visual editor for building React drag-and-drop pages
Easily compute CLIP embeddings and build a CLIP retrieval system
Collection of Gemma 3 variants that are trained for performance
Easy-to-use and powerful NLP library with an awesome model zoo
An Open Source text-to-speech system built by inverting Whisper
A sound cloning tool with a web interface, using your voice
Stable Diffusion web UI
A very simple framework for state-of-the-art NLP
Foundation model for image generation
Image generation model with single-stream diffusion transformer
Automated translation solution for visual novels
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
StreamSpeech is a seamless model for offline speech recognition
An MCP server that autonomously evaluates web applications
NLP Cloud serves high-performance pre-trained or custom models
Toolkit for conversational AI
Qwen2.5-VL is the multimodal large language model series
A fast, helpful, and open-source document parser
Voice Recognition to Text Tool
Framework for building real-time voice and multimodal AI agents
A high-quality PDF to Markdown tool based on a large language model
Official Python inference and LoRA trainer package
Olares: An Open-Source Sovereign Cloud OS for Local AI