Synchronized Translation for Videos
Build multimodal AI applications with cloud-native stack
AirLLM 70B inference with single 4GB GPU
A nearly-live implementation of OpenAI's Whisper
Letta (formerly MemGPT) is a framework for creating LLM services
Official MiniMax Model Context Protocol (MCP) server
Secure local-first microVM sandbox for running untrusted code fast
Chat with any codebase in under two minutes | Fully local
High-performance inference server for text embeddings models API layer
AI-data warehouse to enrich, transform and analyze unstructured data
An open-source RAG-based tool for chatting with your documents
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Quick illustration of how one can easily read books together with LLMs
Official inference repo for FLUX.2 models
ChatGLM2-6B: An Open Bilingual Chat LLM
A TTS that fits in your CPU (and pocket)
Open platform connecting AI agents to tools via unified MCP server
OCR model for complex documents with layout-aware structured outputs
A simple native web interface that uses ChatTTS to synthesize text
Agent S: an open agentic framework that uses computers like a human
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
TensorFlow is an open source library for machine learning
Qwen2.5-VL is the multimodal large language model series
Open source libraries and APIs to build custom preprocessing pipelines
Unified web UI for training and running open models locally