A TTS that fits in your CPU (and pocket)
Neural Network architecture based on ideas of the original LSTM
157 models, 30 providers, one command to find what runs on hardware
Fast, small, and fully autonomous AI assistant infrastructure
MemU is an open-source memory framework for AI companions
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
Desktop Companion for Hermes Agent
Demo of a customer service use case implemented with the OpenAI Agents
Open-source large language model family from Tencent Hunyuan
Accessible large language models via k-bit quantization for PyTorch
Redundancy-aware KV Cache Compression for Reasoning Models
AI Agent Source Code Deep Research Report
A step-by-step guide to build your own AI agent
Supercharge Your LLM with the Fastest KV Cache Layer
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Memory-efficient and performant finetuning of Mistral's models
A Python library for audio
Faster Whisper transcription with CTranslate2
Unified web UI for training and running open models locally
Developer friendly Natural Language Processing
High-performance neural network inference framework for mobile
Official inference framework for 1-bit LLMs
Agent framework and applications built upon Qwen>=3.0
Persistent context and multi-instance coordination
MNN is a blazing fast, lightweight deep learning framework