Unified Multimodal Understanding and Generation Models
Gemma open-weight LLM library, from Google DeepMind
AI-powered penetration testing assistant using local LLM on linux
AI agents running research on single-GPU nanochat training
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Python framework for building scalable multi-agent systems
The fastest way to bring multi-agent workflows to production
Extract schema, statistics and entities from datasets
Module for automatic summarization of text documents and HTML pages
Official MiniMax Model Context Protocol (MCP) server
Build your own AI SRE agents. The open source toolkit for the AI era
Google Flights MCP and Python Library
Achieving 3+ generation speedup on reasoning tasks
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
WhatsApp MCP server enabling AI access to chats and messaging
LLM framework for document understanding and semantic retrieval
AI tool that converts GitHub repositories into interactive diagrams
LangChain powered shell command generator and runner CLI
MemoryOS is designed to provide a memory operating system
Autoregressive Model Beats Diffusion
DepGraph: Towards Any Structural Pruning
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Accessible large language models via k-bit quantization for PyTorch
slime is an LLM post-training framework for RL Scaling
In-depth tutorials on LLMs, RAGs and real-world AI agent applications