Optimize your code automatically with AI
C++ library for high performance inference on NVIDIA GPUs
High-performance neural network inference framework for mobile
Text and image to video generation: CogVideoX and CogVideo
The repository provides code for running inference with SAM 2
A system monitoring tool that exposes system metrics
SAPIEN Manipulation Skill Framework
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
Standardized Serverless ML Inference Platform on Kubernetes
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
SGLang is a fast serving framework for large language models
Sparsity-aware deep learning inference runtime for CPUs
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
OpenVINO™ Toolkit repository
The 100 line AI agent that solves GitHub issues
Open-source evaluation toolkit of large multi-modality models (LMMs)
Unleashing 10,000+ Word Generation from Long Context LLMs
The Fastest LLM Gateway with built in OTel observability
Generate audiobooks from e-books
Ultra-Efficient AI Assistant in Go
Generate music based on natural language prompts using LLMs
Official inference framework for 1-bit LLMs
The official PyTorch implementation of Google's Gemma models
MemU is an open-source memory framework for AI companions
Elegant and Performant Deep Learning