Offline inference engine for art, real-time voice conversations
Fast stable diffusion on CPU and AI PC
A natural language interface for computers
Taming Stable Diffusion for Lip Sync
The repository provides code for running inference with SAM 2
Lightweight Python library for adding real-time multi-object tracking
Redundancy-aware KV Cache Compression for Reasoning Models
Agentic, Reasoning, and Coding (ARC) foundation models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Foundation Model for Tabular Data
A Powerful Native Multimodal Model for Image Generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Probabilistic programming in Python
LLM training code for MosaicML foundation models
Universal LLM Deployment Engine with ML Compilation
OCR expert VLM powered by Hunyuan's native multimodal architecture
Toolkit for running TensorFlow training scripts on SageMaker
Z80-μLM is a 2-bit quantized language model
Uncover insights, surface problems, monitor, and fine tune your LLM
Achieving 3+ generation speedup on reasoning tasks
Inference script for Oasis 500M
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Probabilistic reasoning and statistical analysis in TensorFlow
Code for running inference with the SAM 3D Body Model 3DB
Foundation Models for Time Series