Code for running inference with the SAM 3D Body Model (3DB)
Official inference repo for FLUX.1 models
Port of Facebook's LLaMA model in C/C++
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Inference framework for 1-bit LLMs
Code for running inference and finetuning with the SAM 3 model
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Continuous Autonomy for the AI SDK
Inference script for Oasis 500M
Tool for exploring and debugging transformer model behaviors
Easy Docker setup for Stable Diffusion with user-friendly UI
The official PyTorch implementation of Google's Gemma models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast-stable-diffusion + DreamBooth
Instructions on how to use the Realtime API on Microcontrollers
Foundation Models for Time Series
Safety reasoning models built upon gpt-oss
Open-source, high-performance Mixture-of-Experts large language model
An implementation of model parallel GPT-2 and GPT-3-style models
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Code for reproducing key results in the paper
Efficient 14B multimodal instruct model with edge deployment and FP8