Code for running inference with the SAM 3D Body Model (3DB)
Port of Facebook's LLaMA model in C/C++
AlphaFold 3 inference pipeline
Official inference repo for FLUX.1 models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
Code for running inference and finetuning with SAM 3 model
Inference framework for 1-bit LLMs
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Continuous Autonomy for the AI SDK
Inference script for Oasis 500M
High-Resolution Image Synthesis with Latent Diffusion Models
Tool for exploring and debugging transformer model behaviors
Easy Docker setup for Stable Diffusion with user-friendly UI
The official PyTorch implementation of Google's Gemma models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Inference code for scalable emulation of protein equilibrium ensembles
Phi-3.5 for Mac: Locally-run Vision and Language Models
Fast-stable-diffusion + DreamBooth
Instructions on how to use the Realtime API on Microcontrollers
Foundation Models for Time Series
Global weather forecasting model using graph neural networks and JAX
Safety reasoning models built upon gpt-oss
Open-source, high-performance Mixture-of-Experts large language model
An implementation of model-parallel GPT-2 and GPT-3-style models
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)