OpenTinker is an RL-as-a-Service infrastructure for foundation models
The most powerful local music generation model
Visual Causal Flow
Fast stable diffusion on CPU and AI PC
Phi-3.5 for Mac: Locally-run Vision and Language Models
Models for object and human mesh reconstruction
Contexts Optical Compression
Fast-stable-diffusion + DreamBooth
Text and image to video generation: CogVideoX and CogVideo
Easy Docker setup for Stable Diffusion with user-friendly UI
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Code for running inference with the SAM 3D Body Model 3DB
Z80-μLM is a 2-bit quantized language model
Bidirectional token-classification model for identifiable info
High-Resolution Image Synthesis with Latent Diffusion Models
Pretrained time-series foundation model developed by Google Research
PyTorch code and models for the DINOv2 self-supervised learning method
A 0.1B Omni model trained from scratch
26M function-calling model that runs on incredibly small devices
Open-source deep-learning framework
Diffusion Transformer with Fine-Grained Chinese Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Pokee Deep Research Model Open Source Repo
Stable Diffusion with Core ML on Apple Silicon
A state-of-the-art open visual language model