Visual Causal Flow
Wan2.1: Open and Advanced Large-Scale Video Generative Model
High-Resolution Image Synthesis with Latent Diffusion Models
Inference framework for 1-bit LLMs
Pokee Deep Research Model Open Source Repo
Stable Diffusion with Core ML on Apple Silicon
Phi-3.5 for Mac: Locally-run Vision and Language Models
Contexts Optical Compression
Z80-μLM is a 2-bit quantized language model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Models for object and human mesh reconstruction
A state-of-the-art open visual language model
Renderer for the harmony response format to be used with gpt-oss
Code for running inference with the SAM 3D Body Model 3DB
PyTorch code and models for the DINOv2 self-supervised learning
gpt-oss-120b and gpt-oss-20b are two open-weight language models
StudioOllamaUI is a local, portable interface for Ollama
Diffusion Transformer with Fine-Grained Chinese Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Chat & pretrained large audio language model proposed by Alibaba Cloud
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Provides convenient access to the Anthropic REST API from any Python 3
Powerful open source image generation model
Fine-tuning ChatGLM-6B with PEFT
LLaMA: Open and Efficient Foundation Language Models