Official Python inference and LoRA trainer package
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
This repository contains the official implementation of FastVLM
Recovering the Visual Space from Any Views
A Customizable Image-to-Video Model based on HunyuanVideo
Official repository for LTX-Video
Reference PyTorch implementation and models for DINOv3
Sharp Monocular Metric Depth in Less Than a Second
Implementation of the Surya Foundation Model for Heliophysics
Stable Diffusion WebUI Forge is a platform built on top of Stable Diffusion
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Generate Any 3D Scene in Seconds
Large language model and vision-language model based on linear attention
High-Resolution Image Synthesis with Latent Diffusion Models
Blazeface is a lightweight model that detects faces in images
Powerful open source image generation model
Official PyTorch Implementation of "Scalable Diffusion Models"
Code release for "Masked-attention Mask Transformer
Layout-aware OCR model for multilingual document understanding
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Fast 12B image model for high-quality text-to-image generation