High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official Python inference and LoRA trainer package
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
This repository contains the official implementation of FastVLM
Recovering the Visual Space from Any Views
Repo for SeedVR2 & SeedVR
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen2.5-VL is a multimodal large language model series
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official repository for LTX-Video
A Customizable Image-to-Video Model based on HunyuanVideo
Reference PyTorch implementation and models for DINOv3
Native and Compact Structured Latents for 3D Generation
An easy 1-click way to create beautiful artwork on your PC using AI
GPT4V-level open-source multi-modal model based on Llama3-8B
Open image model at the forefront of design
Text and image to video generation: CogVideoX and CogVideo
Global weather forecasting model using graph neural networks and JAX
Programmatic access to the AlphaGenome model
Sharp Monocular Metric Depth in Less Than a Second
Implementation of the Surya Foundation Model for Heliophysics
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation