High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official Python inference and LoRA trainer package
Official inference repo for FLUX.2 models
PyTorch implementation of JiT
This repository contains the official implementation of FastVLM
Recovering the Visual Space from Any Views
Qwen2.5-VL is the multimodal large language model series
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A Customizable Image-to-Video Model based on HunyuanVideo
Official repository for LTX-Video
Reference PyTorch implementation and models for DINOv3
An easy 1-click way to create beautiful artwork on your PC using AI
Open image model at the forefront of design
Text and image to video generation: CogVideoX and CogVideo
Programmatic access to the AlphaGenome model
Sharp Monocular Metric Depth in Less Than a Second
Implementation of the Surya Foundation Model for Heliophysics
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Generate Any 3D Scene in Seconds
Large-language-model & vision-language-model based on Linear Attention
High-Resolution Image Synthesis with Latent Diffusion Models
AI Suite for upscaling, interpolating & restoring images/videos
Blazeface is a lightweight model that detects faces in images