Official Python inference and LoRA trainer package
Official inference repo for FLUX.2 models
Synthesizing and manipulating 2048x1024 images with conditional GANs
PyTorch implementation of JiT
This repository contains the official implementation of FastVLM
Recovering the Visual Space from Any Views
Give Claude the ability to watch and understand videos
Generate high-definition short story videos with one click using AI
Knowledge Graph Generation from Any Text
Official repository for LTX-Video
A Customizable Image-to-Video Model based on HunyuanVideo
Reference PyTorch implementation and models for DINOv3
Tokenizer-Free TTS for Multilingual Speech Generation
StarVector is a foundation model for SVG generation
Sharp Monocular Metric Depth in Less Than a Second
Implementation of the Surya Foundation Model for Heliophysics
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
The 100-line AI agent that solves GitHub issues
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
An agentless approach to automatically solve software development
A system for agentic LLM-powered data processing and ETL
Generate Any 3D Scene in Seconds
Large-language-model & vision-language-model based on Linear Attention
High-Resolution Image Synthesis with Latent Diffusion Models