Wan2.1: Open and Advanced Large-Scale Video Generative Model
Synthesizing and manipulating 2048x1024 images with conditional GANs
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Modular AI image and video generation web UI with extensible tools
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Command-line program to download image galleries and collections
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
ImageBind One Embedding Space to Bind Them All
A Powerful Native Multimodal Model for Image Generation
Director, Screenwriter, Producer, and Video Generator All-in-One
Sharp Monocular View Synthesis in Less Than a Second
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Award-Winning Open Source Video Editing Software
Text and image to video generation: CogVideoX and CogVideo
Tooling for the Common Objects In 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Generating Immersive, Explorable, and Interactive 3D Worlds
Capable of understanding text, audio, vision, video
PS2 Covers Collection
Sharp Monocular Metric Depth in Less Than a Second
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
RGBD video generation model conditioned on camera input
Easily compute clip embeddings and build a clip retrieval system
An unsupervised and free tool for image and video dataset analysis