The most powerful and modular diffusion model GUI, API, and backend
Code for running inference with the SAM 3D Body Model 3DB
Generating Immersive, Explorable, and Interactive 3D Worlds
A neural network that transforms a design mock-up into a static website
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
A free & open-source 2D sprite editor, made with the Godot Engine
Models for object and human mesh reconstruction
Re-editable LaTeX/Typst graphics for Inkscape
Typer, build great CLIs, based on Python type hints
FastAPI framework, high performance, easy to learn, fast to code
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Guiding Instruction-based Image Editing via Multimodal Large Language
CLIP, Predict the most relevant text snippet given an image
A formatter for Python files
Automatically find issues in image datasets
Fast image augmentation library and an easy-to-use wrapper
A Python 3 implementation built on GraalVM
File and Image Management Application for Django
Open Source Differentiable Computer Vision Library
A Customizable Image-to-Video Model based on HunyuanVideo
Mozc - a Japanese Input Method Editor designed for multi-platform
Universal Radio Hacker: Investigate Wireless Protocols Like A Boss
CLI tool to extract (meta)data from PDF and manipulate PDF files
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Towards Real-World Vision-Language Understanding