FastAPI framework, high performance, easy to learn, fast to code
All Algorithms implemented in Python
The friendly Python Imaging Library fork
A Powerful Native Multimodal Model for Image Generation
Official DeiT repository
Code for running inference with the SAM 3D Body Model 3DB
Models for object and human mesh reconstruction
Easily turn large sets of image urls to an image dataset
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A Customizable Image-to-Video Model based on HunyuanVideo
Contexts Optical Compression
A neural network that transforms a design mock-up into static websites
A formatter for Python files
Guiding Instruction-based Image Editing via Multimodal Large Language
CLIP, Predict the most relevant text snippet given an image
RGBD video generation model conditioned on camera input
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Diffusion Transformer with Fine-Grained Chinese Understanding
Towards Real-World Vision-Language Understanding
A Python 3 implementation built on GraalVM
Python and JavaScript bindings for calling the Earth Engine API
Reference PyTorch implementation and models for DINOv3
Open-Sora: Democratizing Efficient Video Production for All
Official code for Style Aligned Image Generation via Shared Attention