FastAPI framework, high performance, easy to learn, fast to code
All Algorithms implemented in Python
The friendly Python Imaging Library fork
Models for object and human mesh reconstruction
Code for running inference with the SAM 3D Body Model 3DB
Official DeiT repository
A Powerful Native Multimodal Model for Image Generation
Easily turn large sets of image URLs into an image dataset
A neural network that transforms a design mock-up into a static website
A Customizable Image-to-Video Model based on HunyuanVideo
A formatter for Python files
Contexts Optical Compression
Guiding Instruction-based Image Editing via Multimodal Large Language
CLIP: predict the most relevant text snippet given an image
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Official inference repo for FLUX.2 models
RGBD video generation model conditioned on camera input
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Towards Real-World Vision-Language Understanding
Code for running inference and finetuning with SAM 3 model
Python and JavaScript bindings for calling the Earth Engine API
A Python 3 implementation built on GraalVM
Reference PyTorch implementation and models for DINOv3
Official code for Style Aligned Image Generation via Shared Attention