Blender add-ons that bridge Blender and geographic data
Towards Real-World Vision-Language Understanding
Automatically find issues in image datasets
Implementation of Imagen, Google's Text-to-Image Neural Network
A Unified Framework for Image Customization
Reference PyTorch implementation and models for DINOv3
AutoGluon: AutoML for Image, Text, and Tabular Data
An open source object detection toolbox based on PyTorch
ImageBind: One Embedding Space to Bind Them All
21 Lessons, Get Started Building with Generative AI
Diffusion Transformer with Fine-Grained Chinese Understanding
Code for running inference and finetuning with SAM 3 model
Python data, Leaflet.js maps
Capable of understanding text, audio, vision, video
YOLOv5 is the world's most loved vision AI
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Let's make video diffusion practical
Blender pipeline for photorealistic training image generation
Contexts Optical Compression
List of free ChatGPT mirror sites, continuously updated
Open-Sora: Democratizing Efficient Video Production for All
Tensor search for humans
Implementation of a U-net complete with efficient attention
Official implementation of Watermark Anything with Localized Messages
Usable Implementation of "Bootstrap Your Own Latent" self-supervised