PyTorch code and models for V-JEPA self-supervised learning from video
PyTorch code and models for the DINOv2 self-supervised learning
ktrain is a Python library that makes deep learning AI more accessible
code for Mesh R-CNN, ICCV 2019
A Universal Customization Method for Single and Multi Conditioning
Implementation of Vision Transformer, a simple way to achieve SOTA
The repository provides code for running inference with SAM 2
Jittor is a high-performance deep learning framework
Any model. Any hardware. Zero compromise
A python library for self-supervised learning on images
Framework for building neural networks
Machine Learning Pipelines for Kubeflow
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Visual Automation IDE — automate anything you see on screen
A Customizable Image-to-Video Model based on HunyuanVideo
Powerful open source image generation model
dashAI: an interactive platform for training, evaluating and deploying
A fast, powerful, and simple hierarchical vision transformer
Guiding Instruction-based Image Editing via Multimodal Large Language
The unofficial python package that returns response of Google Bard
Implements weak-to-strong learning for training stronger ML models
Official code for Style Aligned Image Generation via Shared Attention
Generate 3D objects conditioned on text or images
Official Code for DragGAN (SIGGRAPH 2023)