LISA: Reasoning Segmentation via Large Language Model
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Code for running inference and finetuning with SAM 3 model
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Code release for Cut and Learn for Unsupervised Object Detection
RF-DETR is a real-time object detection and segmentation
The repository provides code for running inference with SAM 2
Python Audio Analysis Library: Feature Extraction, Classification
Pluggable SOTA multi-object tracking modules for segmentation
Reference PyTorch implementation and models for DINOv3
GeoAI: Artificial Intelligence for Geospatial Data
HivisionIDPhotos: a lightweight and efficient AI ID photos tools
Photorealistic Synthetic Dataset for Holistic Indoor Scene
PyTorch code and models for the DINOv2 self-supervised learning
Ultralytics YOLO
Sandbox for training deep learning networks
Video understanding codebase from FAIR for reproducing video models
Advanced AI Explainability for computer vision
Provides code for running inference with the SegmentAnything Model
Open source demo platform where you can easily showcase your AI models
AI-powered tool for generating, optimizing, and translating subtitles
A robust, efficient, low-latency speech-to-text library
Recovering the Visual Space from Any Views
Declarative way to run AI models in React Native on device
Handwritten Text Recognition (HTR) system implemented with TensorFlow