Reverse engineering Gemini's SynthID detection
An unsupervised and free tool for image and video dataset analysis
Unsupervised Learning for Image Registration
Deep and Machine Learning for Microscopy
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
NLP Cloud serves high performance pre-trained or custom models for NER
We write your reusable computer vision tools
GeoAI: Artificial Intelligence for Geospatial Data
Virtual AI anchor that combines state-of-the-art technology
A lightweight vision library for performing large object detection
Contexts Optical Compression
Create HTML profiling reports from pandas DataFrame objects
DeepVariant is an analysis pipeline that uses a deep neural networks
The Multi-Agent Framework
Sandbox for training deep learning networks
Advanced AI Explainability for computer vision
Unified Multimodal Understanding and Generation Models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
After 4/15/26 this project will be archived as 9 pipelines are conso
Scientific Visualisation Made Easy
computer vision projects | Fun AI projects related to computer vision
YoloV3 Implemented in Tensorflow 2.0
PyTorch implementation of MAE