"VideoRAG: Chat with Your Videos
AI-Powered Wiki Generator for GitHub/GitLab/Bitbucket Repositories
Parse files for optimal RAG
Effortless data labeling with AI support from Segment Anything
Recovering the Visual Space from Any Views
LISA: Reasoning Segmentation via Large Language Model
StarVector is a foundation model for SVG generation
Suite of reference architectures for building GPU-accelerated vision
Elyra extends JupyterLab with an AI-centric approach
Unified Multimodal Understanding and Generation Models
Agent-ready RPA suite with visual workflow automation tools engine
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Python inference and LoRA trainer package for the LTX-2 audio–video
Machine learning image inpainting task that removes watermarks
Full-stack AI Red Teaming platform
Official implementation of Watermark Anything with Localized Messages
A neural network that transforms a design mock-up into static websites
SAPIEN Manipulation Skill Framework
AI tool that converts GitHub repositories into interactive diagrams
Driving with Graph Visual Question Answering
Autoregressive Model Beats Diffusion
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Self-supervised visual learning using momentum contrast in PyTorch
Contexts Optical Compression