Programs to process GoPro MP4 & generic GPX/FIT files
RAG-Anything: All-in-One RAG Framework
Motion-controllable Video Generation via Latent Trajectory Guidance
Open-source platform for building enterprise-grade agents
An AI-powered data science team of agents
An open phone agent model & framework
A distributed and extensible workflow scheduler platform
[CVPR 2026 Oral] VGGT Omega
Claude Code for everything except coding
From Addition, Subtraction, Multiplication, and Division to ML
Create HTML profiling reports from pandas DataFrame objects
Flock is a workflow-based low-code platform for building chatbots
From Paper to Presentation in One Click
Zero-code platform for building AI agents from natural language input
PyTorch3D is FAIR's library of reusable components for deep learning
InvokeAI is a leading creative engine for Stable Diffusion models
The open-source C/C++ package manager
Multilingual Document Layout Parsing in a Single Vision-Language Model
An on-premises, OCR-free unstructured data extraction toolkit
Repository containing notebooks of my posts on Medium
Harmonized and Coherent Human Image Animation
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Let agents classify your bank transactions
Multimodal embedding and reranking models built on Qwen3-VL