An open phone agent model & framework
Pixel-Aligned 3D Generation from Images
[CVPR 2026 Oral] VGGT Omega
Claude code for everything except coding
Your CrewAI Powered Video Editing Assistant
ComfyUI wrapper nodes for HunyuanVideo
Create HTML profiling reports from pandas DataFrame objects
From Paper to Presentation in One Click
Zero-code platform for building AI agents from natural language input
PyTorch3D is FAIR's library of reusable components for deep learning
The open-source C/C++ package manager
InvokeAI is a leading creative engine for Stable Diffusion models
Multilingual Document Layout Parsing in a Single Vision-Language Model
An on-premises, OCR-free unstructured data extraction
Harmonized and Coherent Human Image Animation
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Let agents classify your bank transactions
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
PaddlePaddle End-to-End Development Toolkit
Open source feature flagging and remote config service
A theme for Sublime Text 3 by Mattia Astorino
Cross-platform API testing client for humans
Unifying 3D Mesh Generation with Language Models