Motion-controllable Video Generation via Latent Trajectory Guidance
Open-source platform for building enterprise-grade agents
An open phone agent model & framework
Claude code for everything except coding
ComfyUI wrapper nodes for HunyuanVideo
From Addition, Subtraction, Multiplication, and Division to ML
Flock is a workflow-based low-code platform for building chatbots
From Paper to Presentation in One Click
Zero-code platform for building AI agents from natural language input
PyTorch3D is FAIR's library of reusable components for deep learning
InvokeAI is a leading creative engine for Stable Diffusion models
Multilingual Document Layout Parsing in a Single Vision-Language Model
An on-premises, OCR-free unstructured data extraction
Repository containing notebooks of my posts on Medium
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
PaddlePaddle End-to-End Development Toolkit
Unifying 3D Mesh Generation with Language Models
Open multimodal web agent built by Ai2
Learning agent trained in a diffusion world model
No-code LLM Platform to launch APIs and ETL Pipelines
Fast, powerful, git-native ticket tracking in a single bash script
Inference script for Oasis 500M