The library to build & auto-optimize LLM applications
OSINT, Discord, web & network utilities
Expressive Portrait Image Animation for Live Streaming
An open phone agent model & framework
Label Studio is a multi-type data labeling and annotation tool
ComfyUI reference implementation for IPAdapter models
On-premises, OCR-free unstructured data extraction
Parallel computing with task scheduling
PaddlePaddle End-to-End Development Toolkit
3D Engine with Blender Integration
Cross-platform API testing client for humans
Create beautiful slides on the web using Claude's frontend skills
AI tool that converts GitHub repositories into interactive diagrams
Extension of Google Research’s PaperBanana
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
[CVPR 2026 Oral] VGGT Omega
Benchmarking Multimodal Agents for Open-Ended Tasks
PDF to Markdown with vision models
Static Analyzer for Solidity
AI framework to autonomously improve the performance of any AI system
Inference script for Oasis 500M
Agent S: an open agentic framework that uses computers like a human
Powerful framework for controlling Android and iOS devices
Programs to process GoPro MP4 & generic GPX/FIT files
Gemma open-weight LLM library, from Google DeepMind