CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
A tool for semi-automatic cell type classification, harmonization
Data Lake for Deep Learning. Build, manage, and query datasets
Node.js native addon build tool
Control Any Computer Using LLMs
OpenCL integration for Python, plus shiny features
CineCLI is a cross-platform command-line movie browser
The Memory layer for AI Agents
git-cola: The highly caffeinated Git GUI
The open-source C/C++ package manager
Music player and music library manager for Linux, Windows, and macOS
Your Fully-Automated Personal AI Assistant
Generate blog articles from video or audio
Build GUI for your Python program with JavaScript, HTML, and CSS
NVIDIA Federated Learning Application Runtime Environment
A Pragmatic VLA Foundation Model
Multimodal embedding and reranking models built on Qwen3-VL
Taming Stable Diffusion for Lip Sync
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
High-resolution models for human tasks
Next generation AWS IoT Client SDK for Python
GRR Rapid Response, remote live forensics for incident response
A lightweight text-to-speech model with zero-shot voice cloning
TorchMultimodal is a PyTorch library
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model