Learning agent trained in a diffusion world model
Fast, powerful, git-native ticket tracking in a single bash script
Inference script for Oasis 500M
TorchMultimodal is a PyTorch library
ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch3D is FAIR's library of reusable components for deep learning
[CVPR 2025 Best Paper Award] VGGT
A command-line utility for taking automated screenshots of websites
Gracefully face hCaptcha challenge with multimodal llms
From Paper to Presentation in One Click
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Flexible Photo Recrafting While Preserving Your Identity
Towards Real-World Vision-Language Understanding
OCR expert VLM powered by Hunyuan's native multimodal architecture
Large-language-model & vision-language-model based on Linear Attention
Chat & pretrained large vision language model
airda(Air Data Agent
Installable / Portable Python Distribution for Everyone.
A software construction tool
Free offline SEM software with HTMT, bootstrapping & exports
Simulate chemical processes using advanced thermodynamic models
Virtual AI anchor that combines state-of-the-art technology
*VoxShare* is a simple Python-based push-to-talk multicast voice chat
Transform your DualShock 4 into a native Xbox 360 controller. v2.1.0
Desktop piano playable with a PC keyboard, mouse, or MIDI device.