Multimodal embedding and reranking models built on Qwen3-VL
"Big Model" trains a visual multimodal VLM with 26M parameters
A Python toolbox for gaining geometric insights
Modular quant framework
A theme for Sublime Text 3 by Mattia Astorino
Learning agent trained in a diffusion world model
General-purpose image editing model that delivers high-fidelity
Fast, powerful, git-native ticket tracking in a single bash script
An AI-powered data science team of agents
A Python library for extracting structured information
TorchMultimodal is a PyTorch library
ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch3D is FAIR's library of reusable components for deep learning
[CVPR 2025 Best Paper Award] VGGT
The book "Performance Analysis and Tuning on Modern CPU"
3D plotting and mesh analysis through a streamlined interface
Unifying 3D Mesh Generation with Language Models
Gracefully face hCaptcha challenge with multimodal llms
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
GitLab automatic code review tool based on large models
Python module that helps you build complex pipelines of batch jobs
OCR expert VLM powered by Hunyuan's native multimodal architecture
A Pioneering Open-Source Alternative to GPT-4o
a parametric 3D CAD modeler