Models for object and human mesh reconstruction
An AI-powered security review GitHub Action using Claude
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The official PyTorch implementation of Google's Gemma models
FlashMLA: Efficient Multi-head Latent Attention Kernels
Towards Real-World Vision-Language Understanding
The ChatGPT Retrieval Plugin lets you easily find personal documents
800,000 step-level correctness labels on LLM solutions to MATH problem