GPU environment management and cluster orchestration
Open source libraries and APIs to build custom preprocessing pipelines
Automated pull requests reviewing and issues triaging with ChatGPT
Qwen2.5-VL is the multimodal large language model series
A collection of reference Jupyter notebooks and demo AI/ML application
Phi-3.5 for Mac: Locally-run Vision and Language Models
VMAS is a vectorized differentiable simulator
Create Customized Software using Natural Language Idea
SGLang is a fast serving framework for large language models
A Unified Library for Parameter-Efficient Learning
AI Agent Application Development Framework
An unsupervised and free tool for image and video dataset analysis
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
Dough is a open source tool for steering AI animations with precision
Integrating LLMs into structured NLP pipelines
Revolutionizing Database Interactions with Private LLM Technology
Qlib is an AI-oriented quantitative investment platform
Sample code and notebooks for Generative AI on Google Cloud
Volcano Engine Reinforcement Learning for LLMs
DeepMind model for tracking arbitrary points across videos & robotics
Sharp Monocular Metric Depth in Less Than a Second
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
The repository provides code for running inference with SAM 2