Usable Implementation of "Bootstrap Your Own Latent" self-supervised
An open source implementation of CLIP
Determined: a deep learning training platform
MMEditing is a low-level vision toolbox based on PyTorch
Modular quant framework
Library to help with training and evaluating neural networks
Qwen2.5-VL is the multimodal large language model series
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general-purpose software engineering tasks
Making Enterprise Data Intelligent and Responsive for AI
Advanced techniques for RAG systems
Fast and Universal 3D reconstruction model for versatile tasks
Implementation of Vision Transformer, a simple way to achieve SOTA
Official code for Style Aligned Image Generation via Shared Attention
A secure sandbox environment for malware developers and red teamers
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
A Model Context Protocol (MCP) Gateway & Registry
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
Open-source platform for building enterprise-grade agents
FAIR Sequence Modeling Toolkit 2