MemU is an open-source memory framework for AI companions
SwarmZero's SDK for building AI agents, swarms of agents and much more
A framework for open autonomous economic agent (AEA) development
Clean and efficient FP8 GEMM kernels with fine-grained scaling
A simple, secure MCP-to-OpenAPI proxy server
The most powerful Android RPA agent framework
Implementation of "MobileCLIP" CVPR 2024
A fast, powerful, and simple hierarchical vision transformer
Code release for Cut and Learn for Unsupervised Object Detection
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
CLIP, Predict the most relevant text snippet given an image
Research code artifacts for Code World Model (CWM)
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal Diffusion with Representation Alignment
Deep learning optimization library: makes distributed training easy
Official code for Style Aligned Image Generation via Shared Attention
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
ICLR2024 Spotlight: curation/training code, metadata, distribution