tiktoken is a fast BPE tokeniser for use with OpenAI's models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Video understanding codebase from FAIR for reproducing video models
PyTorch code and models for the DINOv2 self-supervised learning
An AI-powered security review GitHub Action using Claude
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Dataset of GPT-2 outputs for research in detection, biases, and more
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Chat & pretrained large audio language model proposed by Alibaba Cloud
A series of math-specific large language models of our Qwen2 series
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Hackable and optimized Transformers building blocks
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Phi-3.5 for Mac: Locally-run Vision and Language Models
Official code for Style Aligned Image Generation via Shared Attention
The official PyTorch implementation of Google's Gemma models
Implementation of "MobileCLIP" CVPR 2024
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
Language modeling in a sentence representation space
A PyTorch library for implementing flow matching algorithms