Reference PyTorch implementation and models for DINOv3
Implementation of "MobileCLIP" CVPR 2024
CLIP: predict the most relevant text snippet given an image
Bidirectional token-classification model for identifiable info
Repo of Qwen2-Audio chat & pretrained large audio language model
Audio foundation model excelling in audio understanding
PyTorch code and models for the DINOv2 self-supervised learning method
Designed for text embedding and ranking tasks
Dataset of GPT-2 outputs for research in detection, biases, and more
This repository contains the official implementation of research
RoBERTa Chinese pre-training model: RoBERTa for Chinese
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
Per-Pixel Classification is Not All You Need for Semantic Segmentation
The official PyTorch implementation of our paper
Reproduces results of "Fixing the train-test resolution discrepancy"
Elegant PyTorch implementation of the paper "Model-Agnostic Meta-Learning"
A library for Multilingual Unsupervised or Supervised word Embeddings