Reference PyTorch implementation and models for DINOv3
Implementation of "MobileCLIP" CVPR 2024
CLIP, Predict the most relevant text snippet given an image
Bidirectional token-classification model for identifiable info
Repo of Qwen2-Audio chat & pretrained large audio language model
Audio foundation model excelling in audio understanding
PyTorch code and models for the DINOv2 self-supervised learning
Designed for text embedding and ranking tasks
Encoder of greater-than-word length text trained on a variety of data
Dataset of GPT-2 outputs for research in detection, biases, and more
This repository contains the official implementation of research
RoBERTa Chinese pre-training model: RoBERTa for Chinese
Code release for "Masked-attention Mask Transformer
PyTorch implementation of MAE
The official pytorch implementation of our paper
Learning embeddings for classification, retrieval and ranking
Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning
A library for Multilingual Unsupervised or Supervised word Embeddings
CLIP model fine-tuned for zero-shot fashion product classification
Robust BERT-based model for English with improved MLM training
Flexible text-to-text transformer model for multilingual NLP tasks
T5-Small: Lightweight text-to-text transformer for NLP tasks
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Multimodal Transformer for document image understanding and layout
Compact English sentence embedding model for semantic search tasks