CLIP, Predict the most relevant text snippet given an image
Implementation of "MobileCLIP" CVPR 2024
Bidirectional token-classification model for identifiable info
Repo of Qwen2-Audio chat & pretrained large audio language model
Audio foundation model excelling in audio understanding
Designed for text embedding and ranking tasks
Dataset of GPT-2 outputs for research in detection, biases, and more
RoBERTa Chinese pre-training model: RoBERTa for Chinese