DeepSeek Coder: Let the Code Write Itself
Implementation of "MobileCLIP" CVPR 2024
Unified Multimodal Understanding and Generation Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
FlashMLA: Efficient Multi-head Latent Attention Kernels
Dataset of GPT-2 outputs for research in detection, biases, and more
CLIP model fine-tuned for zero-shot fashion product classification