ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for the DINOv2 self-supervised learning
Reference PyTorch implementation and models for DINOv3
4M: Massively Multimodal Masked Modeling
Language modeling in a sentence representation space
Qwen-Image is a powerful image generation foundation model
VMZ: Model Zoo for Video Modeling
Chat & pretrained large audio language model proposed by Alibaba Cloud
A mix of GAN implementations including progressive growing
A library for Multilingual Unsupervised or Supervised word Embeddings