ICLR2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for the DINOv2 self-supervised learning
Reference PyTorch implementation and models for DINOv3
Language modeling in a sentence representation space
4M: Massively Multimodal Masked Modeling
Qwen-Image is a powerful image generation foundation model
VMZ: Model Zoo for Video Modeling
Chat & pretrained large audio language model proposed by Alibaba Cloud
Software that can generate photos from paintings
A mix of GAN implementations including progressive growing
A library for Multilingual Unsupervised or Supervised word Embeddings
Flexible text-to-text transformer model for multilingual NLP tasks
An advanced bilingual image editing with semantic control
T5-Small: Lightweight text-to-text transformer for NLP tasks
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices