Port of Facebook's LLaMA model in C/C++
DeepSeek Coder: Let the Code Write Itself
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Implementation of "MobileCLIP" CVPR 2024
High-resolution models for human tasks
Official DeiT repository
Memory-efficient and performant finetuning of Mistral's models
Unified Multimodal Understanding and Generation Models
High-Resolution Image Synthesis with Latent Diffusion Models
Let us control diffusion models
800,000 step-level correctness labels on LLM solutions to MATH problem
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)