Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
DeepMind model for tracking arbitrary points across videos & robotics
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Renderer for the harmony response format to be used with gpt-oss
Chat & pretrained large audio language model proposed by Alibaba Cloud
A trainable PyTorch reproduction of AlphaFold 3
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
High-Resolution Image Synthesis with Latent Diffusion Models
Official DeiT repository
Powerful open source image generation model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Open-source, high-performance Mixture-of-Experts large language model
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
AI Suite for upscaling, interpolating & restoring images/videos
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Open Multilingual Multimodal Chat LMs
Example Discord bot written in Python that uses the completions API
Dataset of GPT-2 outputs for research in detection, biases, and more
Official code for Style Aligned Image Generation via Shared Attention
Let us control diffusion models
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Official repo for consistency models
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)