Official code for Style Aligned Image Generation via Shared Attention
4M: Massively Multimodal Masked Modeling
Language modeling in a sentence representation space
A PyTorch library for implementing flow matching algorithms
Official DeiT repository
An AI-powered security review GitHub Action using Claude
Dataset of GPT-2 outputs for research in detection, biases, and more
Memory-efficient and performant finetuning of Mistral's models
Pushing the Limits of Mathematical Reasoning in Open Language Models
Diffusion Transformer with Fine-Grained Chinese Understanding
Implementation of the Surya Foundation Model for Heliophysics
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
The ChatGPT Retrieval Plugin lets you easily find personal documents
FlashMLA: Efficient Multi-head Latent Attention Kernels
High-Resolution Image Synthesis with Latent Diffusion Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A Conversational Speech Generation Model
Open-Source Financial Large Language Models!
Powerful open source image generation model
Open-source, high-performance Mixture-of-Experts large language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model