MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Lets make video diffusion practical
FlashMLA: Efficient Multi-head Latent Attention Kernels
Open-source large language model family from Tencent Hunyuan
Block Diffusion for Ultra-Fast Speculative Decoding
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Accurate × Fast × Comprehensive
FAIR Sequence Modeling Toolkit 2
Hackable and optimized Transformers building blocks
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Large-language-model & vision-language-model based on Linear Attention
Official DeiT repository
This repository contains the official implementation of research
Reference implementation of the Transformer architecture optimized
Generate embeddings from large-scale graph-structured data
Model that fuses instruct, reasoning and agentic skills
LL model providing reasoning and conversational capabilities
Open language model developed by NVIDIA as part of Nemotron-3 family
High-performance MoE model with MLA, MTP, and multilingual reasoning