FAIR Sequence Modeling Toolkit 2
A PyTorch library for implementing flow matching algorithms
Hackable and optimized Transformers building blocks
Foundational Models for State-of-the-Art Speech and Text Translation
Memory-efficient and performant finetuning of Mistral's models
Analyze computation-communication overlap in V3/R1
Official implementation of DreamCraft3D
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Controllable & emotion-expressive zero-shot TTS
DeepMind model for tracking arbitrary points across videos & robotics
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
Repo of Qwen2-Audio chat & pretrained large audio language model
MiniMax-M2, a model built for Max coding & agentic workflows
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1