Image generation model with single-stream diffusion transformer
DeepSeek LLM: Let there be answers
Python inference and LoRA trainer package for the LTX-2 audio–video
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Official DeiT repository
Official inference repo for FLUX.1 models
Towards self-verifiable mathematical reasoning
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Research code artifacts for Code World Model (CWM)
A PyTorch library for implementing flow matching algorithms
Memory-efficient and performant finetuning of Mistral's models
The ChatGPT Retrieval Plugin lets you easily find personal documents
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Code release for ConvNeXt V2 model
Flexible text-to-text transformer model for multilingual NLP tasks
Robust BERT-based model for English with improved MLM training
JetBrains’ 4B-parameter code model for completions
Multimodal Transformer for document image understanding and layout
Large language model developed and released by NVIDIA
Efficient English embedding model for semantic search and retrieval
Small 3B-base multimodal model ideal for custom AI on edge hardware
Versatile 8B-base multimodal LLM, flexible foundation for custom AI
Powerful 14B-base multimodal model — flexible base for fine-tuning
Frontier-scale 675B multimodal base model for custom AI training
Compact hybrid reasoning language model for intelligent responses