DeepSeek Coder: Let the Code Write Itself
Python inference and LoRA trainer package for the LTX-2 audio–video
Hackable and optimized Transformers building blocks
High-Fidelity and Controllable Generation of Textured 3D Assets
Official DeiT repository
DeepSeek LLM: Let there be answers
Let us control diffusion models
Implementation of model parallel autoregressive transformers on GPUs
A collection of high-quality models for the MuJoCo physics engine
Dual LSTM Encoder for Dialog Response Generation
LL model providing reasoning and conversational capabilities
Frontier-scale 675B multimodal base model for custom AI training
Quantized 675B multimodal instruct model optimized for NVFP4
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Frontier-scale 675B multimodal instruct MoE model for enterprise AIMis