DeepSeek R1
Open-source, high-performance AI model with advanced reasoning
... integrates large-scale reinforcement learning (RL) without relying on supervised fine-tuning, enabling the model to develop advanced reasoning capabilities. This approach has resulted in performance comparable to leading models like OpenAI's o1, while maintaining cost-efficiency. To further support the research community, DeepSeek has released distilled versions of the model based on architectures such as LLaMA and Qwen.
This is a full repo snapshot ZIP file of the DeepSeek R1 code.