An experimental version of DeepSeek model
Inference framework for 1-bit LLMs
Towards self-verifiable mathematical reasoning
Hackable and optimized Transformers building blocks
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Open-source, high-performance Mixture-of-Experts large language model
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices