Diversity-driven optimization and large-model reasoning ability
A theoretical reconstruction of the Claude Mythos architecture
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
800,000 step-level correctness labels on LLM solutions to MATH problem
Reasoning-powered OCR VLM for converting complex documents to Markdown
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices