DeepScaleR

DeepScaleR

Agentica Project
Olmo 3

Olmo 3

Ai2
+
+

Related Products

  • ScriptSure
    30 Ratings
    Visit Website
  • RunPod
    205 Ratings
    Visit Website
  • PackageX OCR Scanning
    46 Ratings
    Visit Website
  • Setplex
    10 Ratings
    Visit Website
  • Ango Hub
    15 Ratings
    Visit Website
  • Adaptive Security
    83 Ratings
    Visit Website
  • Syncro
    538 Ratings
    Visit Website
  • CMW Platform
    681 Ratings
    Visit Website
  • Dynamo Software
    68 Ratings
    Visit Website
  • Nexo
    16,425 Ratings
    Visit Website

About

DeepScaleR is a 1.5-billion-parameter language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B using distributed reinforcement learning and a novel iterative context-lengthening strategy that gradually increases its context window from 8K to 24K tokens during training. It was trained on ~40,000 carefully curated mathematical problems drawn from competition-level datasets like AIME (1984–2023), AMC (pre-2023), Omni-MATH, and STILL. DeepScaleR achieves 43.1% accuracy on AIME 2024, a roughly 14.3 percentage point boost over the base model, and surpasses the performance of the proprietary O1-Preview model despite its much smaller size. It also posts strong results on a suite of math benchmarks (e.g., MATH-500, AMC 2023, Minerva Math, OlympiadBench), demonstrating that small, efficient models tuned with RL can match or exceed larger baselines on reasoning tasks.

About

Olmo 3 is a fully open model family spanning 7 billion and 32 billion parameter variants that delivers not only high-performing base, reasoning, instruction, and reinforcement-learning models, but also exposure of the entire model flow, including raw training data, intermediate checkpoints, training code, long-context support (65,536 token window), and provenance tooling. Starting with the Dolma 3 dataset (≈9 trillion tokens) and its disciplined mix of web text, scientific PDFs, code, and long-form documents, the pre-training, mid-training, and long-context phases shape the base models, which are then post-trained via supervised fine-tuning, direct preference optimisation, and RL with verifiable rewards to yield the Think and Instruct variants. The 32 B Think model is described as the strongest fully open reasoning model to date, competitively close to closed-weight peers in math, code, and complex reasoning.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Researchers, students, and developers interested in an AI model capable of mathematical reasoning and logic tasks without requiring heavy hardware

Audience

AI researchers, developers and enterprises needing a tool offering foundation models to inspect, fine-tune or deploy with full provenance and auditability

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Agentica Project
Founded: 2025
United States
agentica-project.com

Company Information

Ai2
Founded: 2014
United States
allenai.org/blog/olmo3

Alternatives

DeepCoder

DeepCoder

Agentica Project

Alternatives

Qwen3-Max

Qwen3-Max

Alibaba
MiniMax M1

MiniMax M1

MiniMax
Phi-4-reasoning

Phi-4-reasoning

Microsoft
Athene-V2

Athene-V2

Nexusflow
DeepSeek-V3.2

DeepSeek-V3.2

DeepSeek
CodeGemma

CodeGemma

Google

Categories

Categories

Integrations

No info available.

Integrations

No info available.
Claim DeepScaleR and update features and information
Claim DeepScaleR and update features and information
Claim Olmo 3 and update features and information
Claim Olmo 3 and update features and information