DeepScaleR

DeepScaleR

Agentica Project
+
+

Related Products

  • ScriptSure
    30 Ratings
    Visit Website
  • RunPod
    205 Ratings
    Visit Website
  • PackageX OCR Scanning
    46 Ratings
    Visit Website
  • Setplex
    10 Ratings
    Visit Website
  • Ango Hub
    15 Ratings
    Visit Website
  • Adaptive Security
    83 Ratings
    Visit Website
  • Evertune
    1 Rating
    Visit Website
  • CMW Platform
    681 Ratings
    Visit Website
  • Dynamo Software
    68 Ratings
    Visit Website
  • Nexo
    16,425 Ratings
    Visit Website

About

DeepScaleR is a 1.5-billion-parameter language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B using distributed reinforcement learning and a novel iterative context-lengthening strategy that gradually increases its context window from 8K to 24K tokens during training. It was trained on ~40,000 carefully curated mathematical problems drawn from competition-level datasets like AIME (1984–2023), AMC (pre-2023), Omni-MATH, and STILL. DeepScaleR achieves 43.1% accuracy on AIME 2024, a roughly 14.3 percentage point boost over the base model, and surpasses the performance of the proprietary O1-Preview model despite its much smaller size. It also posts strong results on a suite of math benchmarks (e.g., MATH-500, AMC 2023, Minerva Math, OlympiadBench), demonstrating that small, efficient models tuned with RL can match or exceed larger baselines on reasoning tasks.

About

Qwen3-Max-Thinking is Alibaba’s latest flagship reasoning-enhanced large language model, built as an extension of the Qwen3-Max family and designed to deliver state-of-the-art analytical performance and multi-step reasoning capabilities. It scales up from one of the largest parameter bases in the Qwen ecosystem and incorporates advanced reinforcement learning and adaptive tool integration so the model can leverage search, memory, and code interpreter functions dynamically during inference to address difficult multi-stage tasks with higher accuracy and contextual depth compared with standard generative responses. Qwen3-Max-Thinking introduces a unique Thinking Mode that exposes deliberate, step-by-step reasoning before final outputs, enabling transparency and traceability of logical chains, and can be tuned with configurable “thinking budgets” to balance performance quality with computational cost.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Researchers, students, and developers interested in an AI model capable of mathematical reasoning and logic tasks without requiring heavy hardware

Audience

Developers, researchers, and enterprise teams needing an advanced AI model for deep reasoning, complex problem-solving, and context-rich decisioning in applications like agents, analytics, and research tools

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Agentica Project
Founded: 2025
United States
agentica-project.com

Company Information

Alibaba
Founded: 1999
China
qwen.ai/blog

Alternatives

DeepCoder

DeepCoder

Agentica Project

Alternatives

Kimi K2.5

Kimi K2.5

Moonshot AI
Qwen3-Max

Qwen3-Max

Alibaba
Qwen2

Qwen2

Alibaba
Athene-V2

Athene-V2

Nexusflow
Qwen3

Qwen3

Alibaba
Phi-4-reasoning

Phi-4-reasoning

Microsoft
QwQ-32B

QwQ-32B

Alibaba

Categories

Categories

Integrations

No info available.

Integrations

No info available.
Claim DeepScaleR and update features and information
Claim DeepScaleR and update features and information
Claim Qwen3-Max-Thinking and update features and information
Claim Qwen3-Max-Thinking and update features and information