DeepScaleR

Agentica Project

About

DeepScaleR is a 1.5-billion-parameter language model fine-tuned from DeepSeek-R1-Distill-Qwen-1.5B using distributed reinforcement learning (RL) and an iterative context-lengthening strategy that raises the context window from 8K to 24K tokens over the course of training. It was trained on roughly 40,000 curated mathematical problems drawn from AIME (1984–2023), AMC (pre-2023), Omni-MATH, and STILL. DeepScaleR reaches 43.1% accuracy on AIME 2024, about 14.3 percentage points above the base model, and surpasses OpenAI's proprietary o1-preview despite its much smaller size. It also posts strong results on other math benchmarks (e.g., MATH-500, AMC 2023, Minerva Math, OlympiadBench), suggesting that small, efficient models tuned with RL can match or exceed larger baselines on reasoning tasks.
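
As a rough illustration of what an iterative context-lengthening schedule can look like, the sketch below builds a list of training stages whose token limit grows from 8K to 24K. The description above gives only the two endpoints; the 8K step size, the `Stage` class, and the `context_schedule` and `truncate` helpers are hypothetical names chosen for this example and are not taken from the DeepScaleR training code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One training stage with a fixed context limit (hypothetical helper)."""
    max_context_tokens: int


def context_schedule(start: int = 8_192, end: int = 24_576,
                     step: int = 8_192) -> list[Stage]:
    """Return stages whose context limit grows from `start` to `end`.

    The step size is an assumption; only the 8K and 24K endpoints
    come from the model description.
    """
    if start <= 0 or step <= 0 or end < start:
        raise ValueError("need 0 < start <= end and step > 0")
    limits = list(range(start, end + 1, step))
    if limits[-1] != end:
        limits.append(end)  # always finish at the final limit
    return [Stage(n) for n in limits]


def truncate(token_ids: list[int], stage: Stage) -> list[int]:
    """Clip a token sequence to the current stage's context limit."""
    return token_ids[: stage.max_context_tokens]


if __name__ == "__main__":
    for stage in context_schedule():
        print(stage.max_context_tokens)
```

In a real RL run each stage would also carry its own step budget and stopping rule; those details are omitted here because the description does not specify them.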

About

Molmo 2 is a suite of open vision-language models released with fully open weights, training data, and training code. It extends the original Molmo family’s grounded image understanding to video and multi-image inputs, adding video understanding, pointing, tracking, dense captioning, and question answering, all with strong spatial and temporal reasoning across frames. Molmo 2 includes three variants: an 8-billion-parameter model optimized for overall video grounding and QA, a 4-billion-parameter version designed for efficiency, and a 7-billion-parameter Olmo-backed model offering a fully open end-to-end architecture, including the underlying language model. These models outperform earlier Molmo versions on core benchmarks and set new open-model high-water marks for image and video understanding tasks, often competing with substantially larger proprietary systems while training on a fraction of the data used by comparable closed models.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Researchers, students, and developers interested in an AI model capable of mathematical reasoning and logic tasks without requiring heavy hardware

Audience

Researchers, developers, and AI practitioners who need an open, state-of-the-art video and multi-image understanding model for grounded vision, tracking, and reasoning tasks

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Pricing

Free
Free Version
Free Trial

Pricing

No information available.
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet.

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Agentica Project
Founded: 2025
United States
agentica-project.com

Company Information

Ai2
Founded: 2014
United States
allenai.org/blog/molmo2

Alternatives

DeepCoder (Agentica Project)

Alternatives

Pixtral Large (Mistral AI)
GLM-4.1V (Zhipu AI)
Phi-4-reasoning (Microsoft)
Devstral 2 (Mistral AI)
Phi-2 (Microsoft)
Athene-V2 (Nexusflow)

Integrations

Ai2 OLMoE
Bluesky
Hugging Face
Olmo 2
Threads

Integrations

Ai2 OLMoE
Bluesky
Hugging Face
Olmo 2
Threads