DeepScaleR
Agentica Project

FLAN-T5
Google

About DeepScaleR

DeepScaleR is a 1.5-billion-parameter language model fine-tuned from DeepSeek-R1-Distill-Qwen-1.5B using distributed reinforcement learning (RL) and an iterative context-lengthening strategy that gradually raises the context window from 8K to 24K tokens during training. It was trained on about 40,000 curated mathematical problems drawn from competition-level datasets: AIME (1984–2023), AMC (pre-2023), Omni-MATH, and STILL. DeepScaleR reaches 43.1% accuracy on AIME 2024, roughly 14.3 percentage points above the base model, and surpasses OpenAI's proprietary o1-preview model on that benchmark despite being much smaller. It also posts strong results on other math benchmarks (e.g., MATH-500, AMC 2023, Minerva Math, OlympiadBench), suggesting that small models tuned with RL can match or exceed larger baselines on reasoning tasks.
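
The iterative context-lengthening idea described above can be sketched as a staged schedule. This is a minimal illustration, not the project's training code: the names `STAGES`, `context_limit_for_step`, and `truncate_response` are hypothetical, and the intermediate 16K stage and the step boundaries are assumptions. Only the 8K starting point and the 24K end point come from the description.

```python
# Hypothetical staged schedule: (first training step of stage, max context tokens).
# Only the 8K start and 24K end come from the description; the 16K stage and
# the step boundaries are illustrative assumptions.
STAGES = [(0, 8_192), (1_000, 16_384), (1_500, 24_576)]


def context_limit_for_step(step: int, stages=STAGES) -> int:
    """Return the maximum context length (in tokens) allowed at a training step."""
    if step < 0:
        raise ValueError("step must be non-negative")
    limit = stages[0][1]
    for start, max_tokens in stages:
        # Stages are listed in increasing order, so the last match wins.
        if step >= start:
            limit = max_tokens
    return limit


def truncate_response(token_ids: list[int], step: int) -> list[int]:
    """Clip a sampled response to the context limit active at this step."""
    return token_ids[: context_limit_for_step(step)]
```

Under these assumed boundaries, a rollout sampled early in training is clipped to 8,192 tokens, while the same rollout late in training may use up to 24,576 tokens.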

About FLAN-T5

FLAN-T5 was introduced in the paper "Scaling Instruction-Finetuned Language Models". It is an enhanced version of T5 that has been instruction-finetuned on a mixture of tasks.
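
Because FLAN-T5 is instruction-finetuned, it is typically prompted with a plain-language task description. The sketch below shows one way this might look with the Hugging Face Transformers library. The helper names `build_prompt` and `run_flan_t5` are hypothetical, the "task: text" prompt layout is an assumption rather than a required format, and `google/flan-t5-small` is used only as a small example checkpoint.

```python
def build_prompt(task: str, text: str) -> str:
    """Format a plain-language instruction followed by the input text."""
    return f"{task.strip()}: {text.strip()}"


def run_flan_t5(prompt: str, model_name: str = "google/flan-t5-small") -> str:
    """Generate a response with a FLAN-T5 checkpoint (downloads weights on first use)."""
    # Imported lazily so the prompt helper works without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=32)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


prompt = build_prompt("Translate English to German", "How old are you?")
print(prompt)
```

Calling `run_flan_t5(prompt)` requires the `transformers` and `torch` packages and network access to fetch the checkpoint.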

Platforms Supported (DeepScaleR and FLAN-T5)

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

DeepScaleR: Researchers, students, and developers interested in an AI model capable of mathematical reasoning and logic tasks without requiring heavy hardware
FLAN-T5: Developers interested in a powerful large language model

Support (DeepScaleR and FLAN-T5)

Phone Support
24/7 Live Support
Online

API (DeepScaleR and FLAN-T5)

Offers API

Pricing (DeepScaleR and FLAN-T5)

Free
Free Version
Free Trial

Training (DeepScaleR and FLAN-T5)

Documentation
Webinars
Live Online
In Person

Company Information

DeepScaleR
Agentica Project
Founded: 2025
United States
agentica-project.com

FLAN-T5
Google
Founded: 1998
United States
huggingface.co/docs/transformers/model_doc/flan-t5

Alternatives

DeepCoder (Agentica Project)

Alternatives

T5 (Google)
Mistral 7B (Mistral AI)
Phi-4-reasoning (Microsoft)
Llama 2 (Meta)
Alpaca (Stanford Center for Research on Foundation Models (CRFM))
Athene-V2 (Nexusflow)
Kimi K2 (Moonshot AI)

Integrations (DeepScaleR and FLAN-T5)

Forefront
Medical LLM