DeepSWE-Preview is a 32.8B-parameter open-source coding agent trained solely with reinforcement learning (RL) to perform complex software engineering (SWE) tasks. Built on top of Qwen3-32B, it achieves 59% accuracy on the SWE-Bench-Verified benchmark—currently the highest reported among open-weight models. The agent navigates and edits large codebases within the R2E-Gym environment, using tools such as a file editor, bash execution, and search. Training relies on sparse reward signals and policy gradient strategies adapted from GRPO, DAPO, Dr.GRPO, and RLOO; hybrid test-time scaling is applied at evaluation time to push accuracy further.

DeepSWE-Preview demonstrates strong reasoning, file navigation, and patch-submission skills, making it well suited to agent-based code repair, debugging, and PR generation across real-world repositories. The model can be served with platforms like vLLM and Hugging Face TGI, with support for 64k context length and OpenAI-compatible APIs.
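Because the model is served behind OpenAI-compatible APIs, a client only needs to construct a standard chat-completions request. The sketch below builds such a request payload; the model identifier, sampling parameters, and endpoint conventions are assumptions for illustration, not values confirmed by this card.

```python
import json

# Hedged sketch: build a chat-completions request body for an
# OpenAI-compatible server (e.g. vLLM's `/v1/chat/completions`).
# The model id below is an assumption, not taken from this card.
def build_chat_request(prompt: str, max_tokens: int = 4096) -> str:
    payload = {
        "model": "agentica-org/DeepSWE-Preview",  # assumed model id
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": 0.6,  # illustrative sampling choice
    }
    return json.dumps(payload)

request_body = build_chat_request("Fix the failing test in utils.py")
```

The resulting JSON string can be POSTed to the server's chat-completions route with any HTTP client; the 64k context window leaves ample room for long repository context in the prompt.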
Features
- Trained entirely via reinforcement learning with no supervised fine-tuning
- #1 performance on SWE-Bench-Verified (59%, with hybrid test-time scaling) among open-weight agents
- Built on Qwen3-32B with thinking mode and 64k context support
- Uses R2E-Gym tools like file editor, bash, and search for task completion
- Employs enhanced GRPO-based RL algorithm for stable and efficient training
- Includes hybrid test-time scaling that lifts accuracy well beyond single-rollout pass@1
- Sparse, outcome-based rewards (granted only when the full test suite passes) simulate realistic software engineering feedback
- Openly licensed (MIT) for accessible and extensible AI development
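The sparse reward signal mentioned above can be sketched as a simple outcome check: the agent earns reward only when its patch makes every test pass, with no partial credit. The function name and shape here are illustrative assumptions, not the actual training code.

```python
# Hedged sketch of a sparse, outcome-based reward: reward is granted
# only when the patched repository passes all tests; partial progress
# earns nothing. Names are illustrative, not from the training code.
def sparse_reward(test_results: list[bool]) -> float:
    """Return 1.0 only if there are tests and all of them pass."""
    return 1.0 if test_results and all(test_results) else 0.0
```

All-or-nothing rewards like this are harder to learn from than dense shaped signals, but they are faithful to how real SWE work is judged and leave less room for reward hacking.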
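The GRPO-based training bullet can be illustrated by the group-normalized advantage at the core of GRPO-style methods: several rollouts are sampled per task, and each rollout is scored by how its reward compares to the group. This is a deliberate simplification; DeepSWE's actual recipe layers further modifications (drawn from DAPO, Dr.GRPO, and RLOO) on top.

```python
import statistics

# Simplified sketch of GRPO-style group-normalized advantages:
# each rollout's reward is centered and scaled by the group's
# statistics, so rollouts are scored relative to their peers.
def group_advantages(rewards: list[float]) -> list[float]:
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards)
    if std == 0.0:  # all rollouts scored the same: no learning signal
        return [0.0] * len(rewards)
    return [(r - mean) / std for r in rewards]
```

With sparse binary rewards, a group where some rollouts pass and others fail yields positive advantages for the passing trajectories and negative ones for the rest; a group that uniformly fails (or uniformly succeeds) contributes no gradient signal.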