CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder, Codeforces, CodeChef, Aizu, and HackerEarth. Each problem includes structured metadata, problem descriptions, paired input/output test cases, and multiple correct and incorrect solutions in various programming languages. The dataset is distributed in Riegeli format using Protocol Buffers, with separate training, validation, and test splits for reproducible machine learning experiments.

Features

  • Comprehensive dataset of competitive programming problems and solutions
  • Sourced from multiple online judges such as Codeforces, AtCoder, and CodeChef
  • Includes both correct and incorrect human solutions with test cases
  • Provided in Riegeli format with Protocol Buffer definitions
  • Tools for evaluating and executing code submissions in sandboxed environments
  • Used in training AlphaCode for program synthesis and competition-level reasoning

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

Apache License V2.0

Follow CodeContests

CodeContests Web Site

Other Useful Business Software
$300 Free Credits for Your Google Cloud Projects Icon
$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
Start Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of CodeContests!

Additional Project Details

Operating Systems

Linux

Programming Language

C++, Python

Related Categories

Python Machine Learning Software, C++ Machine Learning Software

Registered

2025-10-09