CodeContests, developed by Google DeepMind, is a large-scale competitive programming dataset designed for training and evaluating machine learning models on code generation and problem solving. This dataset played a central role in the development of AlphaCode, DeepMind’s model for solving programming problems at a human-competitive level, as published in Science. CodeContests aggregates problems and human-written solutions from multiple programming competition platforms, including AtCoder, Codeforces, CodeChef, Aizu, and HackerEarth. Each problem includes structured metadata, problem descriptions, paired input/output test cases, and multiple correct and incorrect solutions in various programming languages. The dataset is distributed in Riegeli format using Protocol Buffers, with separate training, validation, and test splits for reproducible machine learning experiments.
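To make the record layout concrete, the sketch below mirrors the kind of structure described above as plain Python dataclasses. The field names here are simplified assumptions for illustration, not the actual Protocol Buffer schema shipped with the dataset:

```python
from dataclasses import dataclass, field
from typing import List

# Illustrative mirror of a CodeContests record. The real dataset defines
# these fields in a Protocol Buffer schema; the names and types here are
# assumptions chosen only to show the overall shape of a record.

@dataclass
class TestCase:
    input: str    # stdin fed to a submission
    output: str   # expected stdout

@dataclass
class Solution:
    language: str  # e.g. "PYTHON3", "CPP"
    code: str

@dataclass
class ContestProblem:
    name: str
    source: str         # originating judge, e.g. "CODEFORCES", "ATCODER"
    description: str
    tests: List[TestCase] = field(default_factory=list)
    correct_solutions: List[Solution] = field(default_factory=list)
    incorrect_solutions: List[Solution] = field(default_factory=list)

# A toy record with one test case and one correct solution.
problem = ContestProblem(
    name="Example Problem",
    source="CODEFORCES",
    description="Read an integer n and print n doubled.",
    tests=[TestCase(input="21\n", output="42\n")],
    correct_solutions=[Solution(language="PYTHON3",
                                code="print(int(input()) * 2)")],
)
print(problem.name, len(problem.tests), len(problem.correct_solutions))
```

In the real dataset these records are serialized with Protocol Buffers and stored in Riegeli files, one split per set of files (train/validation/test).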
Features
- Comprehensive dataset of competitive programming problems and solutions
- Sourced from multiple online judges such as Codeforces, AtCoder, and CodeChef
- Includes both correct and incorrect human solutions with test cases
- Provided in Riegeli format with Protocol Buffer definitions
- Tools for evaluating and executing code submissions in sandboxed environments
- Used in training AlphaCode for program synthesis and competition-level reasoning
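The evaluation workflow implied by the paired test cases can be sketched as a minimal judge loop: run a candidate solution on each test input and compare its stdout to the expected output. This is an unsandboxed, Python-only sketch under assumed conventions; the dataset's actual tooling compiles and executes submissions inside proper sandboxed environments:

```python
import subprocess
import sys

def run_tests(solution_code: str, tests: list[tuple[str, str]],
              timeout_s: float = 2.0) -> bool:
    """Judge a Python solution against (input, expected_output) pairs.

    Minimal sketch only: spawns a subprocess per test with a time limit
    and compares trimmed stdout. The real CodeContests tooling runs
    submissions in a sandbox and supports multiple languages.
    """
    for stdin_data, expected in tests:
        try:
            result = subprocess.run(
                [sys.executable, "-c", solution_code],
                input=stdin_data, capture_output=True,
                text=True, timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            return False  # time limit exceeded
        if result.returncode != 0:
            return False  # runtime error
        if result.stdout.strip() != expected.strip():
            return False  # wrong answer
    return True

tests = [("21\n", "42\n"), ("0\n", "0\n")]
print(run_tests("print(int(input()) * 2)", tests))  # a correct solution
print(run_tests("print(int(input()) + 2)", tests))  # an incorrect solution
```

Because the dataset ships both correct and incorrect human solutions, a harness like this can be checked against known labels: correct solutions should pass all paired tests, incorrect ones should fail at least one.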