reinforcement learning free download

Showing 27 open source projects for "reinforcement learning"

View related business solutions

Software Development Python Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

Best-of Machine Learning with Python

A ranked list of awesome machine learning Python libraries

This curated list contains 900 awesome open-source projects with a total of 3.3M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! General-purpose machine learning and deep learning...

Downloads: 1 This Week

Last Update: 2025-10-30
See Project
2

Ray

A unified framework for scalable computing

...Accelerate your hyperparameter search workloads with Ray Tune. Find the best model and reduce training costs by using the latest optimization algorithms. Deploy your machine learning models at scale with Ray Serve, a Python-first and framework agnostic model serving framework. Scale reinforcement learning (RL) with RLlib, a framework-agnostic RL library that ships with 30+ cutting-edge RL algorithms including A3C, DQN, and PPO. Easily build out scalable, distributed systems in Python with simple and composable primitives in Ray Core.

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
3

Jittor

Jittor is a high-performance deep learning framework

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. The whole framework and meta-operators are compiled just in time. A powerful op compiler and tuner are integrated into Jittor. It allowed us to generate high-performance code specialized for your model. Jittor also contains a wealth of high-performance model libraries, including image recognition, detection, segmentation, generation, differentiable rendering, geometric learning, reinforcement learning, etc. ...

Downloads: 0 This Week

Last Update: 2025-07-28
See Project
4

RLax

Library of JAX-based building blocks for reinforcement learning agents

RLax (pronounced “relax”) is a JAX-based library developed by Google DeepMind that provides reusable mathematical building blocks for constructing reinforcement learning (RL) agents. Rather than implementing full algorithms, RLax focuses on the core functional operations that underpin RL methods—such as computing value functions, returns, policy gradients, and loss terms—allowing researchers to flexibly assemble their own agents. It supports both on-policy and off-policy learning, as well as value-based, policy-based, and model-based approaches. ...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Recursive Language Models

General plug-and-play inference library for Recursive Language Models

RLM (short for Reinforcement Learning Models) is a modular framework that makes it easier to build, train, evaluate, and deploy reinforcement learning (RL) agents across a wide range of environments and tasks. It provides a consistent API that abstracts away many of the repetitive engineering patterns in RL research and application work, letting developers focus on modeling, experimentation, and fine-tuning rather than infrastructure plumbing.

Downloads: 0 This Week

Last Update: 2026-02-18
See Project
6

MuJoCo Playground

An open source library for GPU-accelerated robot learning

...MuJoCo Playground supports both the MJX JAX implementation and the Warp physics engine, enabling flexible use across research pipelines. The environments are designed for fast training, compatibility with reinforcement learning libraries, and real-time trajectory visualization using rscope.

Downloads: 3 This Week

Last Update: 2026-03-17
See Project
7

NVIDIA Warp

A Python framework for accelerated simulation, data generation

...It enables developers to write kernel-level code in Python that is automatically compiled into efficient CUDA kernels, combining ease of use with near-native performance. The framework is designed for applications such as robotics, reinforcement learning, physical simulation, and differentiable computing, where performance and flexibility are critical. Warp provides a set of primitives for working with arrays, geometry, and physics operations, allowing users to implement complex simulations without writing low-level CUDA code directly. It also supports differentiable programming, enabling gradients to be computed through simulation pipelines, which is particularly valuable for machine learning integration.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
8

Flax

Flax is a neural network library for JAX

...Flax emphasizes composability: optimizers, training loops, and checkpointing are provided as examples or utilities rather than monolithic frameworks, encouraging research-friendly customization. The library is widely used in vision, language, and reinforcement learning, often serving as a thin layer atop NumPy-like JAX primitives. Tutorials and examples show patterns for multi-host training, mixed precision, and advanced input pipelines that scale from laptops to TPUs.

Downloads: 2 This Week

Last Update: 2026-03-20
See Project
9

Tunix

A JAX-native LLM Post-Training Library

Tunix is a JAX-native library for post-training large language models, bringing supervised fine-tuning, reinforcement learning–based alignment, and knowledge distillation into one coherent toolkit. It embraces JAX’s strengths—functional programming, jit compilation, and effortless multi-device execution—so experiments scale from a single GPU to pods of TPUs with minimal code changes. The library is organized around modular pipelines for data loading, rollout, optimization, and evaluation, letting practitioners swap components without rewriting the whole stack. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

Evolutionary Algorithm

Evolutionary Algorithm using Python

...Users can explore basic genetic algorithm setups, match phrase examples, pathfinding challenges, and microbial GA variants, as well as evolution strategy approaches like NES. The project also links classical evolutionary approaches with neural networks, illustrating how evolution can be used for model training in reinforcement learning and supervised contexts.

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
11

learn2learn

A PyTorch Library for Meta-learning Research

Learn2Learn is a PyTorch-based library focused on meta-learning and few-shot learning research. It provides reusable components and meta-learning algorithms, making it easier to build, train, and evaluate models that can quickly adapt to new tasks with minimal data. Learn2Learn is widely used in research for tasks such as few-shot classification, reinforcement learning, and optimization.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
12

DeepMind Research

Implementations and code to accompany DeepMind publications

This repository collects reference implementations and illustrative code accompanying a wide range of DeepMind publications, making it easier for the research community to reproduce results, inspect algorithms, and build on prior work. The top level organizes many paper-specific directories across domains such as deep reinforcement learning, self-supervised vision, generative modeling, scientific ML, and program synthesis—for example BYOL, Perceiver/Perceiver IO, Enformer for genomics, MeshGraphNets for physics, RL Unplugged, Nowcasting for weather, and more. Each project folder typically includes its own README, scripts, and notebooks so you can run experiments or explore models in isolation, and many link to associated datasets or external environments like DeepMind Lab and StarCraft II. ...

Downloads: 2 This Week

Last Update: 2025-10-07
See Project
13

Reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI

Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
14

Gym

Toolkit for developing and comparing reinforcement learning algorithms

Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents, everything from walking to playing games like Pong or Pinball. Open source interface to reinforce learning tasks. The gym library provides an easy-to-use suite of reinforcement learning tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. ...

Downloads: 2 This Week

Last Update: 2025-03-06
See Project
15

AlphaTensor

AI discovers faster, efficient algorithms for matrix multiplication

AlphaTensor, developed by Google DeepMind, is the research codebase accompanying the 2022 Nature publication “Discovering faster matrix multiplication algorithms with reinforcement learning.” The project demonstrates how reinforcement learning can be used to automatically discover efficient algorithms for matrix multiplication — a fundamental operation in computer science and numerical computation. The repository is organized into four main components: algorithms, benchmarking, nonequivalence, and recombination. ...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
16

pyTorch Tutorials

Build your neural network easy and fast

pyTorch Tutorials is an open-source collection of hands-on tutorials designed to teach developers how to build neural networks with the PyTorch framework. It covers the fundamentals of PyTorch from basic tensor operations to constructing full neural network models, making it suitable for beginners and intermediate learners alike. The project is structured around clear, executable Python scripts and Jupyter notebooks that demonstrate regression, classification, convolutional networks,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
17

ReinventCommunity

Jupyter Notebook tutorials for REINVENT 3.2

This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.

Downloads: 0 This Week

Last Update: 2023-12-23
See Project
18

TRFL

TensorFlow Reinforcement Learning

TRFL, developed by Google DeepMind, is a TensorFlow-based library that provides a collection of essential building blocks for reinforcement learning (RL) algorithms. Pronounced “truffle,” it simplifies the implementation of RL agents by offering reusable components such as loss functions, value estimation tools, and temporal difference (TD) learning operators. The library is designed to integrate seamlessly with TensorFlow, allowing users to define differentiable RL objectives and train models using standard optimization routines. ...

Downloads: 0 This Week

Last Update: 7 days ago
See Project
19

Behaviour Suite Reinforcement Learning

bsuite is a collection of carefully-designed experiments

bsuite is a research framework developed by Google DeepMind that provides a comprehensive collection of experiments for evaluating the core capabilities of reinforcement learning (RL) agents. Its main goal is to identify, measure, and analyze fundamental aspects of learning efficiency and generalization in RL algorithms. The library enables researchers to benchmark their agents on standardized tasks, facilitating reproducible and transparent comparisons across different approaches. Each experiment in bsuite is meticulously designed to capture key challenges in RL, such as exploration, credit assignment, and stability. ...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
20

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
21

Top Deep Learning Projects

A list of popular github projects related to deep learning

TopDeepLearning is a curated index of the most popular GitHub projects related to deep learning, ranked by their star count. Rather than being a library itself, it serves as a curated roadmap and reference guide for anyone exploring the deep learning ecosystem — from beginners to experienced practitioners. By aggregating high-star projects across frameworks (TensorFlow, PyTorch), tools (computer vision, NLP, reinforcement learning), tutorials, and research code, it helps users quickly discover reputable and well-maintained repositories. ...

Downloads: 0 This Week

Last Update: 2025-12-04
See Project
22

ChainerRL

ChainerRL is a deep reinforcement learning library

ChainerRL (this repository) is a deep reinforcement learning library that implements various state-of-the-art deep reinforcement algorithms in Python using Chainer, a flexible deep learning framework. PFRL is the PyTorch analog of ChainerRL. ChainerRL has a set of accompanying visualization tools in order to aid developers' ability to understand and debug their RL agents. With this visualization tool, the behavior of ChainerRL agents can be easily inspected from a browser UI. ...

Downloads: 0 This Week

Last Update: 2022-08-22
See Project
23

MADDPG

Code for the MADDPG algorithm from a paper

...Researchers can use it to reproduce the experiments presented in the paper, which demonstrate how agents learn behaviors such as coordination, competition, and communication. Although archived, MADDPG remains a widely cited baseline in multi-agent reinforcement learning research and has inspired further algorithmic developments.

Downloads: 2 This Week

Last Update: 7 hours ago
See Project
24

Baselines

High-quality implementations of reinforcement learning algorithms

...If you meant a different “baselines” (e.g. OpenAI Baselines for reinforcement learning), I can look up that specific one.

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
25

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.

Downloads: 0 This Week

Last Update: 2024-06-04
See Project