Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Reinforcement Learning Frameworks
Reinforcement Learning Algorithms
Search Results

Search Results for "python q learning" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Windows 62
Linux 61
Mac 61
More...
BSD 13
ChromeOS 13

Category

Artificial Intelligence 62
Software Development 13
Scientific/Engineering 5
Games 4
Education 2
Business 1
Database 1
Formats and Protocols 1
System 1

License

OSI-Approved Open Source 59
Creative Commons Attribution License 1

Translations

English 2

Programming Language

Python 54
C++ 4
Java 4
JavaScript 1

Status

Beta 3
Pre-Alpha 1
Alpha 1
Production/Stable 1

Showing 62 open source projects for "python q learning"

View related business solutions

Reinforcement Learning Algorithms Windows Clear Filters & Widen Search

One verification platform to secure the whole user journey
Handle every identity verification need in a single dashboard. Verify users, businesses or transactions, all while managing cases and deterring fraud.

Sumsub is a full-cycle verification platform that secures every step of the user journey. With Sumsub’s customizable KYC, KYB, AML, Transaction Monitoring and Fraud Prevention solutions, you can orchestrate your verification process, welcome more customers worldwide, meet compliance requirements, reduce costs and protect your business.

Learn More
Content Collaboration and File Sharing Software for Businesses
4,000+ companies trust Files.com to automate and secure business critical transfers.

Files.com provides unified control and reporting for all the file transfers in your business, no matter how they occur technically. Files.com acts as both a client and a server for SFTP, FTP, and AS2, meaning you can easily connect to any partner, customer, or system.

Learn More
1

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

Welcome to H2O LLM Studio, a framework and no-code GUI designed for fine-tuning state-of-the-art large language models (LLMs). You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration file that contains all the experiment parameters. To finetune using H2O LLM Studio with CLI, activate the pipenv environment by running make shell. With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start...

Downloads: 0 This Week

Last Update: 2025-09-18
See Project
2

EasyRL

Reinforcement learning (RL) tutorial series

easy-rl is a beginner-friendly reinforcement learning (RL) tutorial series and framework developed by Datawhale China. It provides educational resources and implementations of various RL algorithms to help new researchers and practitioners learn RL concepts.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
3

Transformer Reinforcement Learning X

A repo for distributed training of language models with Reinforcement

trlX is a distributed training framework designed from the ground up to focus on fine-tuning large language models with reinforcement learning using either a provided reward function or a reward-labeled dataset. Training support for Hugging Face models is provided by Accelerate-backed trainers, allowing users to fine-tune causal and T5-based language models of up to 20B parameters, such as facebook/opt-6.7b, EleutherAI/gpt-neox-20b, and google/flan-t5-xxl. For models beyond 20B parameters,...

Downloads: 0 This Week

Last Update: 2024-08-03
See Project
4

T81 558

Applications of Deep Neural Networks

Deep learning is a group of exciting new technologies for neural networks. Through a combination of advanced training techniques and neural network architectural components, it is now possible to create neural networks that can handle tabular data, images, text, and audio as both input and output. Deep learning allows a neural network to learn hierarchies of information in a way that is like the function of the human brain. This course will introduce the student to classic neural network...

Downloads: 0 This Week

Last Update: 2023-03-27
See Project
Comprehensive Software Built for the Business of Senior Living
Senior living software will help you increase revenue, enhance care, simplify work, and create more meaningful resident outcomes.

Deliver a measurable ROI on workflow process improvements, revenue optimization, marketing and business development, and improved clinical outcomes. Seamlessly connect your entire community, empower your staff to work more efficiently, and generate a holistic view of your community’s financial performance to get the proactive insights you need to streamline your operations and optimize revenue. By centralizing all of your clinical and business operations onto one platform, you’ll give yourself the simplified solutions, economies of scale and enhanced security you need to become a world-class care provider. Gain better insight into your performance against your financial and occupancy goals, improve the safety and satisfaction of your residents and staff, and finally get ahead of the flood of issues you face every day.

Learn More
5

TradeMaster

TradeMaster is an open-source platform for quantitative trading

TradeMaster is a first-of-its-kind, best-in-class open-source platform for quantitative trading (QT) empowered by reinforcement learning (RL), which covers the full pipeline for the design, implementation, evaluation and deployment of RL-based algorithms. TradeMaster is composed of 6 key modules: 1) multi-modality market data of different financial assets at multiple granularities; 2) whole data preprocessing pipeline; 3) a series of high-fidelity data-driven market simulators for mainstream...

Downloads: 6 This Week

Last Update: 2023-12-18
See Project
6

ElegantRL

Massively Parallel Deep Reinforcement Learning

ElegantRL is an efficient and flexible deep reinforcement learning framework designed for researchers and practitioners. It focuses on simplicity, high performance, and supporting advanced RL algorithms.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
7

CleanRL

High-quality single file implementation of Deep Reinforcement Learning

CleanRL is a Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features. The implementation is clean and simple, yet we can scale it to run thousands of experiments using AWS Batch. CleanRL is not a modular library and therefore it is not meant to be imported. At the cost of duplicate code, we make all implementation details of a DRL algorithm variant easy to understand, so CleanRL comes with its own pros and cons. You should...

Downloads: 0 This Week

Last Update: 2022-11-14
See Project
8

Gym

Toolkit for developing and comparing reinforcement learning algorithms

Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents, everything from walking to playing games like Pong or Pinball. Open source interface to reinforce learning tasks. The gym library provides an easy-to-use suite of reinforcement learning tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. It makes no...

Downloads: 0 This Week

Last Update: 2025-03-06
See Project
9

Bullet Physics SDK

Real-time collision detection and multi-physics simulation for VR

...In addition, the simulator can be entirely run on CUDA for fast rollouts, in combination with Augmented Random Search. This allows for 1 million simulation steps per second. It is highly recommended to use PyBullet Python bindings for improved support for robotics, reinforcement learning and VR. Use pip install pybullet and checkout the PyBullet Quickstart Guide.

Downloads: 9 This Week

Last Update: 2022-09-25
See Project
Airlock Digital - Application Control (Allowlisting) Made Simple
Airlock Digital delivers an easy-to-manage and scalable application control solution to protect endpoints with confidence.

For organizations seeking the most effective way to prevent malware and ransomware in their environments. It has been designed to provide scalable, efficient endpoint security for organizations with even the most diverse architectures and rigorous compliance requirements. Built by practitioners for the world’s largest and most secure organizations, Airlock Digital delivers precision Application Control & Allowlisting for the modern enterprise.

Learn More
10

PySC2

StarCraft II learning environment

PySC2 is DeepMind's Python component of the StarCraft II Learning Environment (SC2LE). It exposes Blizzard Entertainment's StarCraft II Machine Learning API as a Python RL Environment. This is a collaboration between DeepMind and Blizzard to develop StarCraft II into a rich environment for RL research. PySC2 provides an interface for RL agents to interact with StarCraft 2, getting observations and sending actions.

Downloads: 1 This Week

Last Update: 2022-07-13
See Project
11

Machine Learning PyTorch Scikit-Learn

Code Repository for Machine Learning with PyTorch and Scikit-Learn

Initially, this project started as the 4th edition of Python Machine Learning. However, after putting so much passion and hard work into the changes and new topics, we thought it deserved a new title. So, what’s new? There are many contents and additions, including the switch from TensorFlow to PyTorch, new chapters on graph neural networks and transformers, a new section on gradient boosting, and many more that I will detail in a separate blog post.

Downloads: 4 This Week

Last Update: 2022-08-22
See Project
12

Trax

Deep learning with clear code and speed

...Trax includes basic models (like ResNet, LSTM, Transformer) and RL algorithms (like REINFORCE, A2C, PPO). It is also actively used for research and includes new models like the Reformer and new RL algorithms like AWR. Trax has bindings to a large number of deep learning datasets, including Tensor2Tensor and TensorFlow datasets. You can use Trax either as a library from your own python scripts and notebooks or as a binary from the shell, which can be more convenient for training large models. It runs without any changes on CPUs, GPUs and TPUs.

Downloads: 0 This Week

Last Update: 2021-10-26
See Project
13

ReinventCommunity

Jupyter Notebook tutorials for REINVENT 3.2

This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.

Downloads: 0 This Week

Last Update: 2023-12-23
See Project
14

Hands-on Unsupervised Learning

Code for Hands-on Unsupervised Learning Using Python (O'Reilly Media)

This repo contains the code for the O'Reilly Media, Inc. book "Hands-on Unsupervised Learning Using Python: How to Build Applied Machine Learning Solutions from Unlabeled Data" by Ankur A. Patel. Many industry experts consider unsupervised learning the next frontier in artificial intelligence, one that may hold the key to the holy grail in AI research, the so-called general artificial intelligence. Since the majority of the world's data is unlabeled, conventional supervised learning cannot be applied; this is where unsupervised learning comes in. ...

Downloads: 1 This Week

Last Update: 2023-03-21
See Project
15

TensorLayer

Deep learning and reinforcement learning library for scientists

TensorLayer is a novel TensorFlow-based deep learning and reinforcement learning library designed for researchers and engineers. It provides an extensive collection of customizable neural layers to build advanced AI models quickly, based on this, the community open-sourced mass tutorials and applications. TensorLayer is awarded the 2017 Best Open Source Software by the ACM Multimedia Society. This project can also be found at OpenI and Gitee. 3.0.0 has been pre-released, the current version...

Downloads: 0 This Week

Last Update: 2022-01-17
See Project
16

Stable Baselines

A fork of OpenAI Baselines, implementations of reinforcement learning

Stable Baselines is a set of improved implementations of reinforcement learning algorithms based on OpenAI Baselines. You can read a detailed presentation of Stable Baselines in the Medium article. These algorithms will make it easier for the research community and industry to replicate, refine, and identify new ideas, and will create good baselines to build projects on top of. We expect these tools will be used as a base around which new ideas can be added, and as a tool for comparing a new...

Downloads: 0 This Week

Last Update: 2024-08-05
See Project
17

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
18

RL Baselines Zoo

A collection of 100+ pre-trained RL agents using Stable Baselines

RL Baselines Zoo is a comprehensive training framework and collection of pre-trained RL agents using Stable-Baselines3. It offers tools for training, tuning, and evaluating RL algorithms across many standard environments, including MuJoCo, Atari, and robotics simulations. Designed for reproducible RL research and benchmarking, it includes scripts, hyperparameter presets, and best practices for training robust agents.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
19

ChainerRL

ChainerRL is a deep reinforcement learning library

ChainerRL (this repository) is a deep reinforcement learning library that implements various state-of-the-art deep reinforcement algorithms in Python using Chainer, a flexible deep learning framework. PFRL is the PyTorch analog of ChainerRL. ChainerRL has a set of accompanying visualization tools in order to aid developers' ability to understand and debug their RL agents. With this visualization tool, the behavior of ChainerRL agents can be easily inspected from a browser UI. ...

Downloads: 0 This Week

Last Update: 2022-08-22
See Project
20

Spinning Up in Deep RL

Educational resource to help anyone learn deep reinforcement learning

Welcome to Spinning Up in Deep RL! This is an educational resource produced by OpenAI that makes it easier to learn about deep reinforcement learning (deep RL). For the unfamiliar, reinforcement learning (RL) is a machine learning approach for teaching agents how to solve tasks by trial and error. Deep RL refers to the combination of RL with deep learning. At OpenAI, we believe that deep learning generally, and deep reinforcement learning specifically, will play central roles in the...

Downloads: 0 This Week

Last Update: 2021-09-02
See Project
21

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures! Optimization courses which form the foundation for ML, DL, RL. Computer Vision courses which are DL & ML heavy. Speech recognition courses which are DL heavy. Structured Courses on Geometric, Graph Neural Networks. Section on Autonomous Vehicles. Section on Computer Graphics with ML/DL focus.

Downloads: 0 This Week

Last Update: 2022-07-29
See Project
22

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.

Downloads: 1 This Week

Last Update: 2024-06-04
See Project
23

Texar

Toolkit for Machine Learning, Natural Language Processing

Texar is a toolkit aiming to support a broad set of machine learning, especially natural language processing and text generation tasks. Texar provides a library of easy-to-use ML modules and functionalities for composing whatever models and algorithms. The tool is designed for both researchers and practitioners for fast prototyping and experimentation. Texar was originally developed and is actively contributed by Petuum and CMU in collaboration with other institutes. A mirror of this...

Downloads: 0 This Week

Last Update: 2022-08-08
See Project
24

Coach

Enables easy experimentation with state of the art algorithms

...Coach collects statistics from the training process and supports advanced visualization techniques for debugging the agent being trained. Coach supports many state-of-the-art reinforcement learning algorithms, which are separated into three main classes - value optimization, policy optimization, and imitation learning. Coach supports a large number of environments which can be solved using reinforcement learning.

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
25

Project Malmo

A platform for Artificial Intelligence experimentation on Minecraft

How can we develop artificial intelligence that learns to make sense of complex environments? That learns from others, including humans, how to interact with the world? That learns transferable skills throughout its existence, and applies them to solve new, challenging problems? Project Malmo sets out to address these core research challenges, addressing them by integrating (deep) reinforcement learning, cognitive science, and many ideas from artificial intelligence. The Malmo platform...

Downloads: 7 This Week

Last Update: 2023-03-23
See Project

Previous
1
You're on page 2
3
Next

Related Searches

algorithmic trading python

llm

lstm

gym software

physics simulation

machine learning

translator

credit card database

self-learning ai

arabic speech recognition

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Games

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2025 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: