q learning algorithm free download

deep-q-learning

Minimal Deep Q Learning (DQN & DDQN) implementations in Keras

The deep-q-learning repository authored by keon provides a Python-based implementation of the Deep Q-Learning algorithm — a cornerstone method in reinforcement learning. It implements the core logic needed to train an agent using Q-learning with neural networks (i.e. approximating Q-values via deep nets), setting up environment interaction loops, experience replay, network updates, and policy behavior.

Downloads: 0 This Week

Last Update: 2026-05-18

See Project

RL with PyTorch

Clean, Robust, and Unified PyTorch implementation

RL with PyTorch is a research-oriented repository that provides implementations of deep reinforcement learning algorithms using the PyTorch framework. The project focuses on helping developers and researchers understand reinforcement learning methods by providing clean and reproducible implementations of well-known algorithms. It includes code for popular deep reinforcement learning techniques such as Deep Q-Networks, policy gradient methods, actor-critic architectures, and other modern RL approaches. ...

Downloads: 0 This Week

Last Update: 2026-03-11

See Project

Interpretable machine learning

Book about interpretable machine learning

This book is about interpretable machine learning. Machine learning is being built into many products and processes of our daily lives, yet decisions made by machines don't automatically come with an explanation. An explanation increases the trust in the decision and in the machine learning model. As the programmer of an algorithm you want to know whether you can trust the learned model.

Downloads: 7 This Week

Last Update: 2025-03-13

See Project

Homemade Machine Learning

Python examples of popular machine learning algorithms

homemade-machine-learning is a repository by Oleksii Trekhleb containing Python implementations of classic machine-learning algorithms done “from scratch”, meaning you don’t rely heavily on high-level libraries but instead write the logic yourself to deepen understanding. Each algorithm is accompanied by mathematical explanations, visualizations (often via Jupyter notebooks), and interactive demos so you can tweak parameters, data, and observe outcomes in real time. ...

Downloads: 0 This Week

Last Update: 2025-11-23

See Project

X's Recommendation Algorithm

Source code for the X Recommendation Algorithm

...While certain components (such as safety layers, spam detection, or private data) are excluded, the release provides valuable insights into the design of real-world machine learning–driven ranking systems. The project is intended as a reference for researchers, developers, and the public to study, experiment with, and better understand the mechanisms behind social media content.

Downloads: 2 This Week

Last Update: 1 day ago

See Project

how-to-optim-algorithm-in-cuda

How to optimize some algorithm in cuda

how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency.

Downloads: 2 This Week

Last Update: 3 days ago

See Project

ML-NLP

This project is a common knowledge point and code implementation

ML-NLP is a large open-source repository that collects theoretical knowledge, practical explanations, and code examples related to machine learning, deep learning, and natural language processing. The project is designed primarily as a learning resource for algorithm engineers and students preparing for technical interviews in machine learning or NLP roles. It compiles important concepts that frequently appear in machine learning discussions, including neural network architectures, training methods, and common algorithmic techniques. ...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

machine-learning-refined

Master the fundamentals of machine learning, deep learning

machine-learning-refined is an educational repository designed to help students and practitioners understand machine learning algorithms through intuitive explanations and interactive examples. The project accompanies a series of textbooks and teaching materials that focus on making machine learning concepts accessible through visual demonstrations and simple code implementations. Instead of presenting algorithms purely through mathematical derivations, the repository emphasizes geometric...

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

Machine learning algorithms

Minimal and clean examples of machine learning algorithms

Machine learning algorithms is an open-source repository that provides minimal and clean implementations of machine learning algorithms written primarily in Python. The project focuses on demonstrating how fundamental machine learning methods work internally by implementing them from scratch rather than relying on high-level libraries. This approach allows learners to study the mathematical and algorithmic details behind widely used models in a transparent and readable way. The repository...

Downloads: 0 This Week

Last Update: 2026-05-07

See Project

Adapters

A Unified Library for Parameter-Efficient Learning

Adapters is an add-on library to HuggingFace's Transformers, integrating 10+ adapter methods into 20+ state-of-the-art Transformer models with minimal coding overhead for training and inference. Adapters provide a unified interface for efficient fine-tuning and modular transfer learning, supporting a myriad of features like full-precision or quantized training (e.g. Q-LoRA, Q-Bottleneck Adapters, or Q-PrefixTuning), adapter merging via task arithmetics or the composition of multiple adapters via composition blocks, allowing advanced research in parameter-efficient transfer learning for NLP tasks.

Downloads: 0 This Week

Last Update: 2026-04-26

See Project

All RL Algorithms from Scratch

Implementation of all RL algorithms in a simpler way

...Implemented topics include Q-learning, SARSA, Expected SARSA, Dyna-Q, REINFORCE, PPO, A2C, A3C, DDPG, SAC, TRPO, DQN, MADDPG, QMIX, HAC, MCTS, and PlaNet. The code prioritizes clarity, experimentation, and mathematical intuition over production speed. A companion cheat sheet gives learners a quick reference for formulas, pseudocode, and key concepts.

Downloads: 0 This Week

Last Update: 2026-07-06

See Project

PyGAD

Source code of PyGAD, Python 3 library for building genetic algorithms

PyGAD is an open-source easy-to-use Python 3 library for building the genetic algorithm and optimizing machine learning algorithms. It supports Keras and PyTorch. PyGAD supports optimizing both single-objective and multi-objective problems. PyGAD supports different types of crossover, mutation, and parent selection. PyGAD allows different types of problems to be optimized using the genetic algorithm by customizing the fitness function.

Downloads: 0 This Week

Last Update: 2026-06-05

See Project

DreamerV3

Mastering Diverse Domains through World Models

DreamerV3 is an open-source implementation of a reinforcement learning algorithm that uses world models to train intelligent agents capable of learning complex behaviors across many environments. The system works by building an internal model of the environment and then using that model to simulate possible future outcomes of actions, allowing the agent to learn from imagined experiences rather than only from real interactions.

Downloads: 0 This Week

Last Update: 2026-05-25

See Project

SHAP

A game theoretic approach to explain the output of ml models

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, we have developed a high-speed exact algorithm for tree ensemble methods. Fast C++ implementations are supported for XGBoost, LightGBM, CatBoost, scikit-learn and pyspark tree models. ...

Downloads: 1 This Week

Last Update: 2026-05-28

See Project

LightZero

[NeurIPS 2023 Spotlight] LightZero

LightZero is an efficient, scalable, and open-source framework implementing MuZero, a powerful model-based reinforcement learning algorithm that learns to predict rewards and transitions without explicit environment models. Developed by OpenDILab, LightZero focuses on providing a highly optimized and user-friendly platform for both academic research and industrial applications of MuZero and similar algorithms.

Downloads: 3 This Week

Last Update: 2025-04-09

See Project

D4RL

Collection of reference environments, offline reinforcement learning

D4RL (Datasets for Deep Data-Driven Reinforcement Learning) is a benchmark suite focused on offline reinforcement learning — i.e., learning policies from fixed datasets rather than via online interaction with the environment. It contains standardized environments, tasks and datasets (observations, actions, rewards, terminals) aimed at enabling reproducible research in offline RL. Researchers can load a dataset for a given task (e.g., maze navigation, manipulation) and apply their algorithm without the need to collect fresh transitions, which accelerates experimentation and comparison. ...

Downloads: 0 This Week

Last Update: 2025-11-25

See Project

zvt

Modular quant framework

For practical trading, a complex algorithm is fragile, a complex algorithm building on a complex facility is more fragile, complex algorithm building on a complex facility by a complex team is more and more fragile. zvt wants to provide a simple facility for building a straightforward algorithm. Technologies come and technologies go, but market insight is forever. Your world is built by core concepts inside you, so it’s you. zvt world is built by core concepts inside the market, so it’s zvt....

Downloads: 0 This Week

Last Update: 2026-01-18

See Project

DeepTutor

AI-Powered Personalized Learning Assistant

DeepTutor is an AI-powered tutoring and learning assistant framework designed to automatically teach, explain, and reinforce academic or technical concepts in depth according to a learner’s specific needs. It goes beyond simple Q&A by constructing multi-stage educational narratives, breaking down complex topics into sequenced “lesson steps,” and offering prompts, examples, and exercises that build on each other in a logical curriculum.

Downloads: 4 This Week

Last Update: 2 days ago

See Project

R1-V

Witness the aha moment of VLM with less than $3

R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.

Downloads: 0 This Week

Last Update: 2025-03-19

See Project

openTSNE

Extensible, parallel implementations of t-SNE

openTSNE is a modular Python implementation of t-Distributed Stochasitc Neighbor Embedding (t-SNE) [1], a popular dimensionality-reduction algorithm for visualizing high-dimensional data sets. openTSNE incorporates the latest improvements to the t-SNE algorithm, including the ability to add new data points to existing embeddings [2], massive speed improvements [3] [4] [5], enabling t-SNE to scale to millions of data points, and various tricks to improve the global alignment of the resulting...

Downloads: 0 This Week

Last Update: 2024-08-19

See Project

Appfl

Advanced Privacy-Preserving Federated Learning framework

APPFL (Advanced Privacy-Preserving Federated Learning) is a Python framework enabling researchers to easily build and benchmark privacy-aware federated learning solutions. It supports flexible algorithm development, differential privacy, secure communications, and runs efficiently on HPC and multi-GPU setups.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

EduCDM

The Model Zoo of cognitive diagnosis models

The Model Zoo of Cognitive Diagnosis Models, including classic Item Response Ranking (IRT), Multidimensional Item Response Ranking (MIRT), Deterministic Input, Noisy "And" model(DINA), and advanced Fuzzy Cognitive Diagnosis Framework (FuzzyCDF), Neural Cognitive Diagnosis Model (NCDM), Item Response Ranking framework (IRR), Incremental Cognitive Diagnosis (ICD) and Knowledge-association baesd extension of NeuralCD (KaNCD). Cognitive diagnosis model (CDM) for intelligent educational systems is a type of model that infers students' knowledge states from their learning behaviors (especially exercise response logs). Typically, the input of a CDM could be the students' response logs of items (i.e., exercises/questions), the Q-matrix that denotes the correlation between items and knowledge concepts (skills). The output is the diagnosed student knowledge states, such as students' abilities and students' proficiencies on each knowledge concepts.

Downloads: 2 This Week

Last Update: 2024-10-25

See Project

AIDE ML

AI-Driven Exploration in the Space of Code

AIDE ML is an open-source research framework designed to explore automated machine learning development through agent-based search and code optimization. The project implements the AIDE algorithm, which uses a tree-search strategy guided by large language models to iteratively generate, evaluate, and refine code. Instead of relying on manual experimentation, the agent autonomously drafts machine learning pipelines, debugs errors, and benchmarks performance against user-defined evaluation metrics. ...

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

DeepPavlov

A library for deep learning end-to-end dialog systems and chatbots

...It has comprehensive and flexible tools that let developers and NLP researchers create production-ready conversational skills and complex multi-skill conversational assistants. Use BERT and other state-of-the-art deep learning models to solve classification, NER, Q&A and other NLP tasks. DeepPavlov Agent allows building industrial solutions with multi-skill integration via API services.

Downloads: 1 This Week

Last Update: 2024-08-12

See Project

MLJAR Studio

Python package for AutoML on Tabular Data with Feature Engineering

We are working on new way for visual programming. We developed a desktop application called MLJAR Studio. It is a notebook-based development environment with interactive code recipes and a managed Python environment. All running locally on your machine. We are waiting for your feedback. The mljar-supervised is an Automated Machine Learning Python package that works with tabular data. It is designed to save time for a data scientist. It abstracts the common way to preprocess the data,...

Downloads: 0 This Week

Last Update: 2026-06-11

See Project

Search Results for "q learning algorithm"

Showing 111 open source projects for "q learning algorithm"

deep-q-learning

RL with PyTorch

Interpretable machine learning

Homemade Machine Learning

X's Recommendation Algorithm

how-to-optim-algorithm-in-cuda

ML-NLP

machine-learning-refined

Machine learning algorithms

Adapters

All RL Algorithms from Scratch

PyGAD

DreamerV3

SHAP

LightZero

D4RL

zvt

DeepTutor

R1-V

openTSNE

Appfl

EduCDM

AIDE ML

DeepPavlov

MLJAR Studio

Search Results for "q learning algorithm"

Showing 111 open source projects for "q learning algorithm"

deep-q-learning

RL with PyTorch

Interpretable machine learning

Homemade Machine Learning

X's Recommendation Algorithm

how-to-optim-algorithm-in-cuda

ML-NLP

machine-learning-refined

Machine learning algorithms

Adapters

All RL Algorithms from Scratch

PyGAD

DreamerV3

SHAP

LightZero

D4RL

zvt

DeepTutor

R1-V

openTSNE

Appfl

EduCDM

AIDE ML

DeepPavlov

MLJAR Studio

Related Searches

Related Categories