Page 5 | python q learning free download

DI-engine

OpenDILab Decision AI Engine

DI-engine is a unified reinforcement learning (RL) platform for reproducible and scalable RL research. It offers modular pipelines for various RL algorithms, with an emphasis on production-level training and evaluation.

Downloads: 0 This Week

Last Update: 2025-03-13

See Project

deepjazz

Deep learning driven jazz generation using Keras & Theano

deepjazz is a deep learning project that generates jazz music using recurrent neural networks trained on MIDI files. The repository demonstrates how machine learning can learn musical structure and produce original compositions. It uses the Keras and Theano libraries to build a two-layer Long Short-Term Memory network capable of learning temporal patterns in music. The system analyzes musical sequences from an input MIDI file and then generates new musical notes that follow similar stylistic...

Downloads: 0 This Week

Last Update: 2026-03-19

See Project

SLM Lab

Modular Deep Reinforcement Learning framework in PyTorch

SLM Lab is a modular and extensible deep reinforcement learning framework designed for research and practical applications. It provides implementations of various state-of-the-art RL algorithms and emphasizes reproducibility, scalability, and detailed experiment tracking. SLM Lab is structured around a flexible experiment management system, allowing users to define, run, and analyze RL experiments efficiently.

Downloads: 0 This Week

Last Update: 2026-03-04

See Project

MLC LLM

Universal LLM Deployment Engine with ML Compilation

MLC LLM is a machine learning compiler and deployment framework designed to enable efficient execution of large language models across a wide range of hardware platforms. The project focuses on compiling models into optimized runtimes that can run natively on devices such as GPUs, mobile processors, browsers, and edge hardware. By leveraging machine learning compilation techniques, mlc-llm produces high-performance inference engines that maintain consistent APIs across platforms. The system...

Downloads: 18 This Week

Last Update: 2026-03-09

See Project

verl

Volcano Engine Reinforcement Learning for LLMs

VERL is a reinforcement-learning–oriented toolkit designed to train and align modern AI systems, from language models to decision-making agents. It brings together supervised fine-tuning, preference modeling, and online RL into one coherent training stack so teams can move from raw data to aligned policies with minimal glue code. The library focuses on scalability and efficiency, offering distributed training loops, mixed precision, and replay/buffering utilities that keep accelerators busy....

Downloads: 0 This Week

Last Update: 2026-03-16

See Project

MEDIUM_NoteBook

Repository containing notebooks of my posts on Medium

...Because the notebooks are designed as educational materials, they often emphasize readability and reproducibility so that readers can easily run and modify the examples. The project is useful for learners who want to explore machine learning concepts interactively using Python and common data science libraries.

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

VoxelMorph

Unsupervised Learning for Image Registration

VoxelMorph is an open-source deep learning framework designed for medical image registration, a process that aligns multiple medical scans into a common spatial coordinate system. Traditional image registration techniques typically rely on optimization procedures that must be executed separately for each pair of images, which can be computationally expensive and slow. VoxelMorph approaches the problem using neural networks that learn to predict deformation fields that transform one image so...

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Data-Science-Interview-Questions-Answers

Curated list of data science interview questions and answers

...The repository focuses on core data science fundamentals rather than acting as a software framework, which makes it especially useful as a study and revision resource. Its content is organized into subject-specific documents that cover machine learning, deep learning, statistics, probability, Python, SQL and databases, and resume-based interview questions. That structure makes it practical for users who want to study by topic, strengthen weak areas, or simulate the range of questions they may encounter in interviews.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Data Science Interviews

Data science interview questions and answers

Data Science Interviews is an open-source repository that collects common data science interview questions along with community-provided answers and explanations. The project serves as a preparation resource for students, job seekers, and professionals who want to review the technical knowledge required for data science roles. The repository organizes questions into different categories including theoretical machine learning concepts, technical programming questions, and probability or...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

TTRL

Test-Time Reinforcement Learning

TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. This makes the...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

D4RL

Collection of reference environments, offline reinforcement learning

D4RL (Datasets for Deep Data-Driven Reinforcement Learning) is a benchmark suite focused on offline reinforcement learning — i.e., learning policies from fixed datasets rather than via online interaction with the environment. It contains standardized environments, tasks and datasets (observations, actions, rewards, terminals) aimed at enabling reproducible research in offline RL. Researchers can load a dataset for a given task (e.g., maze navigation, manipulation) and apply their algorithm...

Downloads: 0 This Week

Last Update: 2025-11-25

See Project

SCAIL

Towards Studio-Grade Character Animation via In-Context Learning of 3D

SCAIL is a project developed by the ZAI Organization, focusing on AI-driven research initiatives. While specific documentation about SCAIL’s exact goals and implementation is limited from the repository context alone, the project appears to be part of a collection of machine learning and AI research tools that facilitate scalable model development, evaluation, or application workflows. Given its listing alongside other ZAI projects like speech recognition and text-to-speech systems, SCAIL...

Downloads: 0 This Week

Last Update: 2026-05-06

See Project

AIDE ML

AI-Driven Exploration in the Space of Code

...The system repeatedly improves its generated code by exploring different implementation paths and selecting the best-performing solutions. AIDE ML is packaged as a Python toolkit with built-in utilities such as command-line tools, configuration presets, and visualization interfaces that allow researchers to observe how the search process evolves. The framework is designed for experimentation and academic research into automated programming and machine learning optimization.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

Perfect Roadmap To Learn Data Science

Basic To Intermediate Python data science guide

Perfect Roadmap To Learn Data Science In 2025 is an extended, updated learning pathway curated for the modern data-science landscape — blending classical data-analysis, statistics, machine learning, deep learning, computer vision, NLP, as well as current deployment and MLOps practices to prepare learners for data-science careers in 2025. The roadmap is organized to guide learners systematically: starting with Python fundamentals and math/statistics, then progressing through classical machine-learning, deep-learning, data preprocessing, feature engineering, and onto domain-specific applications like computer vision or NLP, ending with deployment, real-world project construction, and best practices for production readiness. ...

Downloads: 0 This Week

Last Update: 2025-12-02

See Project

PySINDy

A package for the sparse identification of nonlinear dynamical systems

PySINDy is a Python library that implements the Sparse Identification of Nonlinear Dynamics (SINDy) method for discovering mathematical models of dynamical systems from data. The framework focuses on identifying governing equations that describe the behavior of complex physical systems by selecting sparse combinations of candidate functions. Instead of fitting a purely predictive machine learning model, PySINDy attempts to recover interpretable differential equations that explain how a system evolves over time. ...

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

skfolio

Python library for portfolio optimization built on top of scikit-learn

skfolio is a Python library designed for portfolio optimization and financial risk management that integrates closely with the scikit-learn ecosystem. The project provides a unified machine learning-style framework for building, validating, and comparing portfolio allocation strategies using financial data. By following the familiar scikit-learn API design, the library allows quantitative researchers and developers to apply techniques such as model selection, cross-validation, and hyperparameter tuning to portfolio construction workflows. ...

Downloads: 0 This Week

Last Update: 2026-04-21

See Project

pytudes

Python programs, usually short, of considerable difficulty

...It is useful for programmers who want to study elegant Python code while learning how experienced developers approach problem solving. Many examples emphasize clarity and compactness rather than framework-heavy engineering. pytudes is best understood as a learning library, a coding style reference, and a set of practical programming studies.

Downloads: 1 This Week

Last Update: 4 days ago

See Project

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow

SimpleHTR is an open-source implementation of a handwriting text recognition system based on deep learning techniques. The project focuses on converting images of handwritten text into machine-readable digital text using neural networks. The system uses a combination of convolutional neural networks and recurrent neural networks to extract visual features and model sequential character patterns in handwriting. It also employs connectionist temporal classification (CTC) to align predicted...

Downloads: 2 This Week

Last Update: 2026-03-12

See Project

Megatron

Ongoing research training transformer models at scale

Megatron is a large, powerful transformer developed by the Applied Deep Learning Research team at NVIDIA. This repository is for ongoing research on training large transformer language models at scale. We developed efficient, model-parallel (tensor, sequence, and pipeline), and multi-node pre-training of transformer based models such as GPT, BERT, and T5 using mixed precision. Megatron is also used in NeMo Megatron, a framework to help enterprises overcome the challenges of building and...

Downloads: 0 This Week

Last Update: 4 days ago

See Project

AutoTrain Advanced

Faster and easier training and deployments

AutoTrain Advanced is an open-source machine learning training framework developed by Hugging Face that simplifies the process of training and fine-tuning state-of-the-art AI models. The project provides a no-code and low-code interface that allows users to train models using custom datasets without needing extensive expertise in machine learning engineering. It supports a wide range of tasks including text classification, sequence-to-sequence modeling, token classification, sentence...

Downloads: 1 This Week

Last Update: 2026-04-15

See Project

freeCodeCamp

freeCodeCamp.org's open-source codebase and curriculum

freeCodeCamp is a nonprofit educational platform that offers a self-paced curriculum for learning web development, programming, data visualization, APIs, and algorithms. It features interactive coding challenges, real-world projects, and guided progress through topic modules, culminating in certificates for completed tracks. A key aspect is that students contribute to open-source projects for nonprofits or internal tooling as part of their learning, reinforcing both technical and...

Downloads: 48 This Week

Last Update: 6 days ago

See Project

Diffusion for World Modeling

Learning agent trained in a diffusion world model

Diffusion for World Modeling is an experimental reinforcement learning system that trains intelligent agents inside a simulated environment generated by a diffusion-based world model. The project introduces the idea of using diffusion models, commonly used for image generation, to simulate the dynamics of an environment and predict future states based on previous observations and actions. Instead of interacting directly with a real environment, the reinforcement learning agent learns within...

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

The Data Engineering Handbook

Links to everything you'd ever want to learn about data engineering

The Data Engineering Handbook is a comprehensive, community-curated repository that aggregates essential learning resources for anyone interested in becoming a professional data engineer. Rather than being a code project itself, it’s a learning handbook that links to books, articles, tutorials, community groups, boot camps, and real-world project examples that collectively form a roadmap to mastering data engineering skills. It includes beginner and intermediate boot camps, interview guides,...

Downloads: 2 This Week

Last Update: 2026-04-02

See Project

Humanoid-Gym

Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

Humanoid-Gym is a reinforcement learning framework designed to train locomotion and control policies for humanoid robots using high-performance simulation environments. The system is built on top of NVIDIA Isaac Gym, which allows large-scale parallel simulation of robotic environments directly on GPU hardware. Its primary goal is to enable efficient training of humanoid robots in simulation while enabling policies to transfer effectively to real-world hardware without additional training....

Downloads: 0 This Week

Last Update: 2026-03-15

See Project

Diffrax

Numerical differential equation solvers in JAX

Diffrax is a numerical differential equation solving library built for the JAX ecosystem, with a strong focus on composability, differentiability, and high-performance scientific computing. The project provides tools for solving ordinary differential equations, stochastic differential equations, controlled differential equations, and related systems in a way that fits naturally into modern machine learning and differentiable programming workflows. Because it is written to work closely with...

Downloads: 0 This Week

Last Update: 2026-03-12

See Project

Search Results for "python q learning" - Page 5

667 projects for "python q learning" with 1 filter applied:

DI-engine

deepjazz

SLM Lab

MLC LLM

verl

MEDIUM_NoteBook

VoxelMorph

Data-Science-Interview-Questions-Answers

Data Science Interviews

TTRL

D4RL

SCAIL

AIDE ML

Perfect Roadmap To Learn Data Science

PySINDy

skfolio

pytudes

SimpleHTR

Megatron

AutoTrain Advanced

freeCodeCamp

Diffusion for World Modeling

The Data Engineering Handbook

Humanoid-Gym

Diffrax

Search Results for "python q learning" - Page 5

667 projects for "python q learning" with 1 filter applied:

DI-engine

deepjazz

SLM Lab

MLC LLM

verl

MEDIUM_NoteBook

VoxelMorph

Data-Science-Interview-Questions-Answers

Data Science Interviews

TTRL

D4RL

SCAIL

AIDE ML

Perfect Roadmap To Learn Data Science

PySINDy

skfolio

pytudes

SimpleHTR

Megatron

AutoTrain Advanced

freeCodeCamp

Diffusion for World Modeling

The Data Engineering Handbook

Humanoid-Gym

Diffrax

Related Searches

Related Categories