Search Results for "q learning algorithm" - Page 5

Showing 305 open source projects for "q learning algorithm"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 1
    javascript-algorithms

    javascript-algorithms

    JavaScript implementations of computer science algorithms

    javascript-algorithms is a repository that implements a comprehensive collection of classic data structures and algorithmic solutions in JavaScript — including sorting, searching, graph algorithms, dynamic programming, and more. It serves as a learning and reference resource for developers wanting to understand how standard algorithms can be expressed in JavaScript, bridging the gap between theoretical algorithm knowledge and practical implementation usable in web or Node.js contexts. Because algorithms are implemented in a language (JavaScript) many developers use daily, the repo becomes especially useful for students, interview-preparation candidates, or anyone wanting to learn algorithmic thinking without switching languages. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    YoloV3 Implemented in TensorFlow 2.0

    YoloV3 Implemented in TensorFlow 2.0

    YoloV3 Implemented in Tensorflow 2.0

    YoloV3 Implemented in TensorFlow 2.0 is built using TensorFlow 2.0. The project provides a modern deep learning implementation of the popular YOLOv3 algorithm, which is widely used for real-time object detection in images and video streams. YOLOv3 works by dividing an image into grid regions and predicting bounding boxes and class probabilities simultaneously, allowing objects to be detected quickly and efficiently. The repository includes training scripts, inference tools, and configuration files that make it possible to train custom object detection models on user-defined datasets. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    AnyTrading

    AnyTrading

    The most simple, flexible, and comprehensive OpenAI Gym trading

    gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    minimalRL-pytorch

    minimalRL-pytorch

    Implementations of basic RL algorithms with minimal lines of codes

    ...The repository includes examples of widely used reinforcement learning methods such as REINFORCE, Deep Q-Networks, Proximal Policy Optimization, and Actor-Critic architectures. Most experiments are designed to run quickly using the CartPole environment so that users can focus on understanding algorithm logic rather than computational infrastructure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5

    Lumi-HSP

    This is an AI language model that can predict Heart failure or stroke

    Using thsi AI model, you can predict the chances of heart stroke and heart failure. HIGLIGHTS : 1. Accuracy of this model is 95% 2. This model uses the powerful Machine Learning algorithm "GradientBoosting" for predicting the outcomes. 3. An easy to use model and accessible to everyone.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    AB3DMOT

    AB3DMOT

    Official Python Implementation for "3D Multi-Object Tracking

    AB3DMOT is a real-time 3D multi-object tracking framework designed for applications such as autonomous driving and robotics perception. The system processes detection results from 3D object detectors that analyze LiDAR point clouds and uses them to track multiple objects across consecutive frames. Its tracking pipeline relies on a combination of classical algorithms, including a Kalman filter for state estimation and the Hungarian algorithm for data association between detected objects and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LightFM

    LightFM

    A Python implementation of LightFM, a hybrid recommendation algorithm

    LightFM is a Python implementation of a number of popular recommendation algorithms for both implicit and explicit feedback, including efficient implementation of BPR and WARP ranking losses. It's easy to use, fast (via multithreaded model estimation), and produces high-quality results. It also makes it possible to incorporate both item and user metadata into the traditional matrix factorization algorithms. It represents each user and item as the sum of the latent representations of their...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    PARL

    PARL

    A high-performance distributed training framework

    PARL is a scalable reinforcement learning framework built on top of PaddlePaddle. It focuses on modularity and ease of use, supporting distributed training and a variety of RL algorithms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    auto-sklearn

    auto-sklearn

    Automated machine learning with scikit-learn

    auto-sklearn is an automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator. auto-sklearn frees a machine learning user from algorithm selection and hyperparameter tuning. It leverages recent advantages in Bayesian optimization, meta-learning and ensemble construction. Auto-sklearn 2.0 includes latest research on automatically configuring the AutoML system itself and contains a multitude of improvements which speed up the fitting the AutoML system. auto-sklearn 2.0 works the same way as regular auto-sklearn. auto-sklearn is licensed the same way as scikit-learn, namely the 3-clause BSD license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 10
    FFCV

    FFCV

    Fast Forward Computer Vision (and other ML workloads!)

    ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    The Art of Programming

    The Art of Programming

    A collection of practical tips can be found at the bottom of this page

    ...In July 2023, work on the second edition was announced, which expands the project with updated content, new problems inspired by recent big-tech interviews, and introductions to modern machine learning techniques such as XGBoost, CNNs, RNNs, and LSTMs. This collection serves both as a historical record of algorithm problem-solving and as a living resource for programmers preparing for interviews.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Reinforcement-learning

    Reinforcement-learning

    Implementation of Reinforcement Learning Algorithms. Python, OpenAI

    Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Reinforcement Learning Methods

    Reinforcement Learning Methods

    Simple Reinforcement learning tutorials

    Reinforcement-Learning-with-TensorFlow is an educational repository that walks through key reinforcement learning algorithms implemented in TensorFlow. It provides clear code examples for foundational techniques like Q-learning, policy gradients, deep Q-networks, actor-critic methods, and value function approximation within familiar simulation environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CleanRL

    CleanRL

    High-quality single file implementation of Deep Reinforcement Learning

    CleanRL is a Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features. The implementation is clean and simple, yet we can scale it to run thousands of experiments using AWS Batch. CleanRL is not a modular library and therefore it is not meant to be imported. At the cost of duplicate code, we make all implementation details of a DRL algorithm variant easy to understand, so CleanRL comes with its own pros and cons. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    PixelAnnotationTool

    PixelAnnotationTool

    Annotate quickly images

    Software that allows you to manually and quickly annotate images in directories. The method is pseudo manual because it uses the algorithm watershed marked of OpenCV. The general idea is to manually provide the marker with brushes and then to launch the algorithm. If at first pass the segmentation needs to be corrected, the user can refine the markers by drawing new ones on the erroneous areas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Resources for Competitive Programming

    Resources for Competitive Programming

    Competitive Programming & System Design Resources

    Competitive Programming Resources is a curated collection for learners preparing for competitive programming, coding interviews, system design, and problem solving. It gathers video tutorials, practice platforms, algorithm references, interview preparation links, and system design materials in one place. The repository is not a software library but a structured learning hub for people who want to improve algorithmic thinking and technical interview readiness. It points users toward platforms such as Codeforces, CodeChef, LeetCode, AtCoder, TopCoder, SPOJ, HackerRank, and Project Euler. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FedLab

    FedLab

    A flexible Federated Learning Framework based on PyTorch

    A Python-based framework for federated learning simulation, emphasizing modularity, communication efficiency, and algorithmic flexibility. Supports both server- and client-side customization for research and development purposes.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Gym

    Gym

    Toolkit for developing and comparing reinforcement learning algorithms

    Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents, everything from walking to playing games like Pong or Pinball. Open source interface to reinforce learning tasks. The gym library provides an easy-to-use suite of reinforcement learning tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    GFPGAN

    GFPGAN

    GFPGAN aims at developing Practical Algorithms

    GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. Colab Demo for GFPGAN; (Another Colab Demo for the original paper model) Online demo: Huggingface (return only the cropped face) Online demo: Replicate.ai (may need to sign in, return the whole image). Online demo: Baseten.co (backed by GPU, returns the whole image). We provide a clean version of GFPGAN, which can run without CUDA extensions. So that it can run in Windows or on CPU mode. GFPGAN aims at developing...
    Downloads: 60 This Week
    Last Update:
    See Project
  • 20
    MTCNN Face Detection Alignment

    MTCNN Face Detection Alignment

    Joint Face Detection and Alignment

    MTCNN_face_detection_alignment is an implementation of the “Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks” algorithm. The algorithm uses a cascade of three convolutional networks (P-Net, R-Net, O-Net) to jointly detect faces (bounding boxes) and align facial landmarks in a coarse-to-fine manner, leveraging multi-task learning. Non-maximum suppression and bounding box regression at each stage. The repository includes Caffe / MATLAB code, support scripts, and instructions for dependencies. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Python ML Jupyter Notebooks

    Python ML Jupyter Notebooks

    Practice and tutorial-style notebooks

    ...The repository is designed to help learners understand both the theory and practical implementation of machine learning algorithms through step-by-step code examples. Many notebooks include explanations of algorithm behavior, data preparation techniques, and evaluation methods for machine learning models. The project also includes examples that demonstrate how to apply machine learning to real-world datasets and practical business problems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Auto-PyTorch

    Auto-PyTorch

    Automatic architecture search and hyperparameter optimization

    While early AutoML frameworks focused on optimizing traditional ML pipelines and their hyperparameters, another trend in AutoML is to focus on neural architecture search. To bring the best of these two worlds together, we developed Auto-PyTorch, which jointly and robustly optimizes the network architecture and the training hyperparameters to enable fully automated deep learning (AutoDL). Auto-PyTorch is mainly developed to support tabular data (classification, regression) and time series...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    WaveFunctionCollapse

    WaveFunctionCollapse

    Bitmap & tilemap generation from a single example

    This program generates bitmaps that are locally similar to the input bitmap. WFC initializes output bitmap in a completely unobserved state, where each pixel value is in superposition of colors of the input bitmap (so if the input was black & white then the unobserved states are shown in different shades of grey). The coefficients in these superpositions are real numbers, not complex numbers, so it doesn't do the actual quantum mechanics, but it was inspired by QM. Then the program goes into...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Reskin Sensor Library

    Reskin Sensor Library

    ReSkin Sensor Interfacing Library

    ...However, the same properties of conformal contact result in faster deterioration of soft sensors and larger variations in their response characteristics over time and across samples, inhibiting their ability to be long-lasting and replaceable. ReSkin is a tactile soft sensor that leverages machine learning and magnetic sensing to offer a low-cost, diverse and compact solution for long-term use. Magnetic sensing separates the electronic circuitry from the passive-interface, making it easier to replace interfaces as they wear out while allowing for a wide variety of form factors. Machine learning allows us to learn sensor response models that are robust to variations across fabrication and time, and our self-supervised learning algorithm enables finer performance enhancement.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MuZero General

    MuZero General

    A commented and documented implementation of MuZero

    muzero-general is an open-source implementation of the MuZero reinforcement learning algorithm introduced by DeepMind. MuZero is a model-based reinforcement learning method that combines neural networks with Monte Carlo Tree Search to learn decision-making policies without requiring explicit knowledge of the environment’s dynamics. The repository provides a well-documented and commented implementation designed primarily for educational purposes.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo