Search Results for "q learning algorithm" - Page 7

Showing 305 open source projects for "q learning algorithm"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    karatasi - flip cards on iPhone
    Flip card learning program for iPhone with a spaced learning algorithm. Create your own databases and edit the cards directly on the iPhone. Import Palm databases or csv-formatted files and backup your data with our Java application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Deep-Learning-for-Recommendation-Systems

    Deep-Learning-for-Recommendation-Systems

    This repository contains Deep Learning based articles

    Deep-Learning-for-Recommendation-Systems is a curated repository that aggregates research papers, articles, and code related to deep learning methods for recommender systems. The project organizes influential academic work covering topics such as collaborative filtering, neural recommendation models, and deep feature learning. It includes references to papers describing architectures like collaborative deep learning, neural autoregressive models, and convolutional approaches to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Python Machine Learning

    Python Machine Learning

    The "Python Machine Learning (2nd edition)" book code repository

    This repository accompanies the well-known textbook “Python Machine Learning, 2nd Edition” by Sebastian Raschka and Vahid Mirjalili, serving as a complete codebase of examples, notebooks, scripts and supporting materials for the book. It covers a wide range of topics including supervised learning, unsupervised learning, dimensionality reduction, model evaluation, deep learning with TensorFlow, and embedding models into web apps. Each chapter has Jupyter notebooks and Python scripts that...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    RecNN

    RecNN

    Reinforced Recommendation toolkit built around pytorch 1.7

    This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Machine Learning From Scratch

    Machine Learning From Scratch

    Bare bones NumPy implementations of machine learning models

    ML-From-Scratch is an open-source machine learning project that demonstrates how to implement common machine learning algorithms using only basic Python and NumPy rather than relying on high-level frameworks. The goal of the project is to help learners understand how machine learning algorithms work internally by building them step by step from fundamental mathematical operations. The repository includes implementations of algorithms ranging from simple models such as linear regression and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    CCZero (中国象棋Zero)

    CCZero (中国象棋Zero)

    Implement AlphaZero/AlphaGo Zero methods on Chinese chess

    ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    benchm-ml

    benchm-ml

    A benchmark of commonly used open source implementations

    ...The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. The repository is structured in logical folders, each corresponding to algorithm categories.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Coach

    Coach

    Enables easy experimentation with state of the art algorithms

    Coach is a python framework that models the interaction between an agent and an environment in a modular way. With Coach, it is possible to model an agent by combining various building blocks, and training the agent on multiple environments. The available environments allow testing the agent in different fields such as robotics, autonomous driving, games and more. It exposes a set of easy-to-use APIs for experimenting with new RL algorithms and allows simple integration of new environments...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 10
    Active Learning

    Active Learning

    Framework and examples for active learning with machine learning model

    ...The main experiment runner (run_experiment.py) supports a wide range of configurations, including batch sizes, dataset subsets, model selection, and data preprocessing options. It includes several established active learning strategies such as uncertainty sampling, k-center greedy selection, and bandit-based methods, while also allowing for custom algorithm implementations. The framework integrates with both classical machine learning models (SVM, logistic regression) and neural networks.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Lori's Help

    Lori's Help

    An Android app to help people with Down Syndrome in their literacy

    Lori Help is an Android application that provides support for the literacy of people with Down syndrome. The application has 4 activities to aid in learning, 3 of them with emphasis on the literacy process and 1 focused on sensory stimuli. Application activities are monitored by a biofeedback algorithm (known as Attention Meter). The algorithm observes the variations of the user's micro facial expressions with the intention of measuring the level of attention during the accomplishment of the activities. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    easy12306

    easy12306

    Automatic recognition of 12306 verification code

    Automatic recognition of 12306 verification code using machine learning algorithm. Identify never-before-seen pictures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    spark-ml-source-analysis

    spark-ml-source-analysis

    Spark ml algorithm principle analysis and specific source code

    spark-ml-source-analysis is a technical repository that analyzes the internal implementation of machine learning algorithms within Apache Spark’s MLlib library. The project aims to help developers and data scientists understand how distributed machine learning algorithms are implemented and optimized inside the Spark ecosystem. Instead of providing a runnable software system, the repository focuses on explaining algorithm principles and examining the underlying source code used in Spark’s machine learning package. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Objective Function Analysis

    An alternative to neural nets for machine learning.

    Objective Function Analysis models knowledge as a multi-dimensional probability density function (MD-PDF) of the perceptions and responses (which are themselves perceptions) of an entity and an objective function (OF). The learning algorithm is the action of choosing a response, given the perceptions, which maximizes the objective function. The MD-PDF is initially seeded by a uniform random number generator. The response is used to evaluate the OF and the OF is either reinforced or diminished in the probability subspace formed by the perceptions and responses. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PythonRobotics

    PythonRobotics

    Python sample codes and textbook for robotics algorithms

    PythonRobotics is a Python code collection and textbook for learning robotics algorithms through readable examples. It covers practical topics such as localization, mapping, path planning, path tracking, control, SLAM, and autonomous navigation. The project is written to make each algorithm’s core idea easy to understand, rather than hiding the logic behind large frameworks. It keeps dependencies minimal so learners can focus on the math, implementation, and behavior of each robotics method....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    OpenDino

    Open Source Java platform for Optimization, DoE, and Learning.

    OpenDino is an open source Java platform for optimization, design of experiment and learning. It provides a graphical user interface (GUI) and a platform which simplifies integration of new algorithms as "Modules". Implemented Modules Evolutionary Algorithms: - CMA-ES - (1+1)-ES - Differential Evolution Deterministic optimization algorithm: - SIMPLEX Learning: - a simple Artificial Neural Net Optimization problems: - test functions - interface for executing other programs (solvers) - parallel execution of problems - distributed execution of problems via socket connection between computers Others: - data storage - data analyser and viewer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DeepTraffic

    DeepTraffic

    DeepTraffic is a deep reinforcement learning competition

    DeepTraffic is a deep reinforcement learning simulation designed to teach and evaluate autonomous driving algorithms in a dense highway environment. The system presents a simulated multi-lane highway where an AI-controlled vehicle must navigate traffic while maximizing speed and avoiding collisions. Participants design neural network policies that determine the vehicle’s actions, such as accelerating, decelerating, changing lanes, or maintaining speed. The project was created as part of an...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Play-With-Sort-OC

    Play-With-Sort-OC

    Repository implemented in Objective-C with sorting algorithms

    Play-With-Sort-OC is a learning-oriented repository implemented in Objective-C that demonstrates several classic sorting algorithms with code examples (selection sort, bubble sort, insertion sort, quick sort variants, heap sort, etc). The goal is educational; by showing how each algorithm works with animations or clear visualizations in an iOS/Objective-C context, the author helps developers understand not just the “how” but also the “why” behind each algorithm.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Lihang

    Lihang

    Statistical learning methods (2nd edition) [Li Hang]

    Lihang is an open-source repository that provides educational notes, mathematical derivations, and code implementations based on the book Statistical Learning Methods by Li Hang. The repository aims to help readers understand the theoretical foundations of machine learning algorithms through practical implementations and detailed explanations. It includes notebooks and scripts that demonstrate how key algorithms such as perceptrons, decision trees, logistic regression, support vector...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Dynamic Routing Between Capsules

    Dynamic Routing Between Capsules

    A PyTorch implementation of the NIPS 2017 paper

    ...Instead of scalar neuron activations, capsules output vectors that encode both the presence of features and their spatial properties such as orientation or pose. The repository implements the dynamic routing algorithm between capsules, which allows lower-level features to route their outputs to higher-level structures that best represent the detected patterns. This approach enables the model to capture part-to-whole relationships in visual data more effectively than standard CNNs. The project serves primarily as a research implementation that demonstrates how capsule networks can be built and trained using modern deep learning frameworks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Data Algorithm/leetcode/lintcode

    Data Algorithm/leetcode/lintcode

    Data Structure and Algorithm notes

    This work is some notes of learning and practicing data structures and algorithms. Part I is a brief introduction of basic data structures and algorithms, such as, linked lists, stack, queues, trees, sorting and etc. This book notes about learning data structure and algorithms. It was written in Simplified Chinese but other languages such as English and Traditional Chinese are also working in progress.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Deep Reinforcement Learning TensorFlow

    Deep Reinforcement Learning TensorFlow

    TensorFlow implementation of Deep Reinforcement Learning papers

    Deep Reinforcement Learning TensorFlow is a comprehensive TensorFlow codebase that implements several foundational deep reinforcement learning algorithms for educational and experimental use. The repository focuses on clarity and modularity so users can study how different RL approaches are built and compare their behavior across environments. It includes implementations of well-known algorithms such as Deep Q-Networks (DQN), policy gradients, and related variants, demonstrating how neural networks can be trained through interaction with simulated environments. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Deep Reinforcement Learning for Keras

    Deep Reinforcement Learning for Keras

    Deep Reinforcement Learning for Keras.

    keras-rl implements some state-of-the-art deep reinforcement learning algorithms in Python and seamlessly integrates with the deep learning library Keras. Furthermore, keras-rl works with OpenAI Gym out of the box. This means that evaluating and playing around with different algorithms is easy. Of course, you can extend keras-rl according to your own needs. You can use built-in Keras callbacks and metrics or define your own. Even more so, it is easy to implement your own environments and...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 24
    Easy Machine Learning

    Easy Machine Learning

    Easy Machine Learning is a general-purpose dataflow-based system

    ...Our platform Easy Machine Learning presents a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real-world tasks. In the system, a learning task is formulated as a directed acyclic graph (DAG) in which each node represents an operation (e.g. a machine learning algorithm), and each edge represents the flow of the data from one node to its descendants.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25

    spark-msna

    Algorithm on Spark for aligning multiple similar DNA/RNA sequences

    The algorithm uses suffix tree for identifying common substrings and uses a modified Needleman-Wunsch algorithm for pairwise alignments. In order to improve the efficiency of pairwise alignments, an unsupervised learning based on clustering technique is used to create a knowledge base to guide them.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo