python q learning free download

Showing 29 open source projects for "python q learning"

View related business solutions

Unix Shell Clear Filters & Widen Search

Resolve Support Tickets 2x Faster with ServoDesk
Full access to Enterprise features. No credit card required.

What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.

Try ServoDesk for free
Best Visitor Management System
Instantly Notify Staff Of Deliveries And Guest Arrivals To Increase Your Efficiency

<p class="mb-4">Do stacks of paperwork pile up at the front desk area? Or are your receptionists constantly filing reports, guest log-in information and NDAs – taking them away from other important tasks? Not anymore! Our Visitor Management System automates all these processes, streamlining your workflow. Guests can complete inductions, sign NDAs, fill in their contact details and much more using the easy software. These records are then automatically filed and stored, making life easy for receptionists and the HR team. Claim your FREE 7-day trial and experience how VisitUs can transform your workplace!</p>

Try it Free
1

ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences

ImageReward is the first general-purpose human preference reward model (RM) designed for evaluating text-to-image generation, introduced alongside the NeurIPS 2023 paper ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation. Trained on 137k expert-annotated image pairs, ImageReward significantly outperforms existing scoring methods like CLIP, Aesthetic, and BLIP in capturing human visual preferences. It is provided as a Python package (image-reward) that enables quick scoring of generated images against textual prompts, with APIs for ranking, scoring, and filtering outputs. ...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
2

RLax

Library of JAX-based building blocks for reinforcement learning agents

RLax (pronounced “relax”) is a JAX-based library developed by Google DeepMind that provides reusable mathematical building blocks for constructing reinforcement learning (RL) agents. Rather than implementing full algorithms, RLax focuses on the core functional operations that underpin RL methods—such as computing value functions, returns, policy gradients, and loss terms—allowing researchers to flexibly assemble their own agents. It supports both on-policy and off-policy learning, as well as...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
3

Generative AI for Beginners (Version 3)

21 Lessons, Get Started Building with Generative AI

...Each lesson includes a short video, a written guide, runnable samples for Azure OpenAI, the GitHub Marketplace Model Catalog, and the OpenAI API, plus a “Keep Learning” section for deeper study.

Downloads: 9 This Week

Last Update: 3 days ago
See Project
4

System Design Primer

Learn how to design large-scale systems

System Design Primer is a curated, open source collection of resources that helps engineers learn how to design large-scale systems. The project is structured as a comprehensive guide covering core system design concepts, trade-offs, and patterns necessary for building scalable, reliable, and maintainable systems. It offers both theoretical foundations—such as scalability principles, the CAP theorem, and consistency models—and practical exercises, including real-world system design interview...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
New Relic provides the most powerful cloud-based observability platform built to help companies create more perfect software.
Get a live and in-depth view of your network, infrastructure, applications, end-user experience, machine learning models and more.

Correlate issues across your stack. Debug and collaborate from your IDE. AI assistance at every step. All in one connected experience - not a maze of charts.

Start for Free
5

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal...

Downloads: 2 This Week

Last Update: 7 hours ago
See Project
6

Diplomacy Cicero

Code for Cicero, an AI agent that plays the game of Diplomacy

...It is designed to play the board game Diplomacy by combining open-domain natural language negotiation with strategic planning. The repository includes training code, model checkpoints, and infrastructure for both language modelling (via the ParlAI framework) and reinforcement learning for strategy agents. It supports two variants: Cicero (which handles full “press” negotiation) and Diplodocus (a variant focused on no-press diplomacy) as described in the README. The codebase is implemented primarily in Python with performance-critical components in C++ (via pybind11 bindings) and is configured to run in a high‐GPU cluster environment. ...

Downloads: 2 This Week

Last Update: 7 hours ago
See Project
7

Public APIs

A collective list of free APIs

public-apis is a collaboratively maintained repository that provides an extensive, categorized list of publicly available APIs for developers. Curated by community contributors and the team at APILayer, it serves as a centralized resource for discovering APIs across a wide range of domains, including data, machine learning, weather, entertainment, and finance. The project aims to make API exploration and integration more accessible by offering a single, organized index of open and...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
8

alphageometry

AI-driven neuro-symbolic solver for high-school geometry problems

AlphaGeometry, developed by Google DeepMind, is a theorem-proving system that combines symbolic reasoning with deep learning to solve challenging geometry problems, such as those found in mathematical Olympiads. The repository provides the full implementation of DDAR (Deductive Difference and Abductive Reasoning) and AlphaGeometry, two automated geometry solvers described in the 2024 Nature paper “Solving Olympiad Geometry without Human Demonstrations.” AlphaGeometry integrates a symbolic...

Downloads: 11 This Week

Last Update: 3 days ago
See Project
9

Seeker

Accurately Locate Smartphones using Social Engineering

Seeker is an open source project that demonstrates how to obtain precise location information from devices using social engineering and web-based techniques. The tool sets up a phishing page that asks for location permissions, allowing GPS and other device data to be shared if the user consents. It can capture latitude, longitude, accuracy, altitude, direction, and even speed, with results displayed in a terminal. The project supports both manual deployment and tunneling services like Ngrok...

Downloads: 5 This Week

Last Update: 3 days ago
See Project
Empower Your Workforce and Digitize Your Shop Floor
Benefits to Manufacturers

Easily connect to most tools and equipment on the shop floor, enabling efficient data collection and boosting productivity with vital insights. Turn information into action to generate new ideas and better processes.

Learn More
10

NGINX Admin’s Handbook

How to improve NGINX performance, security, and other important things

nginx-admins-handbook is a practical, in-depth guide for configuring, securing, and operating NGINX across real-world deployments. It distills years of research, notes, and field experience into a single handbook that complements the official docs with concrete rules, explanations, and curated external references. The handbook spans fundamentals and advanced topics alike, from HTTP and SSL/TLS basics to reverse proxy patterns, performance tuning, debugging workflows, and hardening...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
11

fairseq2

FAIR Sequence Modeling Toolkit 2

fairseq2 is a modern, modular sequence modeling framework developed by Meta AI Research as a complete redesign of the original fairseq library. Built from the ground up for scalability, composability, and research flexibility, fairseq2 supports a broad range of language, speech, and multimodal content generation tasks, including instruction fine-tuning, reinforcement learning from human feedback (RLHF), and large-scale multilingual modeling. Unlike the original fairseq—which evolved into a...

Downloads: 0 This Week

Last Update: 2025-11-07
See Project
12

TobbeOS

TobbeOS is a GNU/Linux distro based on Arch linux

Welcome to TobbeOS!! TobbeOS is a GNU/Linux distro based on Arch linux, it is a full desktop with Qtile as its window manager and much more. Important keys to know about: Mod = Super key (Windows key) Mod + E - Open Emacs - Need to install with script tobbeos/emacs_install.sh Mod + Return - Open Terminal (Kitty) Mod + s - Show hotkeys Mod + left Shift + s - Show alias for fish shell Mod + left Shift + Tab - Open web browser (Zen) Mod + left Shift + q - kill window Mod + d -...

Downloads: 171 This Week

Last Update: 2025-11-13
See Project
13

Ubix Linux

The Pocket Datalab

Ubix stands for Universal Business Intelligence Computing System. Ubix Linux is an open-source, Debian-based Linux distribution geared towards data acquisition, transformation, analysis and presentation. Ubix Linux purpose is to offer a tiny but versatile datalab. Ubix Linux is easily accessible, resource-efficient and completely portable on a simple USB key. Ubix Linux is a perfect toolset for learning data analysis and artificial intelligence basics on small to medium...

Downloads: 56 This Week

Last Update: 2024-03-08
See Project
14

MLPACK C++ machine learning library

MLPACK is a C++ machine learning library with emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and flexibility for expert users. * More info + downloads: https://mlpack.org * Git repo: https://github.com/mlpack/mlpack

Downloads: 0 This Week

Last Update: 2023-06-28
See Project
15

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
16

Behaviour Suite Reinforcement Learning

bsuite is a collection of carefully-designed experiments

bsuite is a research framework developed by Google DeepMind that provides a comprehensive collection of experiments for evaluating the core capabilities of reinforcement learning (RL) agents. Its main goal is to identify, measure, and analyze fundamental aspects of learning efficiency and generalization in RL algorithms. The library enables researchers to benchmark their agents on standardized tasks, facilitating reproducible and transparent comparisons across different approaches. Each...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
17

CNN for Image Retrieval

cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...

Downloads: 3 This Week

Last Update: 3 days ago
See Project
18

Consistent Depth

We estimate dense, flicker-free, geometrically consistent depth

Consistent Depth is a research project developed by Facebook Research that presents an algorithm for reconstructing dense and geometrically consistent depth information for all pixels in a monocular video. The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
19

Ansible Examples

A few starter examples of ansible playbooks, to show features

This repository collects practical, real-world examples of using Ansible to automate infrastructure, deployments, and configurations. Each directory demonstrates a specific use case—ranging from setting up web servers, load balancers, and databases to orchestrating multi-tier applications in cloud environments. The examples highlight common Ansible practices such as organizing inventories, writing reusable playbooks, using roles, and handling variables and templates. They’re designed to be...

Downloads: 2 This Week

Last Update: 4 days ago
See Project
20

I3D models trained on Kinetics

Convolutional neural network model for video classification

Kinetics-I3D, developed by Google DeepMind, provides trained models and implementation code for the Inflated 3D ConvNet (I3D) architecture introduced in the paper “Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset” (CVPR 2017). The I3D model extends the 2D convolutional structure of Inception-v1 into 3D, allowing it to capture spatial and temporal information from videos for action recognition. This repository includes pretrained I3D models on the Kinetics dataset, with...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
21

Evolution Strategies Starter

Code for the paper "Evolution Strategies.."

evolution-strategies-starter is an archived OpenAI research project that provides a distributed implementation of the algorithm described in the paper “Evolution Strategies as a Scalable Alternative to Reinforcement Learning” by Tim Salimans, Jonathan Ho, Xi Chen, and Ilya Sutskever. The repository demonstrates how to scale Evolution Strategies (ES) for reinforcement learning tasks using a master-worker architecture, where the master node broadcasts parameters to multiple workers, and the...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
22

Gin Config

Gin provides a lightweight configuration framework for Python

Gin Config is a lightweight and flexible configuration framework for Python built around dependency injection. It enables developers to manage complex parameter hierarchies—particularly common in machine learning experiments—without relying on boilerplate configuration classes or protos. By decorating functions and classes with @gin.configurable, Gin allows their parameters to be overridden using simple configuration files (.gin) or command-line bindings.

Downloads: 4 This Week

Last Update: 3 days ago
See Project
23

Caffe2

Caffe2 is a lightweight, modular, and scalable deep learning framework

Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind. Caffe2 is a deep learning framework that provides an easy and straightforward way for you to experiment with deep learning and leverage community contributions of new models and algorithms. You can bring your creations to scale using the power of GPUs in the cloud or to the masses on mobile with Caffe2’s cross-platform...

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
24

PrettyTensor

Pretty Tensor: Fluent Networks in TensorFlow

Pretty Tensor is a high-level API built on top of TensorFlow that simplifies the process of creating and managing deep learning models. It wraps TensorFlow tensors in a chainable object syntax, allowing developers to build multi-layer neural networks with concise and readable code. Pretty Tensor preserves full compatibility with TensorFlow’s core functionality while providing syntactic sugar for defining complex architectures such as convolutional and recurrent networks. The library’s design...

Downloads: 1 This Week

Last Update: 4 days ago
See Project
25

CLUE - C Learning Undergrad Environment

Tools to support the learning of the C programming language

CLUE (C Learning Undergraduate Environment) is a software allowing students to work on assignments in the C language while benefiting from support for peer testing, hyper-linked tutorials to help them understand compiler error messages, detection of "novice errors" with warnings hyper-linked to tutorials. Sponsored by National Science Foundation under award CCLI #0836863.

Downloads: 0 This Week

Last Update: 2014-05-10
See Project