python q learning free download

RWARE

MuA multi-agent reinforcement learning environment

robotic-warehouse is a simulation environment and framework for robotic warehouse automation, enabling research and development of AI and robotic agents to manage warehouse logistics, such as item picking and transport.

Downloads: 0 This Week

Last Update: 2025-03-13

See Project

AgentUniverse

agentUniverse is a LLM multi-agent framework

AgentUniverse is a multi-agent AI framework that enables coordination between multiple intelligent agents for complex task execution and automation.

Downloads: 0 This Week

Last Update: 2025-11-17

See Project

Multi-Agent Orchestrator

Flexible and powerful framework for managing multiple AI agents

Multi-Agent Orchestrator is an AI coordination framework that enables multiple intelligent agents to work together to complete complex, multi-step workflows.

Downloads: 0 This Week

Last Update: 2025-06-24

See Project

Habitat-Lab

A modular high-level library to train embodied AI agents

Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...

Downloads: 0 This Week

Last Update: 2026-05-07

See Project

OpenJarvis

Personal AI, On Personal Devices

OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...

Downloads: 186 This Week

Last Update: 5 days ago

See Project

VectorizedMultiAgentSimulator (VMAS)

VMAS is a vectorized differentiable simulator

VectorizedMultiAgentSimulator is a high-performance, vectorized simulator for multi-agent systems, focusing on large-scale agent interactions in shared environments. It is designed for research in multi-agent reinforcement learning, robotics, and autonomous systems where thousands of agents need to be simulated efficiently.

Downloads: 0 This Week

Last Update: 2025-11-10

See Project

Youtu-Agent

A simple yet powerful agent framework that delivers with models

Youtu-Agent is an open-source framework developed to simplify the creation, execution, and evaluation of autonomous AI agents. The system focuses on reducing the complexity traditionally involved in configuring large language model agents by providing a modular architecture that separates execution environments, tools, and context management. This structure allows developers to rapidly assemble agent systems capable of performing tasks such as research, file processing, and data analysis....

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

MetaClaw

Just talk to your agent

MetaClaw is an AI or agent-oriented system that appears to focus on advanced control, coordination, or training of autonomous agents, potentially within reinforcement learning or tool-using environments. The project likely emphasizes meta-level reasoning, where agents are not only executing tasks but also adapting their strategies based on feedback and performance signals. It may incorporate mechanisms for learning from interactions, improving decision-making over time, and generalizing...

Downloads: 0 This Week

Last Update: 2026-04-11

See Project

rLLM

Democratizing Reinforcement Learning for LLMs

rLLM is an open-source framework for building and training post-training language agents via reinforcement learning — that is, using reinforcement signals to fine-tune or adapt language models (LLMs) into customizable agents for real-world tasks. With rLLM, developers can define custom “agents” and “environments,” and then train those agents via reinforcement learning workflows, possibly surpassing what vanilla fine-tuning or supervised learning might provide. The project is designed to...

Downloads: 0 This Week

Last Update: 2026-04-30

See Project

Hermes Agent

The agent that grows with you

Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...

Downloads: 72 This Week

Last Update: 2 days ago

See Project

PilottAI

Python framework for building scalable multi-agent systems

pilottai is an AI-based autonomous drone navigation system utilizing reinforcement learning for real-time decision-making. It is designed for simulating and training drones to fly safely through dynamic environments using AI-based controllers.

Downloads: 0 This Week

Last Update: 2025-12-01

See Project

verl-agent

Designed for training LLM/VLM agents via RL

verl-agent is an open-source reinforcement learning framework designed to train large language model agents and vision-language model agents for complex interactive environments. Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive...

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Dash Data Agent

Self-learning data agent that grounds its answers in layers of content

Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live...

Downloads: 0 This Week

Last Update: 2026-04-08

See Project

LiteMultiAgent

The Library for LLM-based multi-agent applications

LiteMultiAgent is a lightweight and extensible multi-agent reinforcement learning (MARL) platform designed for rapid experimentation. It allows researchers to design and test coordination, competition, and collaboration scenarios in simulated environments.

Downloads: 0 This Week

Last Update: 2025-03-13

See Project

Academic Research Skills for Claude Code

Academic Research Skills is a structured learning repository aimed at improving users’ ability to conduct rigorous academic research, particularly in technical and scientific domains. It compiles methodologies, frameworks, and best practices for literature review, critical analysis, and research writing. The project is designed as a self-guided resource, helping learners understand how to evaluate sources, synthesize information, and develop strong arguments. It likely integrates examples,...

Downloads: 4 This Week

Last Update: 2026-05-18

See Project

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment,...

Downloads: 0 This Week

Last Update: 2026-03-03

See Project

Semantic Router

Superfast AI decision making and processing of multi-modal data

Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited. Semantic Router enables...

Downloads: 3 This Week

Last Update: 2026-05-23

See Project

Agent Reinforcement Trainer

Train multi-step agents for real-world tasks using GRPO

Agent Reinforcement Trainer, or ART is an open-source reinforcement learning framework tailored to training large language model agents through experience, making them more reliable and performant on multi-turn, multi-step tasks. Instead of just manually crafting prompts or relying on supervised fine-tuning, ART uses techniques like Group Relative Policy Optimization (GRPO) to let agents learn from environmental feedback and reward signals.

Downloads: 0 This Week

Last Update: 2026-03-13

See Project

Live Agent Studio

Open source AI Agents hosted on the oTTomator Live Agent Studio

...Each agent in the collection is designed for a specific use case — such as content summarization, task automation, travel planning, or RAG workflows — and is provided with the code or configuration needed to explore and extend it on your own, making the repository both a learning resource and a practical starting point for real projects. The repository is community focused, with sample agents like tweet generators, smart selectors, research assistants, and multi-tool workflows that show how agents can integrate with tools like n8n or custom Python code. Because it’s tied to the broader Live Agent Studio ecosystem, users can experiment with deploying and using these agents in a hosted environment.

Downloads: 0 This Week

Last Update: 2026-01-26

See Project

Hello-Agents

Building an Intelligent Agent from Scratch

Hello Agents is an open educational project designed to teach developers how to understand, design, and build AI-native agents from the ground up through structured tutorials and practical examples. The project focuses on guiding learners beyond superficial framework usage toward deeper comprehension of agent architecture, reasoning loops, and real-world implementation patterns. It walks users through core concepts such as ReAct-style reasoning, tool usage, memory handling, and multi-step...

Downloads: 2 This Week

Last Update: 2026-02-25

See Project

Dendrite

Tools to build web AI agents that can authenticate

Dendrite Python SDK is a toolkit for building web AI agents that can authenticate, interact with, and extract data from any website, facilitating web automation tasks.

Downloads: 1 This Week

Last Update: 2025-01-29

See Project

OpenHarness

Open Agent Harness with a built-in personal agent, Ohmo

OpenHarness is an open-source framework developed to support large-scale machine learning workflows, particularly in the context of training, evaluating, and benchmarking AI models. It provides a structured environment for orchestrating experiments, managing datasets, and standardizing evaluation processes across different models. The project focuses on reproducibility and scalability, allowing researchers and engineers to run consistent experiments while tracking results effectively. It...

Downloads: 1 This Week

Last Update: 2026-05-07

See Project

autoresearch

AI agents autonomously run and improve ML experiments overnight

autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter...

Downloads: 0 This Week

Last Update: 2026-03-26

See Project

Hindsight

Hindsight: Agent Memory That Learns

Hindsight is an advanced, open-source memory system for AI agents designed to enable long-term learning, reasoning, and consistency across interactions by treating memory as a first-class component of intelligence rather than a simple retrieval layer. It addresses one of the core limitations of modern AI agents, which is their inability to retain and meaningfully use past experiences over time, by introducing a structured, biomimetic memory architecture inspired by how human memory works....

Downloads: 2 This Week

Last Update: 3 days ago

See Project

TrustGraph

Deploy reasoning AI agents powered by agentic graph RAG in minutes

TrustGraph is an AI-driven framework designed to assess and visualize trust relationships within networks, aiding in the analysis of trustworthiness and influence among entities.

Downloads: 3 This Week

Last Update: 2 days ago

See Project

Search Results for "python q learning"

Showing 72 open source projects for "python q learning"

RWARE

AgentUniverse

Multi-Agent Orchestrator

Habitat-Lab

OpenJarvis

VectorizedMultiAgentSimulator (VMAS)

Youtu-Agent

MetaClaw

rLLM

Hermes Agent

PilottAI

verl-agent

Dash Data Agent

LiteMultiAgent

Academic Research Skills for Claude Code

CUDA Agent

Semantic Router

Agent Reinforcement Trainer

Live Agent Studio

Hello-Agents

Dendrite

OpenHarness

autoresearch

Hindsight

TrustGraph

Search Results for "python q learning"

Showing 72 open source projects for "python q learning"

RWARE

AgentUniverse

Multi-Agent Orchestrator

Habitat-Lab

OpenJarvis

VectorizedMultiAgentSimulator (VMAS)

Youtu-Agent

MetaClaw

rLLM

Hermes Agent

PilottAI

verl-agent

Dash Data Agent

LiteMultiAgent

Academic Research Skills for Claude Code

CUDA Agent

Semantic Router

Agent Reinforcement Trainer

Live Agent Studio

Hello-Agents

Dendrite

OpenHarness

autoresearch

Hindsight

TrustGraph

Related Searches

Related Categories