python q learning free download

AgentUniverse

agentUniverse is a LLM multi-agent framework

AgentUniverse is a multi-agent AI framework that enables coordination between multiple intelligent agents for complex task execution and automation.

Downloads: 0 This Week

Last Update: 2025-11-17

See Project

Multi-Agent Orchestrator

Flexible and powerful framework for managing multiple AI agents

Multi-Agent Orchestrator is an AI coordination framework that enables multiple intelligent agents to work together to complete complex, multi-step workflows.

Downloads: 0 This Week

Last Update: 2026-07-14

See Project

Habitat-Lab

A modular high-level library to train embodied AI agents

Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...

Downloads: 0 This Week

Last Update: 2026-05-07

See Project

OpenJarvis

Personal AI, On Personal Devices

OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...

Downloads: 64 This Week

Last Update: 2026-05-25

See Project

MetaClaw

Just talk to your agent

MetaClaw is an AI or agent-oriented system that appears to focus on advanced control, coordination, or training of autonomous agents, potentially within reinforcement learning or tool-using environments. The project likely emphasizes meta-level reasoning, where agents are not only executing tasks but also adapting their strategies based on feedback and performance signals. It may incorporate mechanisms for learning from interactions, improving decision-making over time, and generalizing...

Downloads: 0 This Week

Last Update: 2026-04-11

See Project

Hermes Agent

The agent that grows with you

Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...

Downloads: 90 This Week

Last Update: 2 days ago

See Project

Academic Research Skills for Claude Code

Academic Research Skills is a structured learning repository aimed at improving users’ ability to conduct rigorous academic research, particularly in technical and scientific domains. It compiles methodologies, frameworks, and best practices for literature review, critical analysis, and research writing. The project is designed as a self-guided resource, helping learners understand how to evaluate sources, synthesize information, and develop strong arguments. It likely integrates examples,...

Downloads: 11 This Week

Last Update: 2026-07-22

See Project

Hello-Agents

Building an Intelligent Agent from Scratch

Hello Agents is an open educational project designed to teach developers how to understand, design, and build AI-native agents from the ground up through structured tutorials and practical examples. The project focuses on guiding learners beyond superficial framework usage toward deeper comprehension of agent architecture, reasoning loops, and real-world implementation patterns. It walks users through core concepts such as ReAct-style reasoning, tool usage, memory handling, and multi-step...

Downloads: 10 This Week

Last Update: 2026-07-17

See Project

CUDA Agent

Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment,...

Downloads: 2 This Week

Last Update: 2026-03-03

See Project

Dash Data Agent

Self-learning data agent that grounds its answers in layers of content

Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live...

Downloads: 0 This Week

Last Update: 2026-07-10

See Project

Live Agent Studio

Open source AI Agents hosted on the oTTomator Live Agent Studio

...Each agent in the collection is designed for a specific use case — such as content summarization, task automation, travel planning, or RAG workflows — and is provided with the code or configuration needed to explore and extend it on your own, making the repository both a learning resource and a practical starting point for real projects. The repository is community focused, with sample agents like tweet generators, smart selectors, research assistants, and multi-tool workflows that show how agents can integrate with tools like n8n or custom Python code. Because it’s tied to the broader Live Agent Studio ecosystem, users can experiment with deploying and using these agents in a hosted environment.

Downloads: 1 This Week

Last Update: 2026-01-26

See Project

Semantic Router

Superfast AI decision making and processing of multi-modal data

Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited. Semantic Router enables...

Downloads: 5 This Week

Last Update: 6 days ago

See Project

Agent Lightning

The absolute trainer to light up AI agents

Agent Lightning is an open-source framework developed by Microsoft to train and optimize AI agents using techniques like reinforcement learning (RL), supervised fine-tuning, and automatic prompt optimization, with minimal or zero changes to existing agent code. It’s designed to be compatible with a wide range of agent architectures and frameworks — from LangChain and OpenAI Agent SDKs to AutoGen and custom Python agents — making it broadly applicable across different agent tooling ecosystems. ...

Downloads: 1 This Week

Last Update: 2026-02-06

See Project

Agent Reinforcement Trainer

Train multi-step agents for real-world tasks using GRPO

Agent Reinforcement Trainer, or ART is an open-source reinforcement learning framework tailored to training large language model agents through experience, making them more reliable and performant on multi-turn, multi-step tasks. Instead of just manually crafting prompts or relying on supervised fine-tuning, ART uses techniques like Group Relative Policy Optimization (GRPO) to let agents learn from environmental feedback and reward signals.

Downloads: 0 This Week

Last Update: 2026-03-13

See Project

AgentScope

Build and run agents you can see, understand and trust

AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope...

Downloads: 5 This Week

Last Update: 2026-07-23

See Project

TrustGraph

Deploy reasoning AI agents powered by agentic graph RAG in minutes

TrustGraph is an AI-driven framework designed to assess and visualize trust relationships within networks, aiding in the analysis of trustworthiness and influence among entities.

Downloads: 4 This Week

Last Update: 3 days ago

See Project

OpenHarness

Open Agent Harness with a built-in personal agent, Ohmo

OpenHarness is an open-source framework developed to support large-scale machine learning workflows, particularly in the context of training, evaluating, and benchmarking AI models. It provides a structured environment for orchestrating experiments, managing datasets, and standardizing evaluation processes across different models. The project focuses on reproducibility and scalability, allowing researchers and engineers to run consistent experiments while tracking results effectively. It...

Downloads: 1 This Week

Last Update: 2026-05-07

See Project

FinRobot

An Open-Source AI Agent Platform for Financial Analysis using LLMs

FinRobot is an open-source AI framework focused on automating financial data workflows by combining data ingestion, feature engineering, model training, and automated decision-making pipelines tailored for quantitative finance applications. It provides developers and quants with structured modules to fetch market data, process time series, generate technical indicators, and construct features appropriate for machine learning models, while also supporting backtesting and evaluation metrics to...

Downloads: 2 This Week

Last Update: 2026-07-07

See Project

autoresearch

AI agents autonomously run and improve ML experiments overnight

autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter...

Downloads: 0 This Week

Last Update: 2026-03-26

See Project

Scientific Agent Skills

A set of ready to use Agent Skills for research, science, engineering

...It supports any AI agent compatible with the Agent Skills standard, including tools such as Cursor, Claude Code, Codex, and Gemini CLI. The repository includes 135 skills across scientific domains such as genomics, cheminformatics, clinical research, medical imaging, machine learning, physics, materials science, geospatial analysis, and scientific writing. Each skill provides curated documentation, examples, best practices, and integration guidance so agents can execute complex workflows more reliably. It is especially useful for researchers who need AI assistance with databases, Python libraries, literature review, data analysis, and scientific communication. ...

Downloads: 7 This Week

Last Update: 1 day ago

See Project

Diplomacy Cicero

Code for Cicero, an AI agent that plays the game of Diplomacy

...It is designed to play the board game Diplomacy by combining open-domain natural language negotiation with strategic planning. The repository includes training code, model checkpoints, and infrastructure for both language modelling (via the ParlAI framework) and reinforcement learning for strategy agents. It supports two variants: Cicero (which handles full “press” negotiation) and Diplodocus (a variant focused on no-press diplomacy) as described in the README. The codebase is implemented primarily in Python with performance-critical components in C++ (via pybind11 bindings) and is configured to run in a high‐GPU cluster environment. ...

Downloads: 1 This Week

Last Update: 7 days ago

See Project

Agent Zero

Agent Zero AI framework

Agent Zero is not a predefined agentic framework. It is designed to be dynamic, organically growing, and learning as you use it. Agent Zero is fully transparent, readable, comprehensible, customizable and interactive. Agent Zero uses the computer as a tool to accomplish its (your) tasks. Agents can communicate with their superiors and subordinates, asking questions, giving instructions, and providing guidance. Instruct your agents in the system prompt on how to communicate effectively. The...

Downloads: 22 This Week

Last Update: 5 days ago

See Project

Cybergod

A program that can do anything to earn money without human operators

AGI Computer Control is an experimental autonomous software system designed to operate independently and generate income without human intervention. It aims to simulate artificial general intelligence (AGI) by leveraging evolutionary algorithms, deep active inference, and other advanced AI techniques. The project explores the boundaries of machine autonomy and self-directed behavior in computational environments.

Downloads: 1 This Week

Last Update: 2025-05-21

See Project

Agent S

Agent S: an open agentic framework that uses computers like a human

Agent S is an open-source agentic framework designed to enable autonomous computer use through an Agent-Computer Interface (ACI). Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks. Agent S combines...

Downloads: 4 This Week

Last Update: 2025-12-16

See Project

Dendrite

Tools to build web AI agents that can authenticate

Dendrite Python SDK is a toolkit for building web AI agents that can authenticate, interact with, and extract data from any website, facilitating web automation tasks.

Downloads: 0 This Week

Last Update: 2025-01-29

See Project

Search Results for "python q learning"

Showing 54 open source projects for "python q learning"

AgentUniverse

Multi-Agent Orchestrator

Habitat-Lab

OpenJarvis

MetaClaw

Hermes Agent

Academic Research Skills for Claude Code

Hello-Agents

CUDA Agent

Dash Data Agent

Live Agent Studio

Semantic Router

Agent Lightning

Agent Reinforcement Trainer

AgentScope

TrustGraph

OpenHarness

FinRobot

autoresearch

Scientific Agent Skills

Diplomacy Cicero

Agent Zero

Cybergod

Agent S

Dendrite

Search Results for "python q learning"

Showing 54 open source projects for "python q learning"

AgentUniverse

Multi-Agent Orchestrator

Habitat-Lab

OpenJarvis

MetaClaw

Hermes Agent

Academic Research Skills for Claude Code

Hello-Agents

CUDA Agent

Dash Data Agent

Live Agent Studio

Semantic Router

Agent Lightning

Agent Reinforcement Trainer

AgentScope

TrustGraph

OpenHarness

FinRobot

autoresearch

Scientific Agent Skills

Diplomacy Cicero

Agent Zero

Cybergod

Agent S

Dendrite

Related Searches

Related Categories