Showing 72 open source projects for "python q learning"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 1
    RWARE

    RWARE

    MuA multi-agent reinforcement learning environment

    robotic-warehouse is a simulation environment and framework for robotic warehouse automation, enabling research and development of AI and robotic agents to manage warehouse logistics, such as item picking and transport.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    AgentUniverse

    AgentUniverse

    agentUniverse is a LLM multi-agent framework

    AgentUniverse is a multi-agent AI framework that enables coordination between multiple intelligent agents for complex task execution and automation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Multi-Agent Orchestrator

    Multi-Agent Orchestrator

    Flexible and powerful framework for managing multiple AI agents

    Multi-Agent Orchestrator is an AI coordination framework that enables multiple intelligent agents to work together to complete complex, multi-step workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Habitat-Lab

    Habitat-Lab

    A modular high-level library to train embodied AI agents

    Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    OpenJarvis

    OpenJarvis

    Personal AI, On Personal Devices

    OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...
    Downloads: 186 This Week
    Last Update:
    See Project
  • 6
    VectorizedMultiAgentSimulator (VMAS)

    VectorizedMultiAgentSimulator (VMAS)

    VMAS is a vectorized differentiable simulator

    VectorizedMultiAgentSimulator is a high-performance, vectorized simulator for multi-agent systems, focusing on large-scale agent interactions in shared environments. It is designed for research in multi-agent reinforcement learning, robotics, and autonomous systems where thousands of agents need to be simulated efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Youtu-Agent

    Youtu-Agent

    A simple yet powerful agent framework that delivers with models

    Youtu-Agent is an open-source framework developed to simplify the creation, execution, and evaluation of autonomous AI agents. The system focuses on reducing the complexity traditionally involved in configuring large language model agents by providing a modular architecture that separates execution environments, tools, and context management. This structure allows developers to rapidly assemble agent systems capable of performing tasks such as research, file processing, and data analysis....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    MetaClaw

    MetaClaw

    Just talk to your agent

    MetaClaw is an AI or agent-oriented system that appears to focus on advanced control, coordination, or training of autonomous agents, potentially within reinforcement learning or tool-using environments. The project likely emphasizes meta-level reasoning, where agents are not only executing tasks but also adapting their strategies based on feedback and performance signals. It may incorporate mechanisms for learning from interactions, improving decision-making over time, and generalizing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    rLLM

    rLLM

    Democratizing Reinforcement Learning for LLMs

    rLLM is an open-source framework for building and training post-training language agents via reinforcement learning — that is, using reinforcement signals to fine-tune or adapt language models (LLMs) into customizable agents for real-world tasks. With rLLM, developers can define custom “agents” and “environments,” and then train those agents via reinforcement learning workflows, possibly surpassing what vanilla fine-tuning or supervised learning might provide. The project is designed to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    Hermes Agent

    Hermes Agent

    The agent that grows with you

    Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...
    Downloads: 72 This Week
    Last Update:
    See Project
  • 11
    PilottAI

    PilottAI

    Python framework for building scalable multi-agent systems

    pilottai is an AI-based autonomous drone navigation system utilizing reinforcement learning for real-time decision-making. It is designed for simulating and training drones to fly safely through dynamic environments using AI-based controllers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    verl-agent

    verl-agent

    Designed for training LLM/VLM agents via RL

    verl-agent is an open-source reinforcement learning framework designed to train large language model agents and vision-language model agents for complex interactive environments. Built as an extension of the veRL reinforcement learning infrastructure, the project focuses on enabling scalable training for agents that perform multi-step reasoning and decision-making tasks. The framework supports multi-turn interactions between agents and their environments, allowing the system to receive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Dash Data Agent

    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of content

    Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    LiteMultiAgent

    LiteMultiAgent

    The Library for LLM-based multi-agent applications

    LiteMultiAgent is a lightweight and extensible multi-agent reinforcement learning (MARL) platform designed for rapid experimentation. It allows researchers to design and test coordination, competition, and collaboration scenarios in simulated environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Academic Research Skills for Claude Code

    Academic Research Skills for Claude Code

    Academic Research Skills for Claude Code

    Academic Research Skills is a structured learning repository aimed at improving users’ ability to conduct rigorous academic research, particularly in technical and scientific domains. It compiles methodologies, frameworks, and best practices for literature review, critical analysis, and research writing. The project is designed as a self-guided resource, helping learners understand how to evaluate sources, synthesize information, and develop strong arguments. It likely integrates examples,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    CUDA Agent

    CUDA Agent

    Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

    CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Semantic Router

    Semantic Router

    Superfast AI decision making and processing of multi-modal data

    Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited. Semantic Router enables...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Agent Reinforcement Trainer

    Agent Reinforcement Trainer

    Train multi-step agents for real-world tasks using GRPO

    Agent Reinforcement Trainer, or ART is an open-source reinforcement learning framework tailored to training large language model agents through experience, making them more reliable and performant on multi-turn, multi-step tasks. Instead of just manually crafting prompts or relying on supervised fine-tuning, ART uses techniques like Group Relative Policy Optimization (GRPO) to let agents learn from environmental feedback and reward signals.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Live Agent Studio

    Live Agent Studio

    Open source AI Agents hosted on the oTTomator Live Agent Studio

    ...Each agent in the collection is designed for a specific use case — such as content summarization, task automation, travel planning, or RAG workflows — and is provided with the code or configuration needed to explore and extend it on your own, making the repository both a learning resource and a practical starting point for real projects. The repository is community focused, with sample agents like tweet generators, smart selectors, research assistants, and multi-tool workflows that show how agents can integrate with tools like n8n or custom Python code. Because it’s tied to the broader Live Agent Studio ecosystem, users can experiment with deploying and using these agents in a hosted environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Hello-Agents

    Hello-Agents

    Building an Intelligent Agent from Scratch

    Hello Agents is an open educational project designed to teach developers how to understand, design, and build AI-native agents from the ground up through structured tutorials and practical examples. The project focuses on guiding learners beyond superficial framework usage toward deeper comprehension of agent architecture, reasoning loops, and real-world implementation patterns. It walks users through core concepts such as ReAct-style reasoning, tool usage, memory handling, and multi-step...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Dendrite

    Dendrite

    Tools to build web AI agents that can authenticate

    Dendrite Python SDK is a toolkit for building web AI agents that can authenticate, interact with, and extract data from any website, facilitating web automation tasks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    OpenHarness

    OpenHarness

    Open Agent Harness with a built-in personal agent, Ohmo

    OpenHarness is an open-source framework developed to support large-scale machine learning workflows, particularly in the context of training, evaluating, and benchmarking AI models. It provides a structured environment for orchestrating experiments, managing datasets, and standardizing evaluation processes across different models. The project focuses on reproducibility and scalability, allowing researchers and engineers to run consistent experiments while tracking results effectively. It...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    autoresearch

    autoresearch

    AI agents autonomously run and improve ML experiments overnight

    autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Hindsight

    Hindsight

    Hindsight: Agent Memory That Learns

    Hindsight is an advanced, open-source memory system for AI agents designed to enable long-term learning, reasoning, and consistency across interactions by treating memory as a first-class component of intelligence rather than a simple retrieval layer. It addresses one of the core limitations of modern AI agents, which is their inability to retain and meaningfully use past experiences over time, by introducing a structured, biomimetic memory architecture inspired by how human memory works....
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    TrustGraph

    TrustGraph

    Deploy reasoning AI agents powered by agentic graph RAG in minutes

    TrustGraph is an AI-driven framework designed to assess and visualize trust relationships within networks, aiding in the analysis of trustworthiness and influence among entities.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB