Showing 50 open source projects for "python q learning"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    AgentUniverse

    AgentUniverse

    agentUniverse is a LLM multi-agent framework

    AgentUniverse is a multi-agent AI framework that enables coordination between multiple intelligent agents for complex task execution and automation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Multi-Agent Orchestrator

    Multi-Agent Orchestrator

    Flexible and powerful framework for managing multiple AI agents

    Multi-Agent Orchestrator is an AI coordination framework that enables multiple intelligent agents to work together to complete complex, multi-step workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Habitat-Lab

    Habitat-Lab

    A modular high-level library to train embodied AI agents

    Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    OpenJarvis

    OpenJarvis

    Personal AI, On Personal Devices

    OpenJarvis is an open-source framework designed to build personal AI agents that run primarily on local devices rather than relying on cloud infrastructure. Developed as part of the Intelligence Per Watt research initiative, it focuses on improving the efficiency and practicality of on-device AI systems. The framework provides shared primitives for building local-first agents, along with evaluation tools that measure performance using metrics such as energy consumption, latency, cost, and...
    Downloads: 176 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    MetaClaw

    MetaClaw

    Just talk to your agent

    MetaClaw is an AI or agent-oriented system that appears to focus on advanced control, coordination, or training of autonomous agents, potentially within reinforcement learning or tool-using environments. The project likely emphasizes meta-level reasoning, where agents are not only executing tasks but also adapting their strategies based on feedback and performance signals. It may incorporate mechanisms for learning from interactions, improving decision-making over time, and generalizing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Hermes Agent

    Hermes Agent

    The agent that grows with you

    Hermes Agent is a fully open-source autonomous AI agent designed to run persistently on your own machine or server, becoming more capable the longer it operates by learning from experience and building reusable procedural skills. Rather than functioning as a stateless chatbot, it maintains long-term memory across sessions and can generate searchable “Skill Documents” that capture how it solved complex tasks so it doesn’t start from scratch each time. The agent interfaces with messaging...
    Downloads: 75 This Week
    Last Update:
    See Project
  • 7
    Dash Data Agent

    Dash Data Agent

    Self-learning data agent that grounds its answers in layers of content

    Dash is a self-learning data agent built by the Agno AI community that generates grounded answers to English queries over structured data by synthesizing SQL and reasoning based on six layers of context, improving automatically with each run. It sidesteps common limitations of simple text-to-SQL agents by incorporating multiple context layers — including schema structure, human annotations, known query patterns, institutional knowledge from docs, machine-discovered error patterns, and live...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CUDA Agent

    CUDA Agent

    Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

    CUDA Agent is a research-driven agentic reinforcement learning system designed to automatically generate and optimize high-performance CUDA kernels for GPU workloads. The project addresses the long-standing challenge that efficient CUDA programming typically requires deep hardware expertise by training an autonomous coding agent capable of iterative improvement through execution feedback. Its architecture combines large-scale data synthesis, a skill-augmented CUDA development environment,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Semantic Router

    Semantic Router

    Superfast AI decision making and processing of multi-modal data

    Semantic Router is a superfast decision-making layer for your LLMs and agents. Rather than waiting for slow, unreliable LLM generations to make tool-use or safety decisions, we use the magic of semantic vector space — routing our requests using semantic meaning. Combining LLMs with deterministic rules means we can be confident that our AI systems behave as intended. Cramming agent tools into the limited context window is expensive, slow, and fundamentally limited. Semantic Router enables...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    Agent Reinforcement Trainer

    Agent Reinforcement Trainer

    Train multi-step agents for real-world tasks using GRPO

    Agent Reinforcement Trainer, or ART is an open-source reinforcement learning framework tailored to training large language model agents through experience, making them more reliable and performant on multi-turn, multi-step tasks. Instead of just manually crafting prompts or relying on supervised fine-tuning, ART uses techniques like Group Relative Policy Optimization (GRPO) to let agents learn from environmental feedback and reward signals.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Live Agent Studio

    Live Agent Studio

    Open source AI Agents hosted on the oTTomator Live Agent Studio

    ...Each agent in the collection is designed for a specific use case — such as content summarization, task automation, travel planning, or RAG workflows — and is provided with the code or configuration needed to explore and extend it on your own, making the repository both a learning resource and a practical starting point for real projects. The repository is community focused, with sample agents like tweet generators, smart selectors, research assistants, and multi-tool workflows that show how agents can integrate with tools like n8n or custom Python code. Because it’s tied to the broader Live Agent Studio ecosystem, users can experiment with deploying and using these agents in a hosted environment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Academic Research Skills for Claude Code

    Academic Research Skills for Claude Code

    Academic Research Skills for Claude Code

    Academic Research Skills is a structured learning repository aimed at improving users’ ability to conduct rigorous academic research, particularly in technical and scientific domains. It compiles methodologies, frameworks, and best practices for literature review, critical analysis, and research writing. The project is designed as a self-guided resource, helping learners understand how to evaluate sources, synthesize information, and develop strong arguments. It likely integrates examples,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Hello-Agents

    Hello-Agents

    Building an Intelligent Agent from Scratch

    Hello Agents is an open educational project designed to teach developers how to understand, design, and build AI-native agents from the ground up through structured tutorials and practical examples. The project focuses on guiding learners beyond superficial framework usage toward deeper comprehension of agent architecture, reasoning loops, and real-world implementation patterns. It walks users through core concepts such as ReAct-style reasoning, tool usage, memory handling, and multi-step...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Dendrite

    Dendrite

    Tools to build web AI agents that can authenticate

    Dendrite Python SDK is a toolkit for building web AI agents that can authenticate, interact with, and extract data from any website, facilitating web automation tasks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    autoresearch

    autoresearch

    AI agents autonomously run and improve ML experiments overnight

    autoresearch is an experimental framework that enables AI agents to autonomously conduct machine learning research by iteratively modifying and training models. Created by Andrej Karpathy, the project allows an agent to edit the model training code, run short experiments, evaluate results, and repeat the process without human intervention. Each experiment runs for a fixed five-minute training window, enabling rapid iteration and consistent comparison across architectural or hyperparameter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    TrustGraph

    TrustGraph

    Deploy reasoning AI agents powered by agentic graph RAG in minutes

    TrustGraph is an AI-driven framework designed to assess and visualize trust relationships within networks, aiding in the analysis of trustworthiness and influence among entities.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    OpenManus

    OpenManus

    Open-source AI agent framework

    OpenManus is an open-source AI agent framework designed to autonomously execute complex, multi-step tasks by combining reasoning, planning, and tool use. It enables developers to build agents that can think, act, and iterate toward goals rather than simply responding to prompts. The platform emphasizes task decomposition, allowing agents to break down objectives into smaller steps and execute them sequentially or recursively. OpenManus supports integration with external tools, APIs, and...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 18
    Agent Lightning

    Agent Lightning

    The absolute trainer to light up AI agents

    Agent Lightning is an open-source framework developed by Microsoft to train and optimize AI agents using techniques like reinforcement learning (RL), supervised fine-tuning, and automatic prompt optimization, with minimal or zero changes to existing agent code. It’s designed to be compatible with a wide range of agent architectures and frameworks — from LangChain and OpenAI Agent SDKs to AutoGen and custom Python agents — making it broadly applicable across different agent tooling ecosystems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Agent Zero

    Agent Zero

    Agent Zero AI framework

    Agent Zero is not a predefined agentic framework. It is designed to be dynamic, organically growing, and learning as you use it. Agent Zero is fully transparent, readable, comprehensible, customizable and interactive. Agent Zero uses the computer as a tool to accomplish its (your) tasks. Agents can communicate with their superiors and subordinates, asking questions, giving instructions, and providing guidance. Instruct your agents in the system prompt on how to communicate effectively. The...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 20
    Potpie

    Potpie

    Create custom engineering agents for your codebase

    Potpie is an AI-powered data analysis tool that automates the exploration and visualization of datasets, assisting users in uncovering insights without extensive coding.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    OpenHarness

    OpenHarness

    Open Agent Harness with a built-in personal agent, Ohmo

    OpenHarness is an open-source framework developed to support large-scale machine learning workflows, particularly in the context of training, evaluating, and benchmarking AI models. It provides a structured environment for orchestrating experiments, managing datasets, and standardizing evaluation processes across different models. The project focuses on reproducibility and scalability, allowing researchers and engineers to run consistent experiments while tracking results effectively. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Director

    Director

    AI video agents framework for next-gen video interactions

    Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DB-GPT

    DB-GPT

    Revolutionizing Database Interactions with Private LLM Technology

    DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Scientific Agent Skills

    Scientific Agent Skills

    A set of ready to use Agent Skills for research, science, engineering

    ...It supports any AI agent compatible with the Agent Skills standard, including tools such as Cursor, Claude Code, Codex, and Gemini CLI. The repository includes 135 skills across scientific domains such as genomics, cheminformatics, clinical research, medical imaging, machine learning, physics, materials science, geospatial analysis, and scientific writing. Each skill provides curated documentation, examples, best practices, and integration guidance so agents can execute complex workflows more reliably. It is especially useful for researchers who need AI assistance with databases, Python libraries, literature review, data analysis, and scientific communication. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    AgentScope

    AgentScope

    Build and run agents you can see, understand and trust

    AgentScope is a production-ready agent framework designed to help developers build, deploy, and scale intelligent agentic applications. It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB