Showing 31 open source projects for "test"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • 1
    codename goose

    codename goose

    AI coding agent that's more than suggestions - install, execute, edit+

    Goose is an open-source, extensible AI agent that enhances the software development process by going beyond traditional code suggestions. It allows developers to install, execute, edit, and test code with any large language model (LLM), facilitating a more efficient and streamlined workflow. Designed to operate locally within a developer's environment, Goose integrates seamlessly with various tools and platforms, providing a customizable and powerful assistant for coding tasks. Its architecture supports extensibility, enabling users to tailor the agent to their specific needs and preferences. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    Kheish

    Kheish

    Kheish: A multi-role LLM agent for tasks like code auditing

    Kheish is a framework designed for cybersecurity professionals to automate penetration testing tasks, providing tools to streamline security assessments.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    SWE-agent

    SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it

    ...GPT-4) into software engineering agents that can resolve issues in real GitHub repositories. On the SWE-bench, the SWE-agent resolves 12.47% of issues, achieving state-of-the-art performance on the full test set. We accomplish our results by designing simple LM-centric commands and feedback formats to make it easier for the LM to browse the repository, and view, edit, and execute code files. We call this an Agent-Computer Interface (ACI).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    npcpy

    npcpy

    The AI toolkit for the AI developer

    npcpy is a Python-based agent framework and command-line toolkit (the NPC Shell) for developers to build, test, and integrate AI agents into their workflows, including both command-line and GUI interfaces via NPC Studio. Welcome to npcpy, the core library of the NPC Toolkit that supercharges natural language processing pipelines and agent tooling. npcpy is a flexible framework for building state-of-the-art applications and conducting novel research with LLMs.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Rent Manager Software Icon
    Rent Manager Software

    Landlords, multi-family homes, manufactured home communities, single family homes, associations, commercial properties and mixed portfolios.

    Rent Manager is award-winning property management software built for residential, commercial, and short-term-stay portfolios of any size. The program’s fully customizable features include a double-entry accounting system, maintenance management/scheduling, marketing integration, mobile applications, more than 450 insightful reports, and an API that integrates with the best PropTech providers on the market.
    Learn More
  • 5
    AgentOps

    AgentOps

    Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.

    Industry-leading developer platform to test and debug AI agents. We built the tools so you don't have to. Visually track events such as LLM calls, tools, and multi-agent interactions. Rewind and replay agent runs with point-in-time precision. Keep a full data trail of logs, errors, and prompt injection attacks from prototype to production. Native integrations with the top agent frameworks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Alan AI

    Alan AI

    In-App assistant SDK to build a multimodal conversational UX websites

    ...Alan is a conversational voice AI platform that lets you create an intelligent voice assistant for your app. It offers all the necessary tools to design, embed, and host your voice solutions. A powerful web-based IDE where you can write, test and debug dialog scenarios for your voice assistant or chatbot. Alan's AI-backend powered by the industry’s best Automatic Speech Recognition (ASR), Natural Language Understanding (NLU) and Speech Synthesis. The Alan Cloud provisions and handles the infrastructure required to maintain your voice deployments and perform all the voice processing tasks. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    AWS CodeDeploy Agent

    AWS CodeDeploy Agent

    Host Agent for AWS CodeDeploy

    ...AWS CodeDeploy fully automates your software deployments, allowing you to deploy reliably and rapidly. You can consistently deploy your application across your development, test, and production environments whether deploying to Amazon EC2, AWS Fargate, AWS Lambda, or your on-premises servers. The service scales with your infrastructure.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8
    Diplomacy Cicero

    Diplomacy Cicero

    Code for Cicero, an AI agent that plays the game of Diplomacy

    The project is the codebase for an AI agent named Cicero developed by Facebook Research. It is designed to play the board game Diplomacy by combining open-domain natural language negotiation with strategic planning. The repository includes training code, model checkpoints, and infrastructure for both language modelling (via the ParlAI framework) and reinforcement learning for strategy agents. It supports two variants: Cicero (which handles full “press” negotiation) and Diplodocus (a variant...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    LiteMultiAgent

    LiteMultiAgent

    The Library for LLM-based multi-agent applications

    LiteMultiAgent is a lightweight and extensible multi-agent reinforcement learning (MARL) platform designed for rapid experimentation. It allows researchers to design and test coordination, competition, and collaboration scenarios in simulated environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Desktop and Mobile Device Management Software Icon
    Desktop and Mobile Device Management Software

    It's a modern take on desktop management that can be scaled as per organizational needs.

    Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.
    Learn More
  • 10
    Arcade AI

    Arcade AI

    Arcade Tool Development Kit (TDK), Worker, Evals, and CLI

    ...This repository contains the core Arcade libraries, organized as separate packages for maximum flexibility and modularity. Evaluation framework for testing tool performance. Test your MCP server's tools, resources, prompts, elicitation, and OAuth 2. MCPJam is compliant with the latest MCP specs. Connect to any MCP server. MCPJam inspector supports STDIO, SSE, and Streamable HTTP transports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Agentex

    Agentex

    Open source codebase for Scale Agentex

    AgentEX is an open framework from Scale for building, running, and evaluating agentic workflows, with an emphasis on reproducibility and measurable outcomes rather than ad-hoc demos. It treats an “agent” as a composition of a policy (the LLM), tools, memory, and an execution runtime so you can test the whole loop, not just prompting. The repo focuses on structured experiments: standardized tasks, canonical tool interfaces, and logs that make it possible to compare models, prompts, and tool sets fairly. It also includes evaluation harnesses that capture success criteria and partial credit, plus traces you can inspect to understand where reasoning or tool use failed. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Mini Agent

    Mini Agent

    A minimal yet professional single agent demo project

    Mini-Agent is a minimal yet production-minded demo project that shows how to build a serious command-line AI agent around the MiniMax-M2 model. It is designed both as a reference implementation and as a usable agent, demonstrating a full execution loop that includes planning, tool calls, and iterative refinement. The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Streamline Analyst

    Streamline Analyst

    AI agent that streamlines the entire process of data analysis

    ...This Data Analysis Agent effortlessly automates all the tasks such as data cleaning, preprocessing, and even complex operations like identifying target objects, partitioning test sets, and selecting the best-fit models based on your data. With Streamline Analyst, results visualization and evaluation become seamless.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    OpenAGI

    OpenAGI

    When LLM Meets Domain Experts

    OpenAGI is a package for AI agent creation designed to connect large language models with domain-specific tools and workflows in the AIOS (AI Operating System) ecosystem. It provides a structured Python framework, pyopenagi, for defining agents as modular units that encapsulate execution logic, configuration, and dependency metadata. Agents are organized in a well-defined folder structure that includes code (agent.py), configuration (config.json), and extra requirements...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Petri

    Petri

    An alignment auditing agent capable of exploring alignment hypothesis

    Petri is an open-source alignment auditing agent that lets researchers rapidly test concrete safety hypotheses against target models using realistic, multi-turn scenarios. Instead of building bespoke evals, Petri automatically generates audit environments from seed “special instructions,” orchestrates an auditor model to probe a target model, and simulates tool use and rollbacks to surface risky behaviors. Each interaction transcript is then scored by a judge model using a consistent rubric so results are comparable across runs and models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    agents.md

    agents.md

    A simple, open format for guiding coding agents

    ...Markdown file) for guiding coding agents in software repositories. The idea is that AGENTS.md acts as a “README for agents”: a predictable, structured place where humans can put instructions, conventions, build/test commands, environment setup, and other guidance that generative agents (e.g. code-writing, code-assisting tools) should consult when operating in the repo. Instead of putting everything in README or doc files (which are more human-oriented and might mix high-level narrative), AGENTS.md is intended to surface agent-relevant details that help them “do the right thing” (tests, style, project structure, tooling).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Agentic Commerce Protocol (ACP)

    Agentic Commerce Protocol (ACP)

    Interaction model for connecting buyers to complete purchases

    ACP is an open, draft specification for letting buyers, their AI agents, and businesses complete purchases through a standardized interaction model. It’s maintained by OpenAI and Stripe and licensed under Apache-2.0, with the goal of being easy to adopt alongside a merchant’s existing commerce stack rather than replacing it. The repository organizes the spec as human-readable RFCs plus machine-readable OpenAPI and JSON Schema definitions, along with worked examples and a changelog so...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    TinyCog

    Cognitive architecture with a focus on self-models

    TinyCog is a cognitive architecture (perception, action, reasoning, planning, and language) with a focus on representing "self-models" (physical, capabilities, intentions, social, historical, "cognitive stream") in order to implement social reasoning and collaborative planning in a "hunter domain" test bed. We use Prolog, RDF Triples, Description Logic and a HTN-Planner as base technologies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    TDD MAS Toolkit

    A set of powerful tools to perform TDD on MAS based on JADE

    The TDD MAS Toolkit provides a set of tools to perform Test Driven Development of MAS based on Jade. The toolkit allows developers to set up simulation based and test case scenario based testing, providing powerful insights of the MAS under construction and assuring that the interation protocols and agent interaction in general are been enacted as expected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    DisChoco

    DisChoco

    DisChoco: Distributed Constraint Reasoning Solver

    DisChoco is a Distributed Constraint Reasoning Solver. Several Distributed Constraint Reasoning algorithms are implemented (like ABT, AFC, Adopt, ...). Users may easily implement and test their algorithms with DisChoco. contact: wahbi[at]users[dot]sourceforge[dot]net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TuCSoN

    TuCSoN

    TuCSoN tuple-based coordination middleware

    TuCSoN (Tuple Centres Spread over the Network) is a tuple-based middleware for the coordination of distributed systems – including multi-agent systems, mobile agents and pervasive systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    april

    Simulating worlds in a computer

    ...The code is structured in several layers: - april-core simply defines the library and the objects - april-gui builds on the concepts in april-core, giving a visual interpretation to the concepts - AprilDream is a GUI program that uses april-gui objects to display a World in evolution. The code base also contains a test suite based on [Google test](http://code.google.com/p/googletest/) and a set of ever-growing commented examples.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Sentence Composer

    Sentence Composer

    Compose the writings semi-automatically

    ...SAMPLE: Input sentence was "Learning algorithm" The prefix was "transductive learning" Learning algorithm. Transduction, or transductive inference, tries to predict new outputs on specific and fixed (test) cases from observed, specific (training) cases. The simplest realization for transductive inference is the method of k-nearest neighbors. Learning to learn learns its own inductive bias based on previous experience. ...AND SO ON.... Enjoy the program !
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Siafu simulates individual agents and their context, from home to city-wide scenarios. As a developer, you use the API to write your simulation for the purposes of data-set generation, test or visualization, optionally hooking it to your own application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next