116 projects for "source testing unit testing" with 2 filters applied:

  • 1
    AgentBench

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    ...These environments require agents to interpret instructions, take actions, and adapt their strategies based on feedback from the environment. AgentBench also includes an evaluation framework that measures success rates, rewards, and task completion performance across different agent implementations. By testing models across diverse scenarios, the benchmark highlights strengths and weaknesses in reasoning, long-term planning, and tool usage.
    Downloads: 0 This Week
  • 2
    VibeKit

    Run Claude Code, Gemini, Codex in a clean, isolated sandbox

    ...This can include things like dynamic content generation, live adaptation based on user interaction, and connectors to external APIs for enriched grounding. The toolkit also supports testing and local iteration, with utilities that simulate event streams and mock model responses to make development predictable.
    Downloads: 0 This Week
  • 3
    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    Spring AI Alibaba Examples provides a collection of example projects that demonstrate how to use Spring AI and Spring AI Alibaba across different scenarios, from basic setups to more advanced AI applications. It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph...
    Downloads: 1 This Week
  • 4
    promptmap2

    A security scanner for custom LLM applications

    promptmap is an automated security scanner for custom LLM applications that focuses on prompt injection and related attack classes. The project supports both white-box and black-box testing, which means it can either run tests directly against a known model and system prompt configuration or attack an external HTTP endpoint without internal access. Its scanning workflow uses a dual-LLM architecture in which one model acts as the target being tested and another acts as a controller that evaluates whether an attack succeeded. ...
    Downloads: 0 This Week
  • 5
    GLM-4.1V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    ...It represents a trade-off: somewhat reduced capacity compared to 4.5V or 4.6V, but with benefits in terms of speed, deployability, and lower hardware requirements — making it especially useful for developers experimenting locally, building lightweight agents, or deploying on limited infrastructure. Given its open-source availability under the same project repository, it provides an accessible entry point for testing multimodal reasoning and building proof-of-concept applications.
    Downloads: 0 This Week
  • 6
    Nestia

    NestJS Helper + AI Chatbot Development

    Nestia is a high-performance toolkit and ecosystem built on top of NestJS that enhances backend development by introducing strongly typed APIs, automated SDK generation, and advanced tooling for scalable server applications. It is designed to eliminate much of the boilerplate typically associated with API development by leveraging pure TypeScript types to automatically generate validation logic, API documentation, and client SDKs. One of its defining advantages is its focus on performance...
    Downloads: 7 This Week
  • 7
    Mini Agent

    A minimal yet professional single agent demo project

    Mini-Agent is a minimal yet production-minded demo project that shows how to build a serious command-line AI agent around the MiniMax-M2 model. It is designed both as a reference implementation and as a usable agent, demonstrating a full execution loop that includes planning, tool calls, and iterative refinement. The project exposes an Anthropic-compatible API interface and fully supports interleaved thinking, letting the agent alternate between reasoning steps and tool invocations during...
    Downloads: 0 This Week
  • 8
    CodeMachine

    CLI tool for multi-agent workflows and automated code generation

    CodeMachine CLI is a command-line orchestration engine designed to run coordinated multi-agent workflows locally. It enables developers to transform high-level specifications into production-ready code by managing planning, architecture, implementation, testing, and validation within a unified environment. CodeMachine CLI supports parallel execution through multiple specialized agents, allowing faster development cycles and scalable automation. Built for flexibility, it can handle anything...
    Downloads: 5 This Week
  • 9
    super-agent-party

    All-in-one AI companion! Desktop girlfriend + virtual streamer

    Super Agent Party is an open-source experimental framework designed to demonstrate collaborative multi-agent AI systems interacting within a shared environment. The project explores how multiple specialized AI agents can coordinate to solve complex tasks by communicating with each other and sharing information. Instead of relying on a single monolithic model, the framework organizes agents with different roles or capabilities that cooperate to achieve goals. Each agent may handle different...
    Downloads: 11 This Week
  • 10
    Beehave

    Behavior tree AI for Godot Engine

    Beehave is a powerful AI behavior tree framework designed as an addon for the Godot game engine, enabling developers to create sophisticated and dynamic non-player character behaviors in games. It uses a node-based system that integrates directly into the Godot scene tree, allowing developers to visually design and organize complex AI logic in a structured and intuitive way. Behavior trees provide a modular approach to decision-making, making it easier to manage large and adaptive AI systems...
    Downloads: 2 This Week
  • 11
    Agent Development Kit (ADK) for Java

    An open-source, code-first Java toolkit

    Google’s Agent Development Kit for Java is an open-source toolkit that helps developers design, evaluate, and deploy advanced AI agents using the Java programming language. The framework follows a code-first approach that treats agent development as a structured software engineering task rather than a collection of prompt scripts. It provides abstractions and tools that allow developers to create agents capable of executing complex workflows, calling tools, and interacting with external...
    Downloads: 6 This Week
  • 12
    ChainForge

    An open-source visual programming environment

    ChainForge is an open-source visual programming environment designed to help developers systematically test, compare, and evaluate prompts and outputs across multiple large language models in a structured and scalable way. Instead of relying on isolated prompt experimentation, it introduces a dataflow-based interface that allows users to create complex prompt pipelines and evaluate them across different models, parameters, and datasets simultaneously. The platform enables rapid...
    Downloads: 1 This Week
  • 13
    Agent Behavior Monitoring

    The open source post-building layer for agents

    Agent Behavior Monitoring is an open-source framework designed to monitor, evaluate, and improve the behavior of AI agents operating in real or simulated environments. The system focuses on agent behavior monitoring by collecting interaction data and analyzing how agents perform across different scenarios and tasks. Developers can use the framework to observe agent actions in both online production environments and offline evaluation settings, making it useful for debugging and performance...
    Downloads: 5 This Week
  • 14
    L1B3RT45

    Harmless liberation prompts

    L1B3RT4S is a large prompt collection project focused on adversarial and “liberation-style” prompt engineering experiments for large language models. The repository gathers creative prompt patterns intended to explore model behavior boundaries, roleplay scenarios, and red-teaming techniques. It is positioned more as a prompt experimentation archive than a traditional software library, emphasizing the study of how instruction phrasing can influence AI outputs. The project reflects the growing...
    Downloads: 1 This Week
  • 15
    Rivet

    Visual AI IDE for building agents with prompt chains and graphs

    Rivet is an open source visual AI programming environment designed to help developers build complex AI agents using a node-based interface and prompt chaining workflows. It provides a desktop application that allows users to visually construct and debug AI logic as interconnected graphs, making it easier to manage sophisticated interactions between language models and external tools. Rivet also includes a TypeScript library that enables these visual graphs to be executed and integrated...
    Downloads: 4 This Week
  • 16
    Prometheus-Eval

    Evaluate your LLM's response with Prometheus and GPT4

    Prometheus-Eval is an open-source framework designed to evaluate the outputs of large language models using specialized evaluator models known as Prometheus. The project provides tools, datasets, and scripts that allow developers and researchers to measure the quality of LLM responses through automated scoring rather than relying solely on human evaluators. It implements an “LLM-as-a-judge” approach in which a dedicated language model analyzes instruction–response pairs and assigns scores or...
    Downloads: 3 This Week
  • 17
    Bedrock Chat

    AWS-native chatbot using Bedrock

    Bedrock Chat is a mirrored version of an open-source project that provides a conversational interface for interacting with large language models and AI services through a chat-style application. The project typically focuses on delivering a user interface that allows individuals or teams to communicate with AI models, manage conversations, and experiment with prompts and responses. Implementations like Bedrock Chat often integrate with model hosting platforms or APIs that provide access to...
    Downloads: 3 This Week
  • 18
    Codex MCP Server

    MCP server wrapper for OpenAI Codex CLI

    Codex MCP Server is an open-source integration tool that allows AI development environments to access the capabilities of the OpenAI Codex command-line interface through the Model Context Protocol. The project acts as a bridge between AI assistants such as Claude Code and the Codex CLI, enabling those assistants to perform advanced coding operations using Codex as a backend engine. Through this architecture, developers can request tasks such as code explanation, refactoring, or analysis...
    Downloads: 5 This Week
  • 19
    NOFX

    Open source AI trading OS for autonomous multi-model trading systems

    NOFX is an open source AI-powered trading operating system designed to automate financial trading workflows using autonomous AI agents. It acts as an infrastructure layer that transforms market data into AI-driven trade decisions and execution. Instead of requiring users to manually configure machine learning models, data sources, and API integrations, the system allows AI components to perceive market conditions, select models, and perform trading actions automatically. It supports running...
    Downloads: 5 This Week
  • 20
    GitHub Agentic Workflows

    GitHub Agentic Workflows

    GitHub Agentic Workflows is an experimental CLI extension and framework for the gh GitHub CLI that lets developers author automation driven by natural language specifications instead of hand-written code, compiling those descriptions into GitHub Actions workflows that run AI agents (like Copilot, Claude Code, or Codex) on schedule or in response to repository events. By writing intent in markdown files, a developer can quickly generate .yml Actions workflows that perform tasks such as...
    Downloads: 4 This Week
  • 21
    LaVague

    Framework for building AI agents that automate complex web tasks

    LaVague is an open source framework designed to help developers build AI-powered web agents capable of automating tasks across websites and web applications. It implements the concept of a Large Action Model framework, allowing agents to interpret a user-provided objective and translate it into a sequence of actions performed in a browser. These agents can navigate web pages, retrieve information, fill out forms, and execute multi-step workflows automatically. LaVague is centered around a...
    Downloads: 3 This Week
  • 22
    Agent Starter Pack

    Ship AI Agents to Google Cloud in minutes, not months

    Agent Starter Pack is a production-focused framework that provides pre-built templates and infrastructure for rapidly developing and deploying generative AI agents on Google Cloud. It is designed to eliminate the complexity of moving from prototype to production by bundling essential components such as deployment pipelines, monitoring, security, and evaluation tools into a single package. Developers can create fully functional agent projects with a single command, generating both backend and...
    Downloads: 3 This Week
  • 23
    Courses (Anthropic)

    Anthropic's educational courses

    Anthropic’s courses repository is a growing collection of self-paced learning materials that teach practical AI skills using Claude and the Anthropic API. It’s organized as a sequence of hands-on courses—starting with API fundamentals and prompt engineering—so learners build capability step by step rather than in isolation. Each course mixes short readings with runnable notebooks and exercises, guiding you through concepts like model parameters, streaming, multimodal prompts, structured...
    Downloads: 3 This Week
  • 24
    SERA CLI

    A tool to use the Ai2 Open Coding Agents Soft-Verified Agents

    SERA CLI is a command-line tool created by AllenAI to enable developers to interact with the SERA (Soft-Verified Efficient Repository Agents) model family using Claude Code as the execution front end. It provides a convenient interface for deploying, testing, and using SERA models without needing to write scaffold code from scratch, acting as both a proxy and utility wrapper to simplify workflows that involve large agent models. Through sera-cli, users can connect to local or cloud-hosted...
    Downloads: 1 This Week
  • 25
    Sapiens

    High-resolution models for human tasks

    Sapiens is a research framework from Meta AI focused on embodied intelligence and human-like multimodal learning, aiming to train agents that can perceive, reason, and act in complex environments. It integrates sensory inputs such as vision, audio, and proprioception into a unified learning architecture that allows agents to understand and adapt to their surroundings dynamically. The project emphasizes long-horizon reasoning and cross-modal grounding—connecting language, perception, and...
    Downloads: 1 This Week