Showing 49 open source projects for "reliability"

  • 1
    Upsonic

    The most reliable AI agent framework that supports MCP

    Upsonic is a reliability-focused AI agent framework designed for real-world applications. It enables the development of trusted agent workflows within organizations by incorporating advanced reliability features, such as verification layers and output evaluation systems. The framework supports the Model Context Protocol (MCP), facilitating integration with various tools and enhancing agent capabilities.
    Downloads: 0 This Week
  • 2
    Future AGI

    Open-source platform for evaluating, observing, and improving LLM

    ...It supports both cloud and self-hosted deployment models, making it useful for teams with different privacy, infrastructure, and compliance needs. Future AGI is especially relevant for agent-heavy products where reliability, regression testing, and safety checks matter before and after release. Its main value is turning AI agent development into a measurable engineering process instead of an informal cycle of prompting, guessing, and manual review.
    Downloads: 7 This Week
  • 3
    KeepChatGPT

    Browser userscript that enhances ChatGPT reliability and usability

    KeepChatGPT is an open source browser userscript designed to enhance the reliability, usability, and efficiency of the ChatGPT web interface. It runs through userscript managers and injects additional functionality directly into the page, allowing users to improve their workflow without requiring a backend service or separate application. It focuses on solving common problems experienced during AI conversations, such as session timeouts, network errors, message failures, and interruptions during long chats. ...
    Downloads: 3 This Week
  • 4
    AnyTool

    AnyTool: Universal Tool-Use Layer for AI Agents

    ...Rather than having each agent handle tool invocation logic on its own, AnyTool provides a standardized interface and orchestrator that intelligently selects and manages tools, reduces context overhead, and improves execution reliability across diverse capabilities like web APIs, local commands, and GUI automation. It uses progressive filtering and adaptive orchestration to ensure the right tools are retrieved efficiently and work cohesively with agents of varying complexity, scaling to thousands of tools with self-optimizing behavior. The system also tracks tool reliability and quality, offering a safer and more predictable automation experience with persistent learning from previous executions.
    Downloads: 2 This Week
  • 5
    HolmesGPT

    CNCF Sandbox Project

    HolmesGPT is an open-source AI agent designed to help DevOps and site reliability engineering teams diagnose and resolve production incidents. The system aggregates signals from observability tools such as logs, metrics, alerts, and distributed traces, then analyzes them using large language models to identify potential root causes. Rather than requiring engineers to manually correlate large volumes of monitoring data, HolmesGPT automatically synthesizes evidence and presents explanations in natural language. ...
    Downloads: 1 This Week
  • 6
    Agent Skills for Context Engineering

    A comprehensive collection of Agent Skills for context engineering

    ...Rather than being a single application, it packages practical guidance into skill modules that agents can load to improve planning, retrieval, memory usage, and overall reliability in real workflows. The repository emphasizes context engineering as a discipline, covering why agents fail when context gets too large, too noisy, or poorly structured, and how to mitigate those failure modes with repeatable patterns. It is designed to be used across modern agent environments that support skill folders and structured instructions, so teams can standardize how agents operate instead of relying on ad-hoc prompting.
    Downloads: 0 This Week
  • 7
    NVIDIA NeMo Agent Toolkit

    Library for efficiently connecting and optimizing teams of AI agents

    NVIDIA NeMo Agent Toolkit is an open-source framework designed to build, optimize, and manage AI agents across different development ecosystems. It provides enterprise-grade tools for improving agent performance, reliability, and observability throughout the development lifecycle. The toolkit integrates with popular agent frameworks such as LangChain, LlamaIndex, CrewAI, Microsoft Semantic Kernel, and Google ADK. Developers can monitor agent execution, trace workflows, and analyze token-level performance to identify bottlenecks and improve efficiency. ...
    Downloads: 0 This Week
  • 8
    CLI-Anything

    Making ALL Software Agent-Native

    ...It integrates with multiple AI platforms such as Claude Code, OpenClaw, Codex, and GitHub Copilot CLI, enabling cross-platform compatibility and flexibility. CLI-Anything emphasizes structured outputs such as JSON to reduce parsing complexity and improve reliability in automation scenarios.
    Downloads: 12 This Week
  • 9
    Composio

    Composio equips your AI agents & LLMs

    Empower your AI agents with Composio - a platform for managing and integrating tools with LLMs & AI agents using Function Calling. Equip your agent with high-quality tools & integrations without worrying about authentication, accuracy, and reliability in a single line of code.
    Downloads: 8 This Week
  • 10
    Spec Kit

    Toolkit to help you get started with Spec-Driven Development

    ...The toolkit provides scaffolding, prompt templates, and automation scripts that help teams maintain a clear source of truth throughout the development lifecycle. By emphasizing intent before code, Spec Kit reduces ambiguity and improves the reliability of AI-generated output. It integrates with popular AI coding tools such as GitHub Copilot and similar assistants, allowing developers to embed spec-driven practices directly into their existing workflows. Overall, the project aims to improve collaboration between humans and AI by making software development more predictable, traceable, and maintainable.
    Downloads: 7 This Week
  • 11
    TypeChat

    Library for building type-safe natural language interfaces with LLMs

    ...Traditional natural language interfaces often relied on complex decision trees to interpret user intent and gather required inputs. With the rise of large language models, developers can interpret user requests more easily, but they still face challenges related to output reliability, safety, and structured responses. TypeChat addresses these challenges by replacing traditional prompt engineering with a concept called schema engineering. Instead of writing complex prompts, developers define types that represent the intents supported by their applications. It then uses those type definitions to construct prompts for language models and translate user input into structured data that follows the defined schema.
    Downloads: 4 This Week
  • 12
    DSPy

    DSPy: The framework for programming—not prompting—language models

    Developed by the Stanford NLP Group, DSPy (Declarative Self-improving Python) is a framework that enables developers to program language models through compositional Python code rather than relying solely on prompt engineering. It facilitates the construction of modular AI systems and provides algorithms for optimizing prompts and weights, enhancing the quality and reliability of language model outputs.
    Downloads: 0 This Week
  • 13
    Kiln

    Open source platform for managing, testing, and deploying AI apps

    Kiln is an open source platform designed to help developers build, evaluate, and deploy AI-powered applications with greater structure and reliability. It provides a unified environment for managing prompts, datasets, and evaluation workflows, allowing teams to iterate on AI behavior in a controlled and measurable way. Kiln emphasizes reproducibility, enabling users to track changes to prompts and models while comparing outputs across different configurations. Kiln also supports systematic testing of AI systems by defining evaluation criteria and running experiments to assess performance over time. ...
    Downloads: 1 This Week
  • 14
    TypeAgent Python

    Structured RAG: ingest, index, query

    ...Instead of relying solely on free-form prompts, the architecture emphasizes converting natural language interactions into structured representations that can be processed by deterministic software components. This design allows the system to combine the flexibility of language models with the reliability of traditional programming logic. The repository is intended primarily as a research prototype and sample code rather than a production-ready framework, allowing developers to experiment with building AI agents that maintain structured memory and perform tasks through defined actions.
    Downloads: 1 This Week
  • 15
    Phidata

    Build multi-modal Agents with memory, knowledge, tools and reasoning

    ...Phidata offers pre-configured templates to accelerate development and deployment, allowing users to quickly go from building agents to shipping them into production. It includes features like real-time monitoring, agent evaluations, and performance optimization tools, ensuring the reliability and scalability of AI solutions. Phidata also allows developers to bring their own cloud infrastructure, offering flexibility for custom setups. The platform provides robust support for enterprises, including security features, agent guardrails, and automated DevOps for smoother deployment processes.
    Downloads: 2 This Week
  • 16
    MiMo-V2.5-ASR

    Robust Speech Recognition Across Languages, Dialects

    ...It focuses on scalability and performance, making it suitable for both research and production applications. Overall, it represents a high-performance speech recognition solution optimized for versatility and reliability.
    Downloads: 0 This Week
  • 17
    OpenLIT

    OpenLIT is an open-source LLM Observability tool

    ...Whether you're working with popular LLM providers such as OpenAI and HuggingFace, or leveraging vector databases like ChromaDB, OpenLIT ensures your applications are monitored seamlessly, providing critical insights including GPU performance stats for self-hosted LLMs to improve performance and reliability. This project proudly follows the Semantic Conventions of the OpenTelemetry community, consistently updating to align with the latest standards in observability.
    Downloads: 0 This Week
  • 18
    ContextForge MCP Gateway

    A Model Context Protocol (MCP) Gateway & Registry

    ...Operators can define virtual servers, wire multiple transports, and optionally enable an admin UI for management and monitoring. Packaged for quick starts via PyPI and Docker, it targets production reliability with health checks, metrics, and structured logs. The project positions itself as an integration hub so agentic apps can “connect once, use many” backends with consistent policy and lifecycle control.
    Downloads: 1 This Week
  • 19
    Claude Codex Settings

    My personal Claude Code and OpenAI Codex setup

    Claude Codex Settings is a configuration-focused repository that provides curated settings, prompts, and workflow optimizations for improving AI-assisted coding environments. It is designed to help developers fine-tune how Claude and similar models behave within coding workflows, ensuring more consistent and high-quality outputs. The project emphasizes practical usability, offering ready-to-use configurations that can be directly integrated into development environments. It also includes...
    Downloads: 0 This Week
  • 20
    OpenSpace

    OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving

    ...It also focuses on cost efficiency by reducing redundant computations and reusing successful workflows, significantly lowering token usage in repeated tasks. The framework includes monitoring and evaluation mechanisms to track skill performance and ensure reliability as systems evolve. It supports integration with various agent platforms, making it flexible and extensible across different environments.
    Downloads: 0 This Week
  • 21
    ComfyUI-Copilot

    AI assistant for ComfyUI workflow generation, debugging, and tuning

    ...ComfyUI-Copilot leverages large language model capabilities to analyze user intent, recommend nodes, and suggest models that match specific requirements. It also provides automated error detection and repair suggestions, improving reliability during development.
    Downloads: 0 This Week
  • 22
    Humanoid-Gym

    Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real

    ...The framework emphasizes the concept of zero-shot sim-to-real transfer, meaning that behaviors learned in simulation can be deployed directly on physical robots with minimal adjustment. To improve reliability and generalization, the framework also includes sim-to-sim validation pipelines that test trained policies across different physics engines.
    Downloads: 0 This Week
  • 23
    FuzzyAI Fuzzer

    A powerful tool for automated LLM fuzzing

    FuzzyAI is an open-source fuzzing framework designed to test the security and reliability of large language model applications. The tool automates the process of generating adversarial prompts and input variations to identify vulnerabilities such as jailbreaks, prompt injections, or unsafe model responses. It allows developers and security researchers to systematically evaluate the robustness of LLM-based systems by simulating a wide range of malicious or unexpected inputs.
    Downloads: 0 This Week
  • 24
    LMOps

    General technology for enabling AI capabilities w/ LLMs and MLLMs

    ...It includes experimental tools and frameworks that help developers optimize prompts, design workflows for generative models, and manage the lifecycle of LLM-based systems. The initiative also investigates techniques for improving the reliability, scalability, and maintainability of applications powered by large models. By addressing challenges such as prompt engineering, evaluation strategies, and deployment infrastructure, LMOps aims to establish best practices for operating large language model systems in real-world environments.
    Downloads: 0 This Week
  • 25
    TorchRL

    A modular, primitive-first, python-first PyTorch library

    TorchRL is an open-source Reinforcement Learning (RL) library for PyTorch. It provides PyTorch-first and Python-first abstractions for RL, both low-level and high-level, that are intended to be efficient, modular, documented, and properly tested. The code is aimed at supporting research in RL. Most of it is written in Python in a highly modular way, such that researchers can easily swap components, transform them, or write new ones with little effort.
    Downloads: 0 This Week