8 projects for "model based testing tool" with 2 filters applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    SERA CLI

    SERA CLI

    A tool to use the Ai2 Open Coding Agents Soft-Verified Agents

    SERA CLI is a command-line tool created by AllenAI to enable developers to interact with the SERA (Soft-Verified Efficient Repository Agents) model family using Claude Code as the execution front end. It provides a convenient interface for deploying, testing, and using SERA models without needing to write scaffold code from scratch, acting as both a proxy and utility wrapper to simplify workflows that involve large agent models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Crush

    Crush

    The glamourous AI CLI coding agent for your favourite terminal 💘

    Crush is a next-generation, terminal-based AI coding assistant developed by Charm, designed to seamlessly integrate with your tools, workflows, and preferred LLMs. It provides developers with an intuitive, session-based experience where multiple contexts can be managed across projects. With flexible model switching, Crush allows you to change providers mid-session while retaining conversation history.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    agents-cli

    agents-cli

    CLI to turn coding assistants into expert at deploying AI agents

    agents-cli is a command-line tool developed to simplify the creation, management, and execution of AI agents directly from the terminal. It provides developers with a structured interface for defining agent behavior, configuring tools, and running workflows. The tool integrates with agent frameworks and supports modular extensions for adding new capabilities. It emphasizes productivity by enabling rapid iteration and testing of agent logic without complex setup. agents-cli is designed to fit into modern developer workflows, particularly those that rely on automation and scripting. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    VibeKit

    VibeKit

    Run Claude Code, Gemini, Codex in a clean, isolated sandbox

    ...Instead of treating AI models as black boxes behind simple prompts, Vibekit encourages developers to define declarative behaviors, reactive rules, and data flows that make the outputs of models part of living application logic. This can include things like dynamic content generation, live adaptation based on user interaction, and connectors to external APIs for enriched grounding. The toolkit also supports testing and local iteration, with utilities that simulate event streams and mock model responses to make development predictable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    Learn Claude Code

    Learn Claude Code

    Bash is all you need, write a claude code with only 16 line code

    Learn Claude Code is an educational repository that teaches how modern AI coding agents work by walking learners through a sequence of progressively more complex agent implementations, starting with a minimal Bash-based agent and culminating in agents with explicit planning, subagents, and skills. It emphasizes a hands-on learning path where each version (from v0 to v4) adds conceptual building blocks like the core agent loop, todo planning, task decomposition, and domain knowledge skills,...
    Downloads: 65 This Week
    Last Update:
    See Project
  • 6
    MiniMax-M2

    MiniMax-M2

    MiniMax-M2, a model built for Max coding & agentic workflows

    MiniMax-M2 is an open-weight large language model designed specifically for high-end coding and agentic workflows while staying compact and efficient. It uses a Mixture-of-Experts (MoE) architecture with 230 billion total parameters but only 10 billion activated per token, giving it the behavior of a very large model at a fraction of the runtime cost. The model is tuned for end-to-end developer flows such as multi-file edits, compile–run–fix loops, and test-validated repairs across real...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    deepclaude

    deepclaude

    Use Claude Code's agent loop with DeepSeek V4 Pro, OpenRouter & more

    deepclaude is a lightweight proxy tool that enables developers to run Claude Code’s autonomous coding agent loop using alternative AI backends like DeepSeek V4 Pro, OpenRouter, or other Anthropic-compatible models. It preserves the full Claude Code experience—including file editing, terminal execution, and multi-step agent workflows—while dramatically reducing operational costs. By swapping out the underlying model instead of the interface, deepclaude delivers the same familiar UX with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Kimi K2.7 Code

    Kimi K2.7 Code

    Coding-focused Kimi model for long-horizon agent workflows

    Kimi K2.7 Code is a coding-focused agentic model built on Kimi K2.6, designed for long-horizon software engineering, autonomous coding workflows, and complex tool-based execution. It improves end-to-end task completion across real-world programming scenarios while reducing thinking-token usage by about 30% compared with K2.6. Architecturally, it uses a 1T-parameter Mixture-of-Experts design with 32B activated parameters, 61 layers, 384 experts, a 256K-token context window, and a MoonViT vision encoder. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo