Showing 3 open source projects for "scoring"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Output

    Output

    TypeScript framework for building AI workflows and agents

    ...It eliminates reliance on fragmented SaaS tools by providing all necessary components locally, ensuring better transparency and control over data and processes. Output includes built-in evaluation systems, such as LLM-as-a-judge scoring, and integrates workflow orchestration tools like Temporal to handle retries, parallel execution, and state management. It also supports multiple model providers through a unified API.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    StabilityMatrix

    StabilityMatrix

    Multi-Platform Package Manager for Stable Diffusion

    ...It provides a framework to run experiments systematically—capturing inputs, model configurations, outputs, and metrics—so researchers and practitioners can reason about differences in quality, robustness, and failure modes. The repository often bundles tooling for automated prompt sweeping, scoring heuristics (such as diversity, coherence, or task-specific metrics), and visualization helpers to make comparisons interpretable. This approach is useful for model selection, prompt engineering, and benchmarking new checkpoints against baseline models under reproducible conditions. By turning ad-hoc tests into tracked experiments, StabilityMatrix reduces bias, surfaces subtle regressions, and accelerates iteration when tuning generative systems.
    Downloads: 87 This Week
    Last Update:
    See Project
  • 3
    BentoML

    BentoML

    Unified Model Serving Framework

    BentoML simplifies ML model deployment and serves your models at a production scale. Support multiple ML frameworks natively: Tensorflow, PyTorch, XGBoost, Scikit-Learn and many more! Define custom serving pipeline with pre-processing, post-processing and ensemble models. Standard .bento format for packaging code, models and dependencies for easy versioning and deployment. Integrate with any training pipeline or ML experimentation platform. Parallelize compute-intense model inference...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo