npp-compare free download

Showing 107 open source projects for "npp-compare"

View related business solutions

Artificial Intelligence Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
1

promptfoo

Evaluate and compare LLM outputs, catch regressions, improve prompts

Ensure high-quality LLM outputs with automatic evals. Use a representative sample of user inputs to reduce subjectivity when tuning prompts. Use built-in metrics, LLM-graded evals, or define your own custom metrics. Compare prompts and model outputs side-by-side, or integrate the library into your existing test/CI workflow. Use OpenAI, Anthropic, and open-source models like Llama and Vicuna, or integrate custom API providers for any LLM API.

Downloads: 3 This Week

Last Update: 6 days ago
See Project
2

Google AI Edge Gallery

A gallery that showcases on-device ML/GenAI use cases

Gallery is a curated collection of on-device machine learning examples, demo apps, and model artifacts designed to help developers experiment with and deploy ML at the edge. The project bundles runnable samples that show how to run TensorFlow Lite/Edge TPU models (and similar lightweight runtimes) on mobile and embedded platforms, demonstrating common tasks like image classification, object detection, audio recognition, and pose estimation. Each sample is intended to be both a learning aid...

Downloads: 1,099 This Week

Last Update: 2026-04-02
See Project
3

Every Code

Local AI coding agent CLI with multi-agent orchestration tools

...It is a community-driven fork of the Codex CLI, with a strong emphasis on improving real-world developer ergonomics and workflows. Every Code enhances the traditional coding assistant model by introducing multi-agent orchestration, allowing multiple AI agents to collaborate, compare solutions, and refine outputs in parallel. It supports integration with various AI providers, enabling users to route tasks across different models depending on their needs. Every Code also includes browser integration and automation capabilities, extending its usefulness beyond simple code generation into more complex development tasks. ...

Downloads: 21 This Week

Last Update: 3 days ago
See Project
4

pixelmatch

The smallest, simplest JavaScript pixel-level image comparison library

The smallest, simplest and fastest JavaScript pixel-level image comparison library, originally created to compare screenshots in tests. Features accurate anti-aliased pixels detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers).

Downloads: 1 This Week

Last Update: 2025-02-21
See Project
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
5

TruLens

Evaluation and Tracking for LLM Experiments

...An easy-to-use interface that allows developers to compare different versions of their applications, facilitating informed decision-making and optimization. TruLens supports various use cases, including question-answering, summarization, retrieval-augmented generation, and agent-based applications.

Downloads: 1 This Week

Last Update: 2026-04-09
See Project
6

Aim

An easy-to-use & supercharged open-source experiment tracker

Aim logs all your AI metadata (experiments, prompts, etc) enabling a UI to compare & observe them and SDK to query them programmatically. The Aim standard package comes with all integrations. If you'd like to modify the integration and make it custom, create a new integration package and share with others. Aim is an open-source, self-hosted AI Metadata tracking tool designed to handle 100,000s of tracked metadata sequences.

Downloads: 1 This Week

Last Update: 2025-05-08
See Project
7

SwanLab

An open-source, modern-design AI training tracking and visualization

SwanLab is an open-source experiment tracking and visualization platform designed to help machine learning engineers monitor, compare, and analyze the training of artificial intelligence models. The tool records training metrics, hyperparameters, model outputs, and experiment configurations so that developers can easily understand how different experiments perform over time. It provides a modern user interface for visualizing results, enabling teams to compare runs, track model performance trends, and collaborate on machine learning research. ...

Downloads: 1 This Week

Last Update: 2026-04-09
See Project
8

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

...With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start training your model. Start by creating an experiment. You can then monitor and manage your experiment, compare experiments, or push the model to Hugging Face to share it with the community.

Downloads: 4 This Week

Last Update: 2026-04-07
See Project
9

ReinforcementLearning.jl

A reinforcement learning package for Julia

A collection of tools for doing reinforcement learning research in Julia. Provide elaborately designed components and interfaces to help users implement new algorithms. Make it easy for new users to run benchmark experiments, compare different algorithms, and evaluate and diagnose agents. Facilitate reproducibility from traditional tabular methods to modern deep reinforcement learning algorithms. Make it easy for new users to run benchmark experiments, compare different algorithms, and evaluate and diagnose agents. Facilitate reproducibility from traditional tabular methods to modern deep reinforcement learning algorithms. ...

Downloads: 0 This Week

Last Update: 2025-01-13
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

DVC Extension for Visual Studio Code

https://github.com/iterative/vscode-dvc

A Visual Studio Code extension that integrates Data Version Control (DVC) into the development environment, enhancing reproducibility and collaboration for machine learning projects.

Downloads: 1 This Week

Last Update: 2026-03-02
See Project
11

Prompt Master

A Claude skill that writes the accurate prompts for any AI tool

...The project emphasizes clarity and organization, allowing users to categorize prompts by use case, domain, or functionality. It also supports experimentation, enabling users to refine prompts and compare results to achieve better outputs. The repository can be used as both a learning resource and a practical toolkit for developers working with AI systems. It reflects the growing importance of prompt engineering as a discipline in modern AI development.

Downloads: 6 This Week

Last Update: 2026-03-31
See Project
12

Agentex

Open source codebase for Scale Agentex

...It treats an “agent” as a composition of a policy (the LLM), tools, memory, and an execution runtime so you can test the whole loop, not just prompting. The repo focuses on structured experiments: standardized tasks, canonical tool interfaces, and logs that make it possible to compare models, prompts, and tool sets fairly. It also includes evaluation harnesses that capture success criteria and partial credit, plus traces you can inspect to understand where reasoning or tool use failed. The design encourages clean separation between experiment configuration and code, which makes sharing results or re-running baselines straightforward. ...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
13

Empirical

Test and evaluate LLMs and model configurations

Empirical is the fastest way to test different LLMs and model configurations, across all the scenarios that matter for your application.

Downloads: 0 This Week

Last Update: 2024-11-13
See Project
14

Norfair

Lightweight Python library for adding real-time multi-object tracking

...Supports moving camera, re-identification with appearance embeddings, and n-dimensional object tracking. Norfair provides several predefined distance functions to compare tracked objects and detections. The distance functions can also be defined by the user, enabling the implementation of different tracking strategies.

Downloads: 0 This Week

Last Update: 2025-04-30
See Project
15

Learning Interpretability Tool

Interactively analyze ML models to understand their behavior

The Learning Interpretability Tool (LIT, formerly known as the Language Interpretability Tool) is a visual, interactive ML model-understanding tool that supports text, image, and tabular data. It can be run as a standalone server, or inside of notebook environments such as Colab, Jupyter, and Google Cloud Vertex AI notebooks.

Downloads: 2 This Week

Last Update: 2024-12-20
See Project
16

MLflow

Open source platform for the machine learning lifecycle

MLflow is a platform to streamline machine learning development, including tracking experiments, packaging code into reproducible runs, and sharing and deploying models. MLflow offers a set of lightweight APIs that can be used with any existing machine learning application or library (TensorFlow, PyTorch, XGBoost, etc), wherever you currently run ML code (e.g. in notebooks, standalone applications or the cloud).

Downloads: 7 This Week

Last Update: 2026-04-13
See Project
17

Opik

Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI

...Opik is an open-source platform for evaluating, testing, and monitoring LLM applications. Built by Comet. Record, sort, search, and understand each step your LLM app takes to generate a response. Manually annotate, view, and compare LLM responses in a user-friendly table. Log traces during development and in production. Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation metrics or define your own with our convenient SDK library. Consult built-in LLM judges for complex issues like hallucination detection, factuality, and moderation.

Downloads: 10 This Week

Last Update: 11 hours ago
See Project
18

PapersGPT

A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.1, Claude

...One of its most powerful features is its ability to process large volumes of academic content quickly, enabling tasks such as literature reviews, theoretical analysis, and research synthesis to be completed significantly faster. It also supports multi-document querying, allowing users to compare findings across multiple papers and generate comprehensive overviews of research topics.

Downloads: 5 This Week

Last Update: 2026-04-11
See Project
19

Rewriting Project Claw Code

Ensure consistency and alignment between different codebases

...It focuses on maintaining parity across systems, which is particularly important in distributed architectures or multi-platform applications. The project provides mechanisms to compare, validate, and synchronize code or behavior, helping teams avoid discrepancies that can lead to bugs or inconsistencies. It may include automation tools that detect differences and enforce standards across repositories. The tool is useful in scenarios such as maintaining parity between frontend and backend logic, ensuring API consistency, or synchronizing multiple deployments. ...

Downloads: 2 This Week

Last Update: 2026-04-10
See Project
20

SAM 2

The repository provides code for running inference with SAM 2

...SAM2 comes with pretrained weights and easy-to-use APIs, enabling developers and researchers to integrate promptable segmentation into annotation tools, vision pipelines, or downstream tasks. The project also includes scripts and notebooks to compare SAM2 against SAM on edge cases, benchmarks showing improvements, and evaluation suites to measure mask quality metrics like IoU and boundary error.

Downloads: 8 This Week

Last Update: 2025-10-06
See Project
21

BrowserGym

A Gym environment for web task automation

...One of its main strengths is that it bundles several important benchmarks by default, including MiniWoB, WebArena, VisualWebArena, WorkArena, AssistantBench, WebLINX, and OpenApps. This gives researchers a unified way to compare agent behavior across diverse web environments and task types without stitching together separate evaluation stacks. BrowserGym is also designed to be extensible, and the repository notes that creating new benchmarks mainly involves inheriting its abstract task interface.

Downloads: 3 This Week

Last Update: 2026-03-09
See Project
22

Weights and Biases

Tool for visualizing and tracking your machine learning experiments

Use W&B to build better models faster. Track and visualize all the pieces of your machine learning pipeline, from datasets to production models. Quickly identify model regressions. Use W&B to visualize results in real time, all in a central dashboard. Focus on the interesting ML. Spend less time manually tracking results in spreadsheets and text files. Capture dataset versions with W&B Artifacts to identify how changing data affects your resulting models. Reproduce any model, with saved...

Downloads: 6 This Week

Last Update: 7 days ago
See Project
23

ChainForge

An open-source visual programming environment

ChainForge is an open-source visual programming environment designed to help developers systematically test, compare, and evaluate prompts and outputs across multiple large language models in a structured and scalable way. Instead of relying on isolated prompt experimentation, it introduces a dataflow-based interface that allows users to create complex prompt pipelines and evaluate them across different models, parameters, and datasets simultaneously.

Downloads: 1 This Week

Last Update: 2026-03-18
See Project
24

Agent Stack

Deploy and share agents with open infrastructure

...The platform supports agents built in frameworks like LangChain, CrewAI, etc., enabling them to be hosted, managed and shared through a unified interface. It also offers multi-model, multi-provider support (OpenAI, Anthropic, Gemini, IBM WatsonX, Ollama etc.), letting users compare performance and cost across models. For developers and organizations building AI-agent products or automations, Agent Stack gives a scaffold that handles the “plumbing”, so they can focus on logic and domain.

Downloads: 7 This Week

Last Update: 2026-03-30
See Project
25

Advanced + Agentic RAG Cookbooks

Advanced RAG cookbooks for building accurate LLM applications

...Athina AI’s RAG Cookbooks covers the full RAG pipeline, including indexing, retrieval, augmentation, and generation, while also addressing evaluation to measure accuracy and relevance. It includes multiple approaches such as hybrid search, contextual compression, and agent-based retrieval strategies, allowing users to experiment and compare methods. It is designed to reduce development time by offering practical examples and references to research papers, making it useful for both learning and production use. Overall, it serves as a hands-on resource for improving LLM outputs using external data sources.

Downloads: 3 This Week

Last Update: 4 days ago
See Project