Showing 65 open source projects for "data quality"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    lakeFS

    lakeFS

    lakeFS - Git-like capabilities for your object storage

    ...Easily Collaborate on production data with your team. Automate data quality checks within data pipelines.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Arize Phoenix

    Arize Phoenix

    Uncover insights, surface problems, monitor, and fine tune your LLM

    Phoenix provides ML insights at lightning speed with zero-config observability for model drift, performance, and data quality. Phoenix is an Open Source ML Observability library designed for the Notebook. The toolset is designed to ingest model inference data for LLMs, CV, NLP and tabular datasets. It allows Data Scientists to quickly visualize their model data, monitor performance, track down issues & insights, and easily export to improve. Deep Learning Models (CV, LLM, and Generative) are an amazing technology that will power many of future ML use cases. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    ODD Platform

    ODD Platform

    First open-source data discovery and observability platform

    ...Know the impact of each code change with automatic testing. Enjoy lineage and alerts powered with data quality information.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Dagster

    Dagster

    An orchestration platform for the development, production

    Dagster is an orchestration platform for the development, production, and observation of data assets. Dagster as a productivity platform: With Dagster, you can focus on running tasks, or you can identify the key assets you need to create using a declarative approach. Embrace CI/CD best practices from the get-go: build reusable components, spot data quality issues, and flag bugs early. Dagster as a robust orchestration engine: Put your pipelines into production with a robust multi-tenant, multi-tool engine that scales technically and organizationally. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    SDGym

    SDGym

    Benchmarking synthetic data generation methods

    ...You also customize the process to include your own work. Select any of the publicly available datasets from the SDV project, or input your own data. Choose from any of the SDV synthesizers and baselines. Or write your own custom machine learning model. In addition to performance and memory usage, you can also measure synthetic data quality and privacy through a variety of metrics. Install SDGym using pip or conda. We recommend using a virtual environment to avoid conflicts with other software on your device.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    syslog-ng

    syslog-ng

    Log management solution that improves the performance of SIEM

    syslog-ng is the log management solution that improves the performance of your SIEM solution by reducing the amount and improving the quality of data feeding your SIEM. With syslog-ng Store Box, you can find the answer. Search billions of logs in seconds using full text queries with Boolean operators to pinpoint critical logs. syslog-ng Store Box provides secure, tamper-proof storage and custom reporting to demonstrate compliance. syslog-ng can deliver data from a wide variety of sources to Hadoop, Elasticsearch, MongoDB, and Kafka as well as many others. syslog-ng flexibly routes log data from X sources to Y destinations. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    AWS IoT FleetWise Edge

    AWS IoT FleetWise Edge

    AWS IoT FleetWise Edge Agent

    ...Improve electric vehicle (EV) battery range estimates with crowdsourced environmental data, such as weather and driving conditions, from nearby vehicles. Collect select data from nearby vehicles and use it to notify drivers of changing road conditions, such as lane closures or construction. Use near real-time data to proactively detect and mitigate fleet-wide quality issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Elementary

    Elementary

    Open-source data observability for analytics engineers

    Elementary is an open-source data observability solution for data & analytics engineers. Monitor your dbt project and data in minutes, and be the first to know of data issues. Gain immediate visibility, detect data issues, send actionable alerts, and understand the impact and root cause. Generate a data observability report, host it or share with your team. Monitoring of data quality metrics, freshness, volume and schema changes, including anomaly detection. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    AdGuard Home

    AdGuard Home

    Network-wide ads and trackers blocking DNS server

    ...This includes video ads and ads in your favorite apps, browsers, games, and on any website you can imagine. Dozens of ad filters are available to you and are updated on a regular basis, guaranteeing the best filtering quality. Protecting your personal data is our top priority. With AdGuard, you and your sensitive data will be safe from any online tracker and analytics system that may attempt to steal your data while surfing the web. Use the Family protection mode to block access to all websites with adult content and enforce safe search in the browser, in addition to the regular perks of ad blocking and browsing security.
    Downloads: 49 This Week
    Last Update:
    See Project
  • 10
    Grafana

    Grafana

    Leading open-source visualization and observability platform

    Grafana OSS is the leading open-source platform for visualization and observability. It enables teams to query, visualize, alert on, and explore telemetry data from multiple sources in a single interface. With support for 100+ data source plugins—including Prometheus, Loki, Elasticsearch, InfluxDB, SQL/NoSQL databases, and OpenTelemetry—Grafana helps teams correlate metrics, logs, and traces across applications and infrastructure. Users can build interactive dashboards with rich...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 11
    OpenTelemetry

    OpenTelemetry

    OpenTelemetry Go API and SDK

    OpenTelemetry-Go is the Go implementation of OpenTelemetry. It provides a set of APIs to directly measure the performance and behavior of your software and send this data to observability platforms. High-quality, ubiquitous, and portable telemetry to enable effective observability. OpenTelemetry is a collection of APIs, SDKs, and tools. Use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) to help you analyze your software’s performance and behavior.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Helicone

    Helicone

    Open source LLM-Observability Platform for Developers

    Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, Anthropic SDK, LiteLLM, LLamaIndex, LangChain, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    rag-search

    rag-search

    RAG Search API

    ...It is built to be easily deployable, requiring only environment configuration and dependency installation to run a functional RAG service. The system supports configurable filtering, scoring thresholds, and reranking options, allowing developers to fine-tune retrieval quality. Its architecture is modular, separating handlers, services, and utilities to support customization and extension. Overall, rag-search serves as a practical starter backend for teams building AI search or question-answering applications on their own data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Inbucket

    Inbucket

    Disposable webmail server (similar to Mailinator) with built in SMTP

    Inbucket is an email testing application; it will accept messages from any email address and make them available to view via a web interface. When you need to test your webapp's outbound emails with Mailinator but are stuck behind a firewall, Inbucket provides the solution. It allows you to keep your new application development secret until it's time to release it. Inbucket is ideal for validating that emails go out as part of your integration test suite, sending links to coworkers to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MentDB Projects

    MentDB Projects

    Generalized Interoperability and Strong AI

    MentDB is an open-source platform driving research into next-generation AI and universal data exchange. Our architecture is built around the revolutionary Mentalese Query Language (MQL). MentDB Weak (Generalized Interoperability): A unified data layer enabling seamless data exchange and application integration (SOA, ETL, Data Quality). We eliminate data silos through a single, generalized data language. MentDB Strong (Strong AI / AGI): The framework for exploring and building Machine Consciousness, free will, and advanced ethical reasoning systems. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 17
    GeigerLog

    GeigerLog

    Python program for Geiger counters and Environmental Sensors

    ...Continues to support Geiger counters like GQ Electronic's GMC-300/500/600 , RadMon, RadPro, Gamma-Scout, audio-click and other low-cost Geiger counters, IoT devices, I2C based sensors, GQ EMF devices. GeigerLog reads data from devices, saves them to databases, prints to screen and plots as auto-updated, customizable graphs. Quality control using Statistics, Scatter, Poisson, FFT & Autocorrelation, and more. Gamma Spectra - can be displayed, calibrated, and evaluated in GeigerLog! The Raspberry Pi 4, 5 qualify as GeigerLog hosts.
    Leader badge
    Downloads: 45 This Week
    Last Update:
    See Project
  • 18
    JDiskMark

    JDiskMark

    Java Disk Benchmark Utility

    JDiskMark is a powerful and versatile disk benchmarking tool written in Java, designed to help accurately measure the performance of your storage devices across different operating systems. Whether you're testing HDDs, SSDs, or network drives, JDiskMark provides reliable insights into your disk's read and write...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 19
    TCNOpen

    TCNOpen

    Components for IEC61375 standardised communication

    ...TCN (Train Communication Network) is a series of international standards (IEC61375) developed by Working Group 43 of the IEC (International Electrotechnical Commission), specifying a communication system for the data communication within and between vehicles of a train. It is currently in use on many thousands of trains in the world in order to allow electronic devices to exchange information while operating aboard the same train. TCNOpen follows the Open Source scheme, as the software is jointly developed by participating companies, according to their role, so as to achieve cheaper, quicker and better quality results. ...
    Leader badge
    Downloads: 42 This Week
    Last Update:
    See Project
  • 20
    i-doit - Open Source CMDB & ITSM Tool

    i-doit - Open Source CMDB & ITSM Tool

    ITIL conform CMDB solution for documenting your entire IT landscape

    i-doit is a web based IT documentation and CMDB. i-doit documents IT-systems and their changes, defines emergency plans, displays vital information and helps to ensure a stable and efficient IT operation: The technical documentation: This means that all information can be organised (details flexibly adjusted), stored and maintained in one place. This allows simple Helpdesk, Support and Planning functions and provides the basis for quality assurance and certification. i-doit allows services to be documented as a logical grouping of objects placed in relation to each other. This provides the basis for (ITIL) processes such as Change and Problem Management, helps to control and manage the SLA (Service Level Agreement) or the communication with users. Its interfaces to other ITSM applications (like Nagios, Request Tracker etc.) allow the entering of data only once in the ITSM environment. i-doit maintains one holistic view of the operational situation.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 21
    Open Crypto Tracker

    Open Crypto Tracker

    Bitcoin Alts portfolio tracker, email / text / alexa / telegram alerts

    100% FREE / open source / PRIVATE cryptocurrency portfolio tracker. Email / text / alexa / telegram price alerts, price charts, mining calcs, leverage / gain / loss / balance stats, news feeds +more. Privately track Bitcoin / Ethereum / unlimited cryptocurrencies. Customize as many assets / markets / alerts / charts as you want. Over 50 Exchanges / 40 Trading Pairs Supported (exchanges / pairings list at bottom of README.txt): https://tinyurl.com/ct-readme Nearly Unlimited Assets...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    FileHashler

    FileHashler

    cross-platform (Java) file encryption and archiving tool

    Java based API (library) and console utility for cross-platform file and directory encryption and archiving. Encrypts single files up to 9.2 EB length. Directories can be encrypted as sets of single files in a single action. Uses Twofish (CBC) and SHA-256. Security level is strong and benefits from the PWS3 (Password Safe) encrypted file format. The level can be lowered to 128-bit key-length (normally 256).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    TurboVNC

    TurboVNC

    High-speed, 3D-friendly, TightVNC-compatible remote desktop software

    TurboVNC is a high-performance, enterprise-quality version of VNC based on TightVNC, TigerVNC, and X.org. It contains a variant of Tight encoding that is tuned for maximum performance and compression with 3D applications (VirtualGL), video, and other image-intensive workloads. TurboVNC, in combination with VirtualGL, provides a complete solution for remotely displaying 3D applications with interactive performance. TurboVNC's high-speed encoding methods have been adopted by TigerVNC and...
    Leader badge
    Downloads: 133,387 This Week
    Last Update:
    See Project
  • 24
    Crane

    Crane

    Crane is a FinOps Platform for Cloud Resource Analytics and Economics

    Crane is a FinOps Platform for Cloud Resource Analytics and Economics in Kubernetes clusters. The goal is not only to help users to manage cloud cost easily but also to ensure the quality of applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo