Showing 260 open source projects for "data quality"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    WhyLogs Java Library

    WhyLogs Java Library

    Profile and monitor your ML data pipeline end-to-end

    This is a Java implementation of WhyLogs, with support for Apache Spark integration for large scale datasets. Understanding the properties of data as it moves through applications is essential to keeping your ML/AI pipeline stable and improving your user experience, whether your pipeline is built for production or experimentation. WhyLogs is an open source statistical logging library that allows data science and ML teams to effortlessly profile ML/AI pipelines and applications, producing log...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    TimeSlotTracker

    TimeSlotTracker

    Simple and useful time tracker

    Simple and useful time tracker. Collects tasks and works (timeslots) in hierarchical tree. Has: reports (based on xslt templates), localization (7 langs), xml data format, ical, jnlp and JIRA (R) support. Runs on Windows, Mac and Linux. The code at GitHub: https://github.com/TimeSlotTracker/timeslottracker-desktop The git on Source forge is read only (replica).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    SiliCompressor

    SiliCompressor

    Easy to use Video and Image compression library for Android

    A powerful, flexible, and easy-to-use Video and Image compression library for Android. It's usually said that "A picture is worth a thousand words". Images add flair and beauty to our android apps, but we usually have problems with these images due to their large size. With SiliCompressor you can now compress and use your images more smoothly. Due to the high resolution of our Smartphone cameras and cameras from other devices, Video files have become large in size and thus difficult for them...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    CloudTest-Cloud java unit test framework

    CloudTest-Cloud java unit test framework

    A redefined framework with new approach and methodology for unit test

    CloudTest is a redefined unit testing approach and methodology, which can make your testing jobs become much more easy and efficient. It is a pure java lightweight framework integrated test cases management, test data management, assert management, automation regression, performance monitor and test report in one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    UbForms

    Field data measurements entry and complex data structures at ease

    With UbForms a person can manage field data measurements entry and complex data structures at ease. This app is inspired and based on the UrForms data model and design with several intricate enhancements and fixes. It is open source and free with a low privacy footprint.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Jack LIRE (Linked Record)

    Jack LIRE (Linked Record)

    Quantum universal object system with referencing and genetics

    Jack LIRE (Linked Record) is a quantum database-backed life-long text record diary with referencing (linking) between records, and a data library for life-long keeping of files such as documents, photographs, music and more. The diary records are fully searchable. Files can be mentioned in diary records to preserve memories, document knowledge and ideas, and more. Massive numbers of diary records and files can be permanently kept. A database located on one computer can be accessed from...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    cglib

    cglib

    High level API to generate and transform Java byte code

    Byte Code Generation Library is high-level API to generate and transform Java byte code. It is used by AOP, testing, data access frameworks to generate dynamic proxy objects and intercept field access. cglib is a powerful, high-performance and quality Code Generation Library. It is used to extend Java classes and implement interfaces at runtime. See samples and API documentation to learn more about features. This library is free software, freely reusable for personal or commercial purposes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    apache spark data pipeline osDQ

    apache spark data pipeline osDQ

    osDQ dedicated to create apache spark based data pipeline using JSON

    This is an offshoot project of open source data quality (osDQ) project https://sourceforge.net/projects/dataquality/ This sub project will create apache spark based data pipeline where JSON based metadata (file) will be used to run data processing , data pipeline , data quality and data preparation and data modeling features for big data. This uses java API of apache spark.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 10
    CloverDX

    CloverDX

    Design, automate, operate and publish data pipelines at scale

    Please, visit www.cloverdx.com for latest product versions. Data integration platform; can be used to transform/map/manipulate data in batch and near-realtime modes. Suppors various input/output formats (CSV,FIXLEN,Excel,XML,JSON,Parquet, Avro,EDI/X12,HL7,COBOL,LOTUS, etc.). Connects to RDBMS/JMS/Kafka/SOAP/Rest/LDAP/S3/HTTP/FTP/ZIP/TAR. CloverDX offers 100+ specialized components which can be further extended by creation of "macros" - subgraphs - and libraries, shareable with 3rd...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    benerator is a framework for creating realistic and valid high-volume test data, used for load and performance testing and showcase setup. Data is generated from an easily configurable metadata model and exported to databases, XML, CSV or flat files.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    CallbackParams is a JUnit-extension for writing parameterized tests with new innovative patterns that offer elegant solutions to many of the drawbacks that are traditionally associated with parameterized testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Luban

    Luban

    Image compression with efficiency very close to WeChat

    ...Although there are already a lot of writings on the internet on the topic, a great number of possible scenarios still have to be thought of, like unsuccessful compressions, too small pictures (eg. for profile pictures) or too bad image quality. Naturally, the first idea was to see how WeChat, the app giant manages this task in action. To gather data, 100 images with different resolutions were sent through WeChat Moments, then the compressed images were compared with the original ones. Luban's foundation is the result of this analysis on WeChat's compression method. Because the process was analyzed backward, Luban's efficacy is not yet exactly the same as that of WeChat, but the results are already very close to what WeChat Moments' image compression produces.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    TimeCollect
    TimeCollect records time you spent on your development work. Core features are: Automatic uptime monitoring, recording a break with one click, Trac integration for fetching tickets you are working on, storage in Derby or external MySQL database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ImageBox

    ImageBox

    An image Exif rotation tool.

    ImageBox is a free image editing software specially designed for easy image rotation. The goal of this project is the correct rotated representation of JPG, PNG, BMP and GIF images on any hardware, such as TVs and Media Centers. Why are images shown differently rotated? An image may contain Exif metadata in addition to the image data. A digital camera often encodes the image data with the rotation of the lens. If the camera is rotated during recording, the image data is also encoded...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    OpenCms ANT Build allows to build OpenCms modules from a build tree (e.g. in your IDE) which then may be installed on your OpenCms server. It contains 2 ANT task for scripting custom build processes. A sample tree with build.xml is included.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CamShaft
    Calculation of camshafts - allows to define a camshaft profile on the set trajectory which should provide the given mechanism. The trajectory can be set graphically, co-ordinates or function.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FilteredPush
    Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CRUSH

    CRUSH

    Data reduction and imaging for select astronomical cameras

    CRUSH is a pioneering reduction and imaging suite for many astronomical cameras operating at far-infrared, submillimeter and millimeter wavelengths, such as SHARC-2, SCUBA-2, LABOCA, GISMO, or SOFIA/HAWC+ (to name a few).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    MOIRAI

    Simple Scientific Workflow System for CAGE Analysis

    ...After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. MOIRAI package comes with three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    DataCleaner

    DataCleaner

    Data quality analysis, profiling, cleansing, duplicate detection +more

    DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    CSV Comparator

    CSV Comparator

    CSV Comparator is a tool for comparing files which contain CSV-Data.

    CSV Comparator is a tool for comparing files which contain CSV-Data. It was created for performance engineers to be able to have a easy way to compare data from two testruns and be able to automate the comparing process of various files with scripts. There is a command line tool as well as a user interface. In case you need to compare .xml-Files you can convert them to .csv-Files with my other project CSV Transformer. The advantages by using CSV as input and output are the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    SoaBox
    SoaBox lets you create a virtual version of your SOA environment. Clients (WS-Webservices,RESTful, JMS, HTTP, FTP, Filesystem, POJOs/Custom Code), servers (HTTP/Endpoints) and data (XML, SOAP, CSV, etc). It's written in Java and runs with JBoss 5.x and 6.x
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SmartRoot

    SmartRoot

    Semi-automated root image analysis software

    SmartRoot is a semi-automated image analysis software which streamlines the quantification of root growth and architecture for complex root systems. The software combines a vectorial representation of root objects with a powerful tracing algorithm which accommodates to a wide range of image source and quality. The software supports a sampling-based analysis of root system images, in which detailed information is collected on a limited number of roots selected by the user according to specific research requirements. SmartRoot is an operating system independent freeware based on ImageJ and uses cross-platform standards (XML, SQL, Java) for communication with data analysis softwares. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25

    LogDruid

    Generate charts and reports using data gathered in log files

    An application to gather, aggregate, chart and report information originating from any log files. It uses regular expressions that are constructed graphically and can be tested in the application against samples. Once configured for a specific type of log file set, the gathering and display of the chart for a new files set can be done in just one click. Contains a sample template to handle few log types: Java GC log, OpenDS access log, Apache access log
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo