Showing 58 open source projects for "data quality"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    MentDB Projects

    MentDB Projects

    Generalized Interoperability and Strong AI

    MentDB is an open-source platform driving research into next-generation AI and universal data exchange. Our architecture is built around the revolutionary Mentalese Query Language (MQL). MentDB Weak (Generalized Interoperability): A unified data layer enabling seamless data exchange and application integration (SOA, ETL, Data Quality). We eliminate data silos through a single, generalized data language. MentDB Strong (Strong AI / AGI): The framework for exploring and building Machine Consciousness, free will, and advanced ethical reasoning systems. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 4
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 162 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    x3d

    x3d

    X3D is the open-standard format for 3D graphics scenes on the Web.

    Extensible 3D (X3D) Graphics is a royalty-free International Standard for real-time interactive 3D graphics on the Web, providing unsurpassed interoperability for 3D communications on the Web. This project includes source for example X3D scene libraries and multiple X3D codebases produced by Web3D Consortium members. All open-source contributions are welcome.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Fronsetia

    Fronsetia

    A web application for testing webservices

    Fronsetia (Free Online Service Testing Application) is a Java web application that allows testing webservices (REST and SOAP). No end-user installations required, just deploy the application on your server and pass the link so that users can use their usual browsers to access Fronsetia. You can think of Fronsetia as a simple version of SoapUI on the web, or a simple and free and open-source version of the "wls_utc" utility. See the project homepage https://fronsetia.sourceforge.io and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    FilteredPush
    Network software for annotation and quality control of distributed data. Supported by NSF: DBI 0646266 and NSF: DBI 0960535. http://wiki.filteredpush.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CSV Comparator

    CSV Comparator

    CSV Comparator is a tool for comparing files which contain CSV-Data.

    CSV Comparator is a tool for comparing files which contain CSV-Data. It was created for performance engineers to be able to have a easy way to compare data from two testruns and be able to automate the comparing process of various files with scripts. There is a command line tool as well as a user interface. In case you need to compare .xml-Files you can convert them to .csv-Files with my other project CSV Transformer. The advantages by using CSV as input and output are the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Restful APIs for Data Cleansing

    Restful APIs for Data Cleansing

    This is sister project for osDQ which provide Restful APIs

    (Beta Version) This is sister project for https://sourceforge.net/projects/dataquality/ . It provides Restful APIs for features for data quality and data preparation features. This project will help projects which want embed data quality and data preparation features in their project or UI using restful calls. Data Cleansing APIs Dockerfile: # Pull base image FROM frnde/jetty-9.4.2-jre8-alpine-cet ADD osdq-v0.0.1.war /var/lib/jetty/webapps/osdq.war EXPOSE 8080 Docker Image https://hub.docker.com/r/vreddym/osdq-web/tags
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    MsgViewer
    MsgViewer is email-viewer utility for .msg e-mail messages, implemented in pure Java. MsgViewer works on Windows/Linux/Mac Platforms. Also provides a java api to read mail messges (msg files) programmatically.
    Downloads: 127 This Week
    Last Update:
    See Project
  • 11
    A Transport Stream analyser for the Brazilian D-TV system (SBTVD / ISDB-Tb). It shows in a GUI the SI/PSI structure of the stream in a tree-view, bitrate statistics for each ES, specific Closed Caption, EPG and DSMCC carroussel decoding plus more.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    GenerateData
    GenerateData is a general purpose data generation engine. No plug-ins, no APIs, just data generation made easy. From single files, to referentially sound databases, point, click, tweak and generate.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Database to CSV

    "Database to CSV" project stores/loads data between database and CSVs

    "Database to CSV" allows to back-up data from database(Oracle, MySQL, SQL Server) in the form of CSV files. Each table data is stored in a separate CSV file. It also allows to load data from CSV files into the database. Program can be run from UI or from command line. There's one more advanced feature which allows merging of data from CSV file to Database. It is only available from command line. (It will take foreign keys and unique keys into consideration and then try to insert...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MailProbe
    E-Mail monitoring, benchmarking and load testing tool with integrated web UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Toolsverse ETL Framework

    Toolsverse ETL Framework

    Open source Extract Transform Load engine written in Java

    ...It includes executables for all major platforms and can be easily integrated into other applications. Key Features: * embeddable, open source and free * fast and scalable * uses target database features to do transformations and loads * manual and automatic data mapping * data streaming * bulk data loads * data quality features using SQL, JavaScript? and regex * data transformations Requirements * Java 1.6 and up * At least 4 MB of RAM New in 3.2 (01/18/2013) * Improved auto-update functionality * Bug fixes
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    tap4j

    tap4j

    Test Anything Protocol for Java

    tap4J is a Java implementation of the Test Anything Protocol. This protocol lets you standardize the output of your test execution. This way a producer outputs a TAP result file that can be read by a TAP consumer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    BrowserMob JavaScript Validator VNC
    VNC for use with the BrowserMob Selenium JavaScript Validator. This tool is made available for users of BrowserMob FREE Website Monitoring and Load Testing. The BrowserMob Local Validation Service can be downloaded from https://browsermob.com/tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SAADA transforms a set of heterogeneous astronomical data FITS files in a powerfull database deployed on the Web and including VO features. The database is located on your host and keeps under your full control. This job doesn't require to write cod
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A tool kit encoding XML schema Naming and Design Rules in Schematron and other computer-interpretable languages, enabling automated rule enforcement and improving schema quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project aims to port ExTeX, a high quality typesetting system developed for UNIX operating system to Macintosh OS X using Fink, a package management system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The project Jaulp is jet another utility library project written in Java. It contains utility classes for Date ,Calendar, Collections, Resources, Files, IO for Random data, and many more. This is the last version for this project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SAIM allows to interlink knowledge bases in the Semantic Web. It focuses on instance matching of very large knowledge bases available as SPARQL endpoints. SAIM uses machine learning techniques and is compatible with SILK.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    J2J provides an intuitive way to convert Java to JSON and JSON to Java. Annotate any Java class then use JsonWriter to emit JSON from java or JsonReader to convert JSON to java objects. (source code and tutorial included) Version 2.0 in development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This is a web framework, based on XML and XSL technology. Support for the incremental development with plug-in, support plug and play.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo