Showing 241 open source projects for "data quality"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    ispc

    ispc

    Intel SPMD Program Compiler

    ispc is a compiler for a variant of the C programming language, with extensions for single programs, and multiple data programming. Under the SPMD model, the programmer writes a program that generally appears to be a regular serial program, though the execution model is actually that a number of program instances execute in parallel on the hardware. ispc compiles a C-based SPMD programming language to run on the SIMD units of CPUs and GPUs; it frequently provides a 3x or more speedup on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    jimmer

    jimmer

    A revolutionary ORM framework for both java and kotlin.

    ...CDC solutions decoupled from specific caching technologies are transparent to business code. Requires no special prior knowledge - veterans of any ORM can quickly and painlessly migrate. The learning curve is scientifically friendly, serving as a high-quality reference for new learners of ORM usage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OPENRNDR

    OPENRNDR

    Kotlin library for creative coding, real-time and interactive graphics

    ...OPENRNDR provides simple, reusable utilities with which creative coders can build robust, fast, and reliable (interactive) applications for prototyping as well as building production-quality software. With ORML you can easily connect to a number of widely used Machine Learning models, such as Facemesh, Posenet, and Stylegan. You can use OPENRNDR to visualize the data coming from these models in order to create compelling (interactive) experiences. The ORML library includes both models and interface code to make the use of those models simple. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    J2ObjC

    J2ObjC

    A Java to iOS Objective-C translation tool and runtime

    J2ObjC is an open-source command-line tool from Google that translates Java source code to Objective-C for the iOS (iPhone/iPad) platform. This tool enables Java source to be part of an iOS application's build, as no editing of the generated files is necessary. The goal is to write an app's non-UI code (such as application logic and data models) in Java, which is then shared by web apps (using GWT), Android apps, and iOS apps. J2ObjC supports most Java language and runtime features required...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    Timber themes

    Timber themes

    Create WordPress themes with OOP code and the Twig template engine

    Timber helps you create fully-customized WordPress themes faster with more sustainable code. With Timber, you write your HTML using the Twig Template Engine separate from your PHP files. This cleans up your theme code so, for example, your PHP file can focus on being the data/logic, while your Twig file can focus 100% on the HTML and display. Once Timber is installed and activated in your plugins directory, it gives any WordPress theme the ability to take advantage of the power of Twig and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    LLM Datasets

    LLM Datasets

    Curated list of datasets and tools for post-training

    ...The repository aims to make datasets easy to inspect and transform, with scripts for downloading, deduping, cleaning, and converting to formats like JSONL that slot into training pipelines. It highlights instruction-tuning and conversation-style corpora while also pointing to code, math, or domain-specific sets for targeted capabilities. Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. Licensing and provenance are surfaced to encourage compliant usage and to guide dataset selection in commercial settings. For practitioners, the repo is a practical “starting pantry” that accelerates experimentation and helps keep data wrangling from dominating the project timeline.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Code Climate CLI

    Code Climate CLI

    Code Climate CLI

    ...Software development teams used to operate in the dark, forced to make decisions based on gut feel and anecdotes, causing friction, creating silos, and leading to mediocre outcomes. Today, forward-thinking engineering leaders leverage data-driven insights to build a culture of trust and high performance. Align on business priorities with improved transparency across every level of your organization, and ensure optimal resource allocation for your engineering teams. Deliver high-quality code quickly and consistently to achieve true Continuous Delivery and out-innovate your competition. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Best-of React Web

    Best-of React Web

    A ranked list of awesome React open-source libraries and tools

    This curated list contains 430 awesome open-source projects with a total of 5M stars grouped into 22 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Software Engineering Blogs

    Software Engineering Blogs

    A curated list of engineering blogs

    Software Engineering Blogs is a curated collection of engineering-focused blog posts and resources aggregated from leading technology companies and developers. The repository serves as a centralized index that helps users discover high-quality technical content across a wide range of topics, including software engineering, system design, infrastructure, and data engineering. It is organized in a structured manner, making it easy to browse and find articles from specific companies or domains of interest. The project is continuously updated by the community, ensuring that the content remains relevant and reflective of current industry practices. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    applied-ml

    applied-ml

    Papers & tech blogs by companies sharing their work on data science

    The applied-ml repository is a rich, curated collection of papers, technical articles, and case-study blog posts about how machine learning (ML) and data-driven systems are applied in real production environments by major companies. Instead of focusing solely on theoretical ML research, this repo highlights industry-scale challenges: data collection, quality, infrastructure, feature stores, model serving, monitoring, scalability, and how ML is embedded in product workflows. It acts as a living library for practitioners who want to learn from real-world successes and failures — giving insight into how large organizations structure their data pipelines, how they manage ML lifecycle at scale, and what architectural or operational tradeoffs they made. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Jailer

    Jailer

    Database Subsetter and Relational Data Browser

    Database Subsetting and Relational Data Browsing Tool. Navigate bidirectionally through the database by following foreign-key-based or user-defined relationships. Exports consistent, referentially intact row-sets from relational databases. Removes data w/o violating integrity. Generates topologically sorted SQL-DML, hierarchically structured XML. and DbUnit datasets.
    Leader badge
    Downloads: 66 This Week
    Last Update:
    See Project
  • 12

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Downloads: 43 This Week
    Last Update:
    See Project
  • 13
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 156 This Week
    Last Update:
    See Project
  • 14
    Hypercube

    Hypercube

    Graph visualizing tool

    Hypercube is a tool for visualizing DOT (graphviz), GML, GraphML, GXL and simple text-based graph representations as SVG and EPS images. Hypercube comes with a Qt based GUI application and a Qt-independent command-line tool. It uses a simulated annealing algorithm to lay out the graph, that can be easily parameterized to achieve the desired look. The main development goals are portability and easy usage rather than high performance and complexity.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15

    Practically Random

    statistical tests & psuedo- random number generators (RNGs, PRNGs)

    Randon number generation & testing. The best suite of statistical tests for fast PRNGs anywhere. Multithreaded for speed, command line tools for automation, no upper limit on data size. Also, a variety of C++ pseudo-random number generators with well designed interfaces aimed at practical uses, not just research.
    Leader badge
    Downloads: 52 This Week
    Last Update:
    See Project
  • 16

    Text File Generator

    Generate large text files for various testing purposes

    Generate large text files with dummy data (i.e. for performance testing an app that needs to read a file). This is a console app. It allows to generate large files (you can specify the size in megabytes). There is a template which you can set to define how a single line should look like, e.g. you can make a CSV file from it. This can contain random words, random numbers and sequential numbers. It has a dictionary of English words which you can replace with your own set. You can...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    SafeUtils

    SafeUtils

    110+ developer tools as native MacOS, Linux & Windows desktop apps.

    Tools: https://safeutils.com/barcode-generator https://safeutils.com/color-picker https://safeutils.com/qr-code-generator https://safeutils.com/qr-code-scanner https://safeutils.com/word-counter https://safeutils.com/base-64-decoder https://safeutils.com/diff-checker https://safeutils.com/hex-to-ascii https://safeutils.com/json-formatter https://safeutils.com/lorem-ipsum-generator https://safeutils.com/random-generator https://safeutils.com/time-converter https://safeutils.com/...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Optimizer_sovkov

    Constructing and optimizing general mathematical and physical models

    We present the package Optimizer, aimed at constructing and optimizing general mathematical models of phenomena of versatile nature. It is written in the Matlab algorithmic language and is executed in the Matlab environment with partial functionality in Octave. The convenient visual interface and the detailed manuals are provided. The main benefit of the package is its capability to construct models of any level of complexity in a block-by-block manner. Elementary model blocks can be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Hoodland Toolbox

    Hoodland Toolbox

    An assorted miscellany of tools and toys.

    Hoodland Toolbox (formerly known as the Rockabilly Common Libraries) An assorted miscellany of tools and toys used by all other Hoodland Open Source Projects (formerly Rockabilly Software). This project has been rewritten in Kotlin for the JVM. Its new home is here: https://github.com/william-hood/toolbox-kotlin Available in Maven Central Repository... Kotlin: https://central.sonatype.com/artifact/io.github.william-hood/toolbox-kotlin Java:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Koarse Grind (formerly Coarse Grind)

    Koarse Grind (formerly Coarse Grind)

    Test Automation framework intended for coarse-grained testing

    Koarse Grind has been rewritten in Kotlin (hence the 'K'). It's new home is here: https://github.com/william-hood/koarse-grind-kotlin __________________________________ Test Automation framework for Kotlin/Java with an integrated web-based user interface. It is intended for larger-grained testing of an installed & configured product, rather than unit testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Fronsetia

    Fronsetia

    A web application for testing webservices

    Fronsetia (Free Online Service Testing Application) is a Java web application that allows testing webservices (REST and SOAP). No end-user installations required, just deploy the application on your server and pass the link so that users can use their usual browsers to access Fronsetia. You can think of Fronsetia as a simple version of SoapUI on the web, or a simple and free and open-source version of the "wls_utc" utility. See the project homepage https://fronsetia.sourceforge.io and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Neiki's Gallery

    Neiki's Gallery

    Vanilla JavaScript image gallery & lightbox

    Neiki's Gallery is a lightweight, production-ready image gallery and lightbox library built with vanilla JavaScript and CSS. It requires no dependencies and can be integrated with a single <script> tag, with automatic initialization out of the box. It provides a highly customizable experience for modern web projects, combining performance, flexibility, and rich UI interactions. Designed for both developers and end users, it supports responsive layouts, advanced lightbox features, touch...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    chimp

    chimp

    Tooling that helps you do quality, faster

    Your Apollo GraphQL development companion for doing quality, faster. Chimp helps you write high-quality code from the get-go. No more putting tests and quality as an after-thought. Quality first, speed for free. Boilerplate is time-consuming, error-prone and boring! Chimp reduces that through its various generators and smart defaults. Modularity leads to maintainable and testable code, and this is a key feature of all Chimp's domain-driven and data-driven generators. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    GitLab CE Server For Local Intranets

    GitLab CE Server For Local Intranets

    The Free & Popular Community git Server in a Complete Virtual Machine

    This VM is created for 2 reasons: 1. Very little initial setup work required to make a GIT Server live, within minutes. 2. This system should keep running for Years, without requiring Updates / Breakages. If you are new to Virtual Machines, then please watch the Video below ( taken from my other project. just replace td with gi wherever mentioned ) After starting this VM, please login to its administration panel with: Website Address: https://gi.local/ ( Accept Any Warnings due to...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    fastMRI

    fastMRI

    A large open dataset + tools to speed up MRI scans using ML

    fastMRI is a large-scale collaborative research project by Facebook AI Research (FAIR) and NYU Langone Health that explores how deep learning can accelerate magnetic resonance imaging (MRI) acquisition without compromising image quality. By enabling reconstruction of high-fidelity MR images from significantly fewer measurements, fastMRI aims to make MRI scanning faster, cheaper, and more accessible in clinical settings. The repository provides an open-source PyTorch framework with data loaders, subsampling utilities, reconstruction models, and evaluation metrics, supporting both research reproducibility and practical experimentation. ...
    Downloads: 1 This Week
    Last Update:
    See Project
Auth0 Logo