Showing 292 open source projects for "data quality"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 1
    ispc

    ispc

    Intel SPMD Program Compiler

    ispc is a compiler for a variant of the C programming language, with extensions for single programs, and multiple data programming. Under the SPMD model, the programmer writes a program that generally appears to be a regular serial program, though the execution model is actually that a number of program instances execute in parallel on the hardware. ispc compiles a C-based SPMD programming language to run on the SIMD units of CPUs and GPUs; it frequently provides a 3x or more speedup on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    jimmer

    jimmer

    A revolutionary ORM framework for both java and kotlin.

    ...CDC solutions decoupled from specific caching technologies are transparent to business code. Requires no special prior knowledge - veterans of any ORM can quickly and painlessly migrate. The learning curve is scientifically friendly, serving as a high-quality reference for new learners of ORM usage.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OPENRNDR

    OPENRNDR

    Kotlin library for creative coding, real-time and interactive graphics

    ...OPENRNDR provides simple, reusable utilities with which creative coders can build robust, fast, and reliable (interactive) applications for prototyping as well as building production-quality software. With ORML you can easily connect to a number of widely used Machine Learning models, such as Facemesh, Posenet, and Stylegan. You can use OPENRNDR to visualize the data coming from these models in order to create compelling (interactive) experiences. The ORML library includes both models and interface code to make the use of those models simple. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Timber themes

    Timber themes

    Create WordPress themes with OOP code and the Twig template engine

    Timber helps you create fully-customized WordPress themes faster with more sustainable code. With Timber, you write your HTML using the Twig Template Engine separate from your PHP files. This cleans up your theme code so, for example, your PHP file can focus on being the data/logic, while your Twig file can focus 100% on the HTML and display. Once Timber is installed and activated in your plugins directory, it gives any WordPress theme the ability to take advantage of the power of Twig and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    LLM Datasets

    LLM Datasets

    Curated list of datasets and tools for post-training

    ...The repository aims to make datasets easy to inspect and transform, with scripts for downloading, deduping, cleaning, and converting to formats like JSONL that slot into training pipelines. It highlights instruction-tuning and conversation-style corpora while also pointing to code, math, or domain-specific sets for targeted capabilities. Quality is a recurring theme: examples and utilities help filter low-value samples, enforce length limits, and split train/validation consistently so results are comparable. Licensing and provenance are surfaced to encourage compliant usage and to guide dataset selection in commercial settings. For practitioners, the repo is a practical “starting pantry” that accelerates experimentation and helps keep data wrangling from dominating the project timeline.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Best-of React Web

    Best-of React Web

    A ranked list of awesome React open-source libraries and tools

    This curated list contains 430 awesome open-source projects with a total of 5M stars grouped into 22 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Software Engineering Blogs

    Software Engineering Blogs

    A curated list of engineering blogs

    Software Engineering Blogs is a curated collection of engineering-focused blog posts and resources aggregated from leading technology companies and developers. The repository serves as a centralized index that helps users discover high-quality technical content across a wide range of topics, including software engineering, system design, infrastructure, and data engineering. It is organized in a structured manner, making it easy to browse and find articles from specific companies or domains of interest. The project is continuously updated by the community, ensuring that the content remains relevant and reflective of current industry practices. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    applied-ml

    applied-ml

    Papers & tech blogs by companies sharing their work on data science

    The applied-ml repository is a rich, curated collection of papers, technical articles, and case-study blog posts about how machine learning (ML) and data-driven systems are applied in real production environments by major companies. Instead of focusing solely on theoretical ML research, this repo highlights industry-scale challenges: data collection, quality, infrastructure, feature stores, model serving, monitoring, scalability, and how ML is embedded in product workflows. It acts as a living library for practitioners who want to learn from real-world successes and failures — giving insight into how large organizations structure their data pipelines, how they manage ML lifecycle at scale, and what architectural or operational tradeoffs they made. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Jailer

    Jailer

    Database Subsetter and Relational Data Browser

    Database Subsetting and Relational Data Browsing Tool. Navigate bidirectionally through the database by following foreign-key-based or user-defined relationships. Exports consistent, referentially intact row-sets from relational databases. Removes data w/o violating integrity. Generates topologically sorted SQL-DML, hierarchically structured XML. and DbUnit datasets.
    Leader badge
    Downloads: 66 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10

    RecordEditor

    Editor for Fixed Width, Csv and Existing Xml files.

    The RecordEditor is a Data File editor for Flat Files (delimited and fixed field position). It supports Unix / PC / Legacy (e.g. Mainframe) file formats, both Text and binary files. The Editor uses a Record-Layout description to format the files. This is ideal for Fixed width (Text or Binary) files, Cobol Data Files, Mainframe files and complicated Csv files. Cobol Copybooks can be used to format Cobol Data files. As well as an editor, The following utilities are supplied * Formatted...
    Downloads: 43 This Week
    Last Update:
    See Project
  • 11
    TextTest is an application-independent tool for text-based functional testing. This means running a batch-mode binary in lots of different ways, and using the text output produced as a means of controlling the behaviour of that application.
    Leader badge
    Downloads: 53 This Week
    Last Update:
    See Project
  • 12
    Grassroots DICOM

    Grassroots DICOM

    Cross-platform DICOM implementation

    Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntax. It comes with a super fast scanner implementation to quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificates and password based mecanism to anonymize and de-identify DICOM datasets.
    Leader badge
    Downloads: 156 This Week
    Last Update:
    See Project
  • 13
    Hypercube

    Hypercube

    Graph visualizing tool

    Hypercube is a tool for visualizing DOT (graphviz), GML, GraphML, GXL and simple text-based graph representations as SVG and EPS images. Hypercube comes with a Qt based GUI application and a Qt-independent command-line tool. It uses a simulated annealing algorithm to lay out the graph, that can be easily parameterized to achieve the desired look. The main development goals are portability and easy usage rather than high performance and complexity.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14

    Practically Random

    statistical tests & psuedo- random number generators (RNGs, PRNGs)

    Randon number generation & testing. The best suite of statistical tests for fast PRNGs anywhere. Multithreaded for speed, command line tools for automation, no upper limit on data size. Also, a variety of C++ pseudo-random number generators with well designed interfaces aimed at practical uses, not just research.
    Leader badge
    Downloads: 52 This Week
    Last Update:
    See Project
  • 15
    Gerber2PDF

    Gerber2PDF

    Gerber to PDF converter

    Gerber2PDF is a command-line tool to convert Gerber files to PDF for proofing and hobbyist printing purposes. It converts multiple Gerber files at once, placing the resulting layers each on it's own page within the PDF. Each layer has a PDF bookmark for easy reference. Layers can optionally be combined onto a single page and rendered with custom colours and transparency. There is a Drill to Gerber converter available from the downloads page.
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 16
    iSphere

    iSphere

    The iSphere Project for and RDi 9.5.1.3+

    The iSphere Source Code has been moved to GitHub (https://github.com/rdi-open-source/isphere-plugin) on January 3rd, 2024. Important: The update site and ticket management has been moved to GitHub as well. iSphere is an open source plug-in for IBM's Rational Developer for i 9.5.1.3+. It delivers high quality extensions to improve developer productivity. IBM's current Eclipse based Integrated Development Environment (IDE) is a huge step beyond SEU, but it still lacks features...
    Leader badge
    Downloads: 158 This Week
    Last Update:
    See Project
  • 17

    Text File Generator

    Generate large text files for various testing purposes

    Generate large text files with dummy data (i.e. for performance testing an app that needs to read a file). This is a console app. It allows to generate large files (you can specify the size in megabytes). There is a template which you can set to define how a single line should look like, e.g. you can make a CSV file from it. This can contain random words, random numbers and sequential numbers. It has a dictionary of English words which you can replace with your own set. You can...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Cinemagoer

    Cinemagoer

    Python package to retrieve and manage data of the IMDb

    Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies. Platform-independent, it can retrieve data from both the IMDb's web server and a local copy of the whole db.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    SafeUtils

    SafeUtils

    110+ developer tools as native MacOS, Linux & Windows desktop apps.

    Tools: https://safeutils.com/barcode-generator https://safeutils.com/color-picker https://safeutils.com/qr-code-generator https://safeutils.com/qr-code-scanner https://safeutils.com/word-counter https://safeutils.com/base-64-decoder https://safeutils.com/diff-checker https://safeutils.com/hex-to-ascii https://safeutils.com/json-formatter https://safeutils.com/lorem-ipsum-generator https://safeutils.com/random-generator https://safeutils.com/time-converter https://safeutils.com/...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Muse: Middleware Universal Scripting idE

    Muse: Middleware Universal Scripting idE

    Automate: WebSphere; WebLogic; JBoss; Glassfish; Tomcat; Linux, WinRM

    Simplify... Aggregate... Automate... Simplify... *** OPEN SOURCE - GPL3/EPL. Use Python / Jython to automate WebSphere, WebLogic, JBoss, Glassfish and Tomcat Middleware Estates over JMX, both SSL and non-SSL + Linux SSH (agent-less) + WinRM Target all 5 servers, Linux and WinRM from the same workspace. Familiar Eclipse based Jython and Python Development IDE, pre-configured and ready to go. 4-Click Installer. Win x64, Linux WINE x64. Built-In JVM. Java 8/9/10, Amazon...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Bootsmann

    Bootsmann

    A minimalistic, lean and fast HTTP REST API test application

    Bootsmann is an opensource C++ alternative to Postman/Insomnia/Hoppscotch for simple HTTP REST API testing. It is especially useful for embedded devices and local development, where low memory usage and fast response times are crucial.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Optimizer_sovkov

    Constructing and optimizing general mathematical and physical models

    We present the package Optimizer, aimed at constructing and optimizing general mathematical models of phenomena of versatile nature. It is written in the Matlab algorithmic language and is executed in the Matlab environment with partial functionality in Octave. The convenient visual interface and the detailed manuals are provided. The main benefit of the package is its capability to construct models of any level of complexity in a block-by-block manner. Elementary model blocks can be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Hoodland Toolbox

    Hoodland Toolbox

    An assorted miscellany of tools and toys.

    Hoodland Toolbox (formerly known as the Rockabilly Common Libraries) An assorted miscellany of tools and toys used by all other Hoodland Open Source Projects (formerly Rockabilly Software). This project has been rewritten in Kotlin for the JVM. Its new home is here: https://github.com/william-hood/toolbox-kotlin Available in Maven Central Repository... Kotlin: https://central.sonatype.com/artifact/io.github.william-hood/toolbox-kotlin Java:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Koarse Grind (formerly Coarse Grind)

    Koarse Grind (formerly Coarse Grind)

    Test Automation framework intended for coarse-grained testing

    Koarse Grind has been rewritten in Kotlin (hence the 'K'). It's new home is here: https://github.com/william-hood/koarse-grind-kotlin __________________________________ Test Automation framework for Kotlin/Java with an integrated web-based user interface. It is intended for larger-grained testing of an installed & configured product, rather than unit testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Fronsetia

    Fronsetia

    A web application for testing webservices

    Fronsetia (Free Online Service Testing Application) is a Java web application that allows testing webservices (REST and SOAP). No end-user installations required, just deploy the application on your server and pass the link so that users can use their usual browsers to access Fronsetia. You can think of Fronsetia as a simple version of SoapUI on the web, or a simple and free and open-source version of the "wls_utc" utility. See the project homepage https://fronsetia.sourceforge.io and...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo