Showing 766 open source projects for "data quality"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Argos Translate

    Argos Translate

    Open-source offline translation library written in Python

    ...This allows for translating between a wide variety of languages at the cost of some loss of translation quality.
    Downloads: 80 This Week
    Last Update:
    See Project
  • 2
    jfDataLogger

    jfDataLogger

    PLC Data Logger

    jfDataLogger ============ jfDataLogger can log data from PLC controllers. Supports Hardware: - Siemens S7-300/400 PLCs - AllenBradley (CompactLogic Controllers only) - ModBus IP protocol - National Instruments DAQmx System Features: - Trigger bit at start - Save/Load data set - Save Image - log to CSV Website: http://jfdatalogger.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Myrtille

    Myrtille

    A native HTML4 / HTML5 Remote Desktop Protocol and SSH client

    Myrtille provides simple and fast access to remote desktops, applications, and SSH servers through a web browser, without any plugin, extension or configuration. Technically, Myrtille is an HTTP(S) to RDP and SSH gateway. User input (keyboard, mouse, touchscreen) is forwarded from a web browser to an HTTP(S) gateway, then up to an RDP (or SSH) client which maintains a session with an RDP (or SSH) server. The display resulting (or not) of such actions is streamed back to the browser, from the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    TemaTres: controlled vocabulary server

    TemaTres: controlled vocabulary server

    Manage, Publish and Share Ontologies, Taxonomies, Thesauri, Glossaries

    Web application for management formal representations of knowledge, thesauri, taxonomies and multilingual vocabularies / Aplicación para la gestión de representaciones formales del conocimiento, tesauros, taxonomías, vocabularios multilingües. For the latest version of code: https://github.com/tematres/TemaTres-Vocabulary-Server
    Downloads: 5 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    JDecisiontable

    JDecisiontable

    Stop to rack your brain, start using process-oriented decision tables!

    This tool is about a method called "process-oriented decision tables". These are decision tables but different from "normal" decision tables. They fix the flaws of traditional decision tables making it a joy to work with. But they keep all benefits of traditional decision tables. First of all they present the logic in a very dense form. 1. Explore all possible combinations of conditions and actions by question-and-answer. 2. Have a compact one-sheet-view on complex logic. ...
    Downloads: 43 This Week
    Last Update:
    See Project
  • 6
    XLM (Cross-lingual Language Model)

    XLM (Cross-lingual Language Model)

    PyTorch original implementation of Cross-lingual Language Model

    XLM (Cross-lingual Language Model) is a family of multilingual pretraining methods that align representations across languages to enable strong zero-shot transfer. It popularized objectives like Masked Language Modeling (MLM) across many languages and Translation Language Modeling (TLM) that jointly trains on parallel sentence pairs to tighten cross-lingual alignment. Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    CodeQL

    CodeQL

    Libraries and queries that power security researchers

    CodeQL is a semantic code analysis engine that treats programs as queryable databases, enabling users to write expressive queries that identify security vulnerabilities, logic bugs, and code quality issues across large codebases. Instead of just pattern matching text, CodeQL ingests source code, builds rich representations of structure and data flow, and allows queries that reason about control flow, type systems, and interprocedural relationships. This makes it powerful not just for basic linting but for deep detection of complex security flaws like SQL injection, cross-site scripting, and taint propagation that traditional static analyzers can miss. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    Nexplorer is a web-based phylogenetic browser, used to view and edit comparative data, and ideal for creating publication-quality views in which the tips of a tree are aligned with the rows of a character matrix (e.g., a sequence alignment).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    code-nav

    code-nav

    Professional programming navigation

    Programming Navigation is not just a website, but a complete ecology with the goal of "helping everyone discover high-quality programming resources and improve learning efficiency". It contains multiple sub-projects, and the technology stack includes React, Java SpringBoot, Tencent Cloud Development, etc., all of which are open source for everyone to learn, so that you can easily develop beautiful information navigation websites! Most of the programming navigation websites are in disrepair...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    ETS Offers iClassicMDM - MDM Software

    ETS Offers iClassicMDM - MDM Software

    iClassicMDM offered by ETS is a Master Data Platform for all.

    ...It has built in Data Modeler, Databases, Data Quality - Cleanse & Match, Data Flow studio & Data store to accelerate the turn around time. Our customers can use the product for evaluation and once they are satisfied, they can reach out to us for pricing before go-live. We are recognized by Gartner since 2016. Contact us at info@etsondemand.com or visit www.etsondemand.com to download the software.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Begin LaTeX in minutes

    Begin LaTeX in minutes

    Brief Intro to LaTeX for beginners that helps you use LaTeX with ease

    Begin LaTeX in minutes is a beginner-friendly guide that teaches the basics of LaTeX through short explanations and practical examples. It introduces LaTeX as a document preparation system for high-quality typesetting, especially for academic, technical, and scientific writing. The project walks readers through setup, the first LaTeX file, document structure, commands, comments, and common formatting patterns. It also covers lists, sections, tables of contents, footnotes, packages, tables,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    OpenAI Glow

    OpenAI Glow

    Copy code in "Glow: Generative Flow with Invertible 1x1 Convolutions"

    Glow is an open source generative model released by OpenAI that demonstrates flow-based generative modeling techniques. Unlike models that rely on approximate inference, Glow uses invertible transformations to directly learn the data distribution, allowing for exact likelihood computation and efficient sampling. The model is capable of producing high-quality synthetic images while maintaining interpretable latent spaces that enable meaningful manipulation of generated outputs. Glow’s architecture is based on reversible layers and efficient flow operations, which allow large-scale training while keeping memory usage manageable. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    CC-Net

    CC-Net

    Tools to download and cleanup Common Crawl data

    cc_net provides tools to download, segment, clean, and filter Common Crawl to build large-scale text corpora, including monolingual datasets and the multilingual CC-100 collection introduced in the associated paper. It includes pipelines to fetch snapshots, extract text, de-duplicate, identify language, and apply quality filtering based on heuristics and language models. The outputs are intended for pretraining language models and for creating standardized corpora that can be reproduced or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    WhyLogs Java Library

    WhyLogs Java Library

    Profile and monitor your ML data pipeline end-to-end

    This is a Java implementation of WhyLogs, with support for Apache Spark integration for large scale datasets. Understanding the properties of data as it moves through applications is essential to keeping your ML/AI pipeline stable and improving your user experience, whether your pipeline is built for production or experimentation. WhyLogs is an open source statistical logging library that allows data science and ML teams to effortlessly profile ML/AI pipelines and applications, producing log...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Sing App Lite

    Sing App Lite

    Free and open-source admin dashboard template built with Bootstrap 4

    Including responsive layout and high-quality UI built with Bootstrap 4.5. This repository contains a limited free version of Sing App. Full bootstrap 4, React with backend, Vue.js, Angular and Angular JS versions are also available and are shipped in a full package. Using this template you will be able to leverage lots of built-in components and thus focus on building features that set you apart from competitors rather than building everything from scratch.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Helpdezk

    Helpdezk

    HelpDEZk is a powerfull software that manages requests/incidents.

    HelpDEZk is a powerfull software that manages requests/incidents. It has all the needed requirements to an efficient workflow management of all processes involved in service execution. This control is done for internal demands and also for outsourced services. HelpDEZk can be used at any company’s area, serving as an support to the shared service center concept, beyond the ability to log all the processes and maintain the request’s history, it can pass it through many approval...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TimeSlotTracker

    TimeSlotTracker

    Simple and useful time tracker

    Simple and useful time tracker. Collects tasks and works (timeslots) in hierarchical tree. Has: reports (based on xslt templates), localization (7 langs), xml data format, ical, jnlp and JIRA (R) support. Runs on Windows, Mac and Linux. The code at GitHub: https://github.com/TimeSlotTracker/timeslottracker-desktop The git on Source forge is read only (replica).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SiliCompressor

    SiliCompressor

    Easy to use Video and Image compression library for Android

    A powerful, flexible, and easy-to-use Video and Image compression library for Android. It's usually said that "A picture is worth a thousand words". Images add flair and beauty to our android apps, but we usually have problems with these images due to their large size. With SiliCompressor you can now compress and use your images more smoothly. Due to the high resolution of our Smartphone cameras and cameras from other devices, Video files have become large in size and thus difficult for them...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    EiffelStudio
    EiffelStudio is an Integrated Development Environment (IDE) that provides comprehensive facilities to programmers. It offers a comprehensive suite of tools that enable programmers to produce correct, reliable, and maintainable software while keeping control of the development process. If you want to create fast, robust, scalable applications, then EiffelStudio™ will offer you a cost-effective solution. Imagine being able to model your system as you think – capturing your requirements and...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 21
    CloudTest-Cloud java unit test framework

    CloudTest-Cloud java unit test framework

    A redefined framework with new approach and methodology for unit test

    CloudTest is a redefined unit testing approach and methodology, which can make your testing jobs become much more easy and efficient. It is a pure java lightweight framework integrated test cases management, test data management, assert management, automation regression, performance monitor and test report in one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Optimus

    Optimus

    Agile Data Preparation Workflows made easy with Pandas

    Easily write code to clean, transform, explore and visualize data using Python. Process using a simple API, making it easy to use for newcomers. More than 100 functions to handle strings, process dates, urls and emails. Easily plot data from any size. Out-of-box functions to explore and fix data quality. Use the same code to process your data in your laptop or in a remote cluster of GPUs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Redux-ORM

    Redux-ORM

    A small, simple and immutable ORM to manage relational data

    A small, simple and immutable ORM to manage relational data in your Redux store. Redux-ORM is fast. All queries are lazily evaluated and come with built-in support for memorization. It provides a clean abstraction over low-level updates, protecting the state from accidental mutations. With nearly 100% branch coverage, the library is thoroughly tested to ensure rock-solid code quality. No matter where your data comes from, Redux-ORM will stay out of its way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    EliteQuant

    EliteQuant

    A list of online resources for quantitative modeling, trading, etc.

    ...Has criteria for recommending projects/resources to help keep quality up.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Jack LIRE (Linked Record)

    Jack LIRE (Linked Record)

    Quantum universal object system with referencing and genetics

    Jack LIRE (Linked Record) is a quantum database-backed life-long text record diary with referencing (linking) between records, and a data library for life-long keeping of files such as documents, photographs, music and more. The diary records are fully searchable. Files can be mentioned in diary records to preserve memories, document knowledge and ideas, and more. Massive numbers of diary records and files can be permanently kept. A database located on one computer can be accessed from...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo