Showing 19 open source projects for "data quality"

  • 1
    Cookiecutter Data Science

    Project structure for doing and sharing data science work

    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. When we think about data analysis, we often think just about the resulting reports, insights, or visualizations. While these end products are generally the main event, it's easy to focus on making the products look nice and ignore the quality of the code that generates them. Because these end products are created programmatically, code quality is still important! ...
    Downloads: 0 This Week
  • 2
    Pandas Profiling

    Create HTML profiling reports from pandas DataFrame objects

    ...Mostly global details about the dataset (number of records, number of variables, overall missingness and duplicates, memory footprint). Comprehensive and automatic list of potential data quality issues (high correlation, skewness, uniformity, zeros, missing values, constant values, among others).
    Downloads: 0 This Week
  • 3
    Apache InLong

    Apache InLong - a one-stop integration framework for massive data

    ...InLong was originally built at Tencent, where it has served online businesses for more than 8 years, supporting massive data reporting services (more than 80 trillion pieces of data per day) in big data scenarios. The platform integrates 5 modules: Ingestion, Convergence, Caching, Sorting, and Management, so that the business only needs to provide data sources, data service quality, data landing clusters, and data landing formats.
    Downloads: 0 This Week
  • 4
    awesome-single-cell

    Community-curated list of software packages and data resources

    Community-curated list of software packages (and the people developing these methods) and data resources for single-cell data analysis, including RNA-seq, ATAC-seq, etc. Rapid, accurate and memory-frugal preprocessing of single-cell and single-nucleus RNA-seq data. Find bimodal, unimodal, and multimodal features in your data.
    Downloads: 0 This Week
  • 5
    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 12 This Week
  • 6
    SafeUtils

    110+ developer tools as native macOS, Linux & Windows desktop apps.

    Tools:
    https://safeutils.com/barcode-generator
    https://safeutils.com/color-picker
    https://safeutils.com/qr-code-generator
    https://safeutils.com/qr-code-scanner
    https://safeutils.com/word-counter
    https://safeutils.com/base-64-decoder
    https://safeutils.com/diff-checker
    https://safeutils.com/hex-to-ascii
    https://safeutils.com/json-formatter
    https://safeutils.com/lorem-ipsum-generator
    https://safeutils.com/random-generator
    https://safeutils.com/time-converter
    https://safeutils.com/...
    Downloads: 0 This Week
  • 7

    OptiMIClass

    Automatic n-dimensional data clustering tool powered by five advanced

    This software is an automatic n-dimensional quantitative data clustering tool based on five optimization heuristics (GA, PSO, ACO, SA, and TS). It enables performance comparisons across heuristics and parameter testing within each method, analyzing solution quality, runtime, and convergence; it also includes a repository of benchmark test tables and an intuitive interface for research and operational use.
    Downloads: 0 This Week
  • 8

    EZStacking

    EZStacking is a Jupyter notebook generator for machine learning

    EZStacking is a Jupyter notebook generator for supervised learning problems using Scikit-Learn pipelines and stacked generalization. EZStacking handles classification and regression problems for structured data. It can also be viewed as a development tool, because a notebook generated with EZStacking contains: an exploratory data analysis (EDA) used to assess data quality; a modelling step producing a reduced-size stacked estimator; and a server returning a prediction, a measure of the quality of the input data, and the execution time.
    Downloads: 0 This Week
  • 9
    SciDAVis is a user-friendly data analysis and visualization program primarily aimed at high-quality plotting of scientific data. It strives to combine an intuitive, easy-to-use graphical user interface with powerful features such as Python scriptability.
    Downloads: 1,271 This Week
  • 10

    MS-Helios

    MS-Helios: A Circos wrapper to visualize multi-omic datasets

    ...MS-Helios automatically generates data and configuration files to create high-quality, publishable circular plots with Circos.
    Downloads: 0 This Week
  • 11
    AlphaPlot

    Interactive scientific graphing and data analysis software.

    AlphaPlot can generate different types of 2D and 3D plots (such as line, scatter, bar, pie, and surface plots) from data that is either imported from ASCII files, entered by hand, or calculated using formulas. The data is held in spreadsheets, which are referred to as tables with column-based data (typically X and Y values for 2D plots) or matrices (for 3D plots). The spreadsheets, as well as graphs and note windows, are gathered in a project and can be organized using folders. The built-in...
    Downloads: 39 This Week
  • 12
    QtiPlot
    QtiPlot is a user-friendly, platform-independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 21 This Week
  • 13
    QuickRNASeq

    A pipeline for large scale RNA-seq data analysis

    We have implemented QuickRNASeq, an open-source pipeline for large scale RNA-seq data analysis. QuickRNASeq combines parallel computing resources, a careful selection of previously published algorithms for RNA-seq read mapping, counting and quality control, and a three-stage strategy to build a fully automated workflow. We also implemented built-in functionalities to detect sample swapping or mislabeling in large-scale RNA-seq studies.
    Downloads: 0 This Week
  • 14
    SmartRoot

    Semi-automated root image analysis software

    SmartRoot is semi-automated image analysis software that streamlines the quantification of root growth and architecture for complex root systems. The software combines a vectorial representation of root objects with a powerful tracing algorithm that accommodates a wide range of image sources and quality. The software supports a sampling-based analysis of root system images, in which detailed information is collected on a limited number of roots selected by the user according to specific research requirements. SmartRoot is operating-system-independent freeware based on ImageJ and uses cross-platform standards (XML, SQL, Java) for communication with data analysis software. ...
    Downloads: 4 This Week
  • 15

    PRADA

    PRADA: Pipeline for RNA-Sequencing Data Analysis

    Massively parallel sequencing of cDNA reverse transcribed from RNA (RNASeq) provides an accurate estimate of the quantity and composition of mRNAs. To characterize the transcriptome through the analysis of RNA-seq data, we developed PRADA. PRADA focuses on the processing and analysis of gene expression estimates, supervised and unsupervised gene fusion identification, and supervised intragenic deletion identification. PRADA currently supports 7 modules to process and identify abnormalities from RNAseq data: preprocess: Generates aligned and recalibrated BAM files. ...
    Downloads: 0 This Week
  • 16
    Import fixed width files into MySQL, manipulate them, and export back to fixed width. Useful for health data analysis or *whatever*. Includes Thomson Reuters Care Discovery 3000 data file definition. Runs on Windows, OS X, and Linux. Requires PHP 5.3.x and MySQL 5.1.x.
    Downloads: 0 This Week
  • 17
    Logbus-ng consists of a set of tools that help developers perform log analysis at every stage: log generation, collection, distribution, storage, and analysis. It is designed specifically for Field Failure Data Analysis in critical distributed systems.
    Downloads: 0 This Week
  • 18
    EDAS2 is a web-based environmental quality data management and decision support framework
    Downloads: 0 This Week
  • 19
    An end-to-end solution for medical management including dispatching medical units, employee management, medical billing, and data analysis.
    Downloads: 0 This Week