Showing 888 open source projects for "data quality"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    metawatt

    metawatt

    Binner for assembled metagenomes

    The Metawatt binner is a graphical binning tool that makes use of multivariate statistics of tetranucleotide frequencies and differential coverage based binning. It also performs taxonomic assessment of binning quality (via diamond BLASTx). Created bins can be edited and exported as fasta. The Metawatt is implemented in Java SWING and minimally depends on Diamond, HMMer3.1, BBMap, Prodigal and the Batik library for the export of SVG graphics. Citation: Strous M, Kraft B, Bisdorf R,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Spawner is a generator of sample/test data for databases. It can be configured to output delimited text or SQL insert statements. It can also insert directly into a MySQL 5.x database. Includes many field types, most of which are configurable.
    Leader badge
    Downloads: 91 This Week
    Last Update:
    See Project
  • 3
    Priority Estimation Tool (AHP)

    Priority Estimation Tool (AHP)

    PriEsT is a decision making tool for Analytic Hierarchy Process (AHP).

    Priorty Estimation Tool (PriEsT) is a decision analysis tool. You can use it for ranking the options you have, or alternatively, you may use it for resource allocation (budgeting) problems. In PriEsT, you enter a list of available options and then define your criteria for prioritization. After defining criteria, PriEsT allows you to enter your judgements against each criterion, which are then used to calculate the final ranking (or weights). Please cite this if you find it...
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4

    ngopt

    de novo assembly & analysis of Illumina sequence data

    de novo assembly & analysis of Illumina sequence data, including the A5 pipeline, A5-miseq, tools to evaluate assembly quality, and scripts to facilitate data submission to NCBI and the RAST annotation system
    Downloads: 6 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5

    P3BSseq

    Parallel processing pipeline for analysis of bisulfite sequencing data

    Bisulfite sequencing (BSseq) processing is among the most cumbersome next generation sequencing (NGS) applications. Though some BSseq processing tools are available, they are scattered, require puzzling parameters and are running-time and memory-usage demanding. We have developed P3BSseq, a parallel processing pipeline for fast, accurate and automatic analysis of BSseq reads that trims, aligns, annotates, records the intermediate results, performs bisulfite conversion quality assessment,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    JSensor

    high performance computing, Wireless Sensors Networks Simulators

    ...We must use simulators to achieve comfortable confidence levels before real tests. This way, we develop simulation models where entities, named sensors, collect environment data, process data, communicate, move around the space and a lot more. JSensor is a sensor network simulator implemented at Computer Science Department of Federal University of Ouro Preto (DECOM-UFOP), Brazil. In contrast to JSensor counterparts, it is designed to simulate millions of sensors in multicore and, in near future, multicomputer architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DemuxFS is a live filesystem to aid on the analysis of transport streams in digital TV systems such as the SBTVD (ISDB-T), DVB and ATSC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Speechalyzer

    Speechalyzer

    Process large speech data wrt transcription, labeling and annotation

    Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9

    User Extensible Dictionary

    Creation of crowdsourcing dictionary using XDXF format.

    ...Dictionary offers standard user interface with possibility to create user-specific translation and their sharing. By sharing user dictionaries the scope and possibilities of the dictionary is further expanded, including the possibility to improve the quality of its content. Administrator is needed for quality check. With this crowdsourcing is used. Web server was created in language C# with technology .Net. Dictionary data is saved in XML with XDXF formatting, which allows obtaining or updating the content of dictionary data. Basic English-Czech dictionary is included. This dictionary can be expanded further.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10

    ClinQC

    ClinQC: A tool for quality control of Sanger and NGS data in clinic

    ClinQC is an integrated and user-friendly pipeline for quality control, filtering and trimming of Sanger and NGS sequencing data for hundred to thousands of samples/patients in a single run in clinical research. It can analyze raw sequencing data and produces unified output as FASTQ files per sample/patient with Sanger quality encoding. First, ClinQC convert input read files from their native formats to a common FASTQ format and remove adapters, and PCR primers. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    MutAid

    MutAid: Sanger and NGS based pipeline for mutation screening.

    MutAid: Sanger and NGS based integrated pipeline for mutation identification, validation and annotation in molecular diagnosis. MutAid is an integrated pipeline for mutation screening in clinical research. It can analyze Sanger sequencing and NGS data from raw reads to list of annotated mutation list. MutAid can analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and co-analyze Sanger and NGS data under a single platform. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    CanReg5 (moved to Github)

    CanReg5 (moved to Github)

    Canreg5 is a software package for population based cancer registries

    ...It has modules to do: data entry, quality control, consistency checks and basic analysis of the data It was designed with an emphasis on user friendliness, it has a modern user interface and is easy to navigate. Is available in several languages. (English, French, Spanish, Portuguese, Russian, Turkish, Georgian, and Chinese.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Restful APIs for Data Cleansing

    Restful APIs for Data Cleansing

    This is sister project for osDQ which provide Restful APIs

    (Beta Version) This is sister project for https://sourceforge.net/projects/dataquality/ . It provides Restful APIs for features for data quality and data preparation features. This project will help projects which want embed data quality and data preparation features in their project or UI using restful calls. Data Cleansing APIs Dockerfile: # Pull base image FROM frnde/jetty-9.4.2-jre8-alpine-cet ADD osdq-v0.0.1.war /var/lib/jetty/webapps/osdq.war EXPOSE 8080 Docker Image https://hub.docker.com/r/vreddym/osdq-web/tags
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    iCAS - An Illumina Clone Assembly System

    An Illumina clone assembly system using SOAPdenovo and ABySS

    Clone-by-clone sequencing, as a means of achieving high quality assemblies for large and complex genomes, continues to be of great relevance in the era of high throughput sequencing. However, assemblies obtained using current whole genome assemblers are often fragmented and sometimes have issues of genome completeness owing to different data characteristics introduced by multiplexed sequencing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    CSV Transformer

    CSV Transformer

    transforms xml to csv

    The CSV Transformer is a data processing tool which transforms .xml-Files to comma separated values. The CSV Transformer was created in a load and performance testing project, the use case was to be able to transform 2800 configuration files of a big banking application to a single .csv-File. With this single file it was possible to compare the whole configuration between two releases with the already available tool CSV Comparator. This enabled the team to verify if there were any...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    PyPES Library

    Library of analytic potential energy surfaces

    The PyPES library is a python-based library of high quality semi-global potential energy surfaces for 50 molecules, each containing 3-6 atoms. The PyPES code enables the generation of energy derivatives to 6th order about any point on the potential energy surface in a range of common coordinate systems, including curvilinear internal, Cartesian and normal mode coordinates. For portability, FORTRAN, C, MATLAB and Mathematica wrappers are provided to interface with PyPES, reading in PyPES-generated Cartesian and normal mode derivative data via a text interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DBpedia Spotlight

    DBpedia Spotlight

    DBpedia Spotlight is a tool for automatically annotating

    It is a tool for automatically annotating mentions of DBpedia resources in text, providing a solution for linking unstructured information sources to the Linked Open Data cloud through DBpedia. With a four step approach, DBpedia Spotlight performs named entity extraction, including entity detection and name resolution. It can also be used for named entity recognition, amongst other information extraction tasks. Empower the user experience reusing, interlinking and making semantic queries among high-quality open datasets, extracting meaning from unstructured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SQL!kE - PHP, SQL JS - Database Explore

    SQL!kE - PHP, SQL JS - Database Explore

    SQL!ke is a PHP mySQL RD Database Editing & Explorer

    SQL!ke is a new general project for mastering data effectively stored in a database using PHP and Javascript . This RD Explorer works with existing Databases with Embed SQL Query editor and your Custom Table Views. Keep track of this fresh project as it will include a simple blueprint to manipulate en view interconnected data entities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Ava: Testdata Xsl

    Ava: Testdata Xsl

    generates Testdata on base of excel: creates xml,excel,csv,html,sql,+

    this tool for test-data-generation receives an 'excel-sheet' as primary input. second important paramter is the 'number of test-records to produce'. The excel-data will be reused as long data is needed. This tool is hightly paramatrisazable by the use of 'xsl scripts'. data can be created, updated, modified and finally exported in a format of your choice Main Fuctions: (1) Generates Testdata (excel, xsl, xml) (2) Exports generated testdata in multiple formats (csv, excel, html,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    EPRI Open PQ Dashboard

    EPRI Open PQ Dashboard

    Demos new techniques for extracting information from PQ data files

    ...This version consist of a few proof-of-concept applications of applying event severity and trend values to heatmap displays—giving the PQ engineers a wide-area status of PQ for quick interpretation. Data quality has been added so users can quickly see when meters are providing incomplete or invalid data. This dashboard currently accepts power quality data from COMTRADE and PQDIF standard file formats. Other proprietary software interfaces have been added. See the installation manual for more details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    ncPRO-seq

    ncPRO-seq

    Non-Coding RNA PROfiling from sRNA-seq

    ncPRO-seq is a tool for annotation and profiling of ncRNAs from smallRNA sequencing data. It aims to interrogate and perform detailed analysis on small RNAs derived from annotated non-coding regions in miRBase, piRBase, Rfam and repeatMasker, and regions defined by users. The ncPRO pipeline also has a module to identify regions significantly enriched with short reads that can not be classified as known ncRNA families. ############# Docker version : download and run Dockerfile (go in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TTA Lossless Audio Codec
    Lossless compressor for multichannel 8,16 and 24 bits audio data, with the ability of password data protection. Being 'lossless' means that no data/quality is lost in the compression - when uncompressed, the data will be identical to the original.
    Downloads: 72 This Week
    Last Update:
    See Project
  • 23
    GloVe

    GloVe

    GloVe model for distributed word representation

    ...The demo.sh script downloads a small corpus, consisting of the first 100M characters of Wikipedia. It collects unigram counts, constructs and shuffles cooccurrence data, and trains a simple version of the GloVe model. It also runs a word analogy evaluation script in python to verify word vector quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    JPGRAR Create/Extract

    JPGRAR Create/Extract

    Extract or create JPGRAR easily. Also creates any other file formats.

    JPGRAR software allows you to create or extract archive files from JPGRAR, a seemingly normal JPG file with RAR secretly attached (steganography). Yes trust me, this is possible ;) JPGRAR is popular for sharing files on image board websites, where you are only allowed to share photos and external download links may expire. Your JPGRAR mustn't be compressed by the website for you to share it. It wouldn't work on, say, Facebook which compresses your photo to a lower quality file. While...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    CNVision

    CNV prediction from Illumina genotyping data

    CNVision is a Perl script that runs Illumina genotyping data (all chips from 300k to latest Omni) through PennCNV, QuantiSNPv2.3 and GNOSIS (an in-built algorithm). It merges the results and assesses the quality of the raw data. CNVision can also identify de novo CNVs in family-based data using a highly accurate algorithm that considers the possibility of CNVs in either parent based on the raw genotyping data.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo