Showing 766 open source projects for "data quality"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1

    MutAid

    MutAid: Sanger and NGS based pipeline for mutation screening.

    MutAid: Sanger and NGS based integrated pipeline for mutation identification, validation and annotation in molecular diagnosis. MutAid is an integrated pipeline for mutation screening in clinical research. It can analyze Sanger sequencing and NGS data from raw reads to list of annotated mutation list. MutAid can analyze and interpret raw sequencing data produced by Sanger or several NGS sequencing platforms. It performs format conversion, base calling, quality trimming, filtering, read mapping, variant calling, variant annotation and co-analyze Sanger and NGS data under a single platform. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    CanReg5 (moved to Github)

    CanReg5 (moved to Github)

    Canreg5 is a software package for population based cancer registries

    ...It has modules to do: data entry, quality control, consistency checks and basic analysis of the data It was designed with an emphasis on user friendliness, it has a modern user interface and is easy to navigate. Is available in several languages. (English, French, Spanish, Portuguese, Russian, Turkish, Georgian, and Chinese.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Restful APIs for Data Cleansing

    Restful APIs for Data Cleansing

    This is sister project for osDQ which provide Restful APIs

    (Beta Version) This is sister project for https://sourceforge.net/projects/dataquality/ . It provides Restful APIs for features for data quality and data preparation features. This project will help projects which want embed data quality and data preparation features in their project or UI using restful calls. Data Cleansing APIs Dockerfile: # Pull base image FROM frnde/jetty-9.4.2-jre8-alpine-cet ADD osdq-v0.0.1.war /var/lib/jetty/webapps/osdq.war EXPOSE 8080 Docker Image https://hub.docker.com/r/vreddym/osdq-web/tags
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    iCAS - An Illumina Clone Assembly System

    An Illumina clone assembly system using SOAPdenovo and ABySS

    Clone-by-clone sequencing, as a means of achieving high quality assemblies for large and complex genomes, continues to be of great relevance in the era of high throughput sequencing. However, assemblies obtained using current whole genome assemblers are often fragmented and sometimes have issues of genome completeness owing to different data characteristics introduced by multiplexed sequencing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 5
    CSV Transformer

    CSV Transformer

    transforms xml to csv

    The CSV Transformer is a data processing tool which transforms .xml-Files to comma separated values. The CSV Transformer was created in a load and performance testing project, the use case was to be able to transform 2800 configuration files of a big banking application to a single .csv-File. With this single file it was possible to compare the whole configuration between two releases with the already available tool CSV Comparator. This enabled the team to verify if there were any...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    PyPES Library

    Library of analytic potential energy surfaces

    The PyPES library is a python-based library of high quality semi-global potential energy surfaces for 50 molecules, each containing 3-6 atoms. The PyPES code enables the generation of energy derivatives to 6th order about any point on the potential energy surface in a range of common coordinate systems, including curvilinear internal, Cartesian and normal mode coordinates. For portability, FORTRAN, C, MATLAB and Mathematica wrappers are provided to interface with PyPES, reading in PyPES-generated Cartesian and normal mode derivative data via a text interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DBpedia Spotlight

    DBpedia Spotlight

    DBpedia Spotlight is a tool for automatically annotating

    It is a tool for automatically annotating mentions of DBpedia resources in text, providing a solution for linking unstructured information sources to the Linked Open Data cloud through DBpedia. With a four step approach, DBpedia Spotlight performs named entity extraction, including entity detection and name resolution. It can also be used for named entity recognition, amongst other information extraction tasks. Empower the user experience reusing, interlinking and making semantic queries among high-quality open datasets, extracting meaning from unstructured data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Ava: Testdata Xsl

    Ava: Testdata Xsl

    generates Testdata on base of excel: creates xml,excel,csv,html,sql,+

    this tool for test-data-generation receives an 'excel-sheet' as primary input. second important paramter is the 'number of test-records to produce'. The excel-data will be reused as long data is needed. This tool is hightly paramatrisazable by the use of 'xsl scripts'. data can be created, updated, modified and finally exported in a format of your choice Main Fuctions: (1) Generates Testdata (excel, xsl, xml) (2) Exports generated testdata in multiple formats (csv, excel, html,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    EPRI Open PQ Dashboard

    EPRI Open PQ Dashboard

    Demos new techniques for extracting information from PQ data files

    ...This version consist of a few proof-of-concept applications of applying event severity and trend values to heatmap displays—giving the PQ engineers a wide-area status of PQ for quick interpretation. Data quality has been added so users can quickly see when meters are providing incomplete or invalid data. This dashboard currently accepts power quality data from COMTRADE and PQDIF standard file formats. Other proprietary software interfaces have been added. See the installation manual for more details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    ncPRO-seq

    ncPRO-seq

    Non-Coding RNA PROfiling from sRNA-seq

    ncPRO-seq is a tool for annotation and profiling of ncRNAs from smallRNA sequencing data. It aims to interrogate and perform detailed analysis on small RNAs derived from annotated non-coding regions in miRBase, piRBase, Rfam and repeatMasker, and regions defined by users. The ncPRO pipeline also has a module to identify regions significantly enriched with short reads that can not be classified as known ncRNA families. ############# Docker version : download and run Dockerfile (go in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    GloVe

    GloVe

    GloVe model for distributed word representation

    ...The demo.sh script downloads a small corpus, consisting of the first 100M characters of Wikipedia. It collects unigram counts, constructs and shuffles cooccurrence data, and trains a simple version of the GloVe model. It also runs a word analogy evaluation script in python to verify word vector quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JPGRAR Create/Extract

    JPGRAR Create/Extract

    Extract or create JPGRAR easily. Also creates any other file formats.

    JPGRAR software allows you to create or extract archive files from JPGRAR, a seemingly normal JPG file with RAR secretly attached (steganography). Yes trust me, this is possible ;) JPGRAR is popular for sharing files on image board websites, where you are only allowed to share photos and external download links may expire. Your JPGRAR mustn't be compressed by the website for you to share it. It wouldn't work on, say, Facebook which compresses your photo to a lower quality file. While...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MsgViewer
    MsgViewer is email-viewer utility for .msg e-mail messages, implemented in pure Java. MsgViewer works on Windows/Linux/Mac Platforms. Also provides a java api to read mail messges (msg files) programmatically.
    Leader badge
    Downloads: 124 This Week
    Last Update:
    See Project
  • 14
    A Transport Stream analyser for the Brazilian D-TV system (SBTVD / ISDB-Tb). It shows in a GUI the SI/PSI structure of the stream in a tree-view, bitrate statistics for each ES, specific Closed Caption, EPG and DSMCC carroussel decoding plus more.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15

    PNGwriter

    Easy plotting of images pixel by pixel!

    Frustrated because you want to output a high-quality image from your C++ program but haven't found an easy, direct and fast way? Try PNGwriter. Written with scientific simulations in mind, PNGwriter allows you to plot to a PNG image pixel by pixel. * * *README* * * This is the last PNGwriter release that will be hosted on SourceForge.net. Please use the following site: https://github.com/pngwriter/pngwriter/releases to find all versions of PNGwriter including and later than...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    ...Prokaryotic Proteogenomic Tool(PPT) is developed as a part of GenoSuite framework. PPT is for finding novel translations in prokaryotic genomes and is dependent on Mass Spectrometry based proteomics data and related genome sequence. Some Key features of the tool are.... (1) Configured for 4 open source algorithms to perform database searching for peptide identification. Any combination of the 4 algorithm can be chosen. (2) Lists out the genome search specific peptides(Novel peptides). (3) Spectral matches can be visualized for the quality assessment...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Reconcile: Web Credibility Evaluation

    Reconcile: Web Credibility Evaluation

    Robust online credibility evaluation of web content

    ...Our dataset contains 15750 evaluations of 5543 pages from 2041 participants with some additional information about website characteristics and basic demographic features of users. If You would like to gain access to this data-set or learn more about data structure please contact our administrator (konrad@pjwstk.edu.pl) stating Your name, academic degree and university affiliation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    WebGL Heatmap

    WebGL Heatmap

    A high performance WebGL/JS heatmap display library

    ...Because most work happens on the GPU, it scales well as the number of points grows and remains interactive for animations, live telemetry, or dynamic overlays. The library focuses on visual quality, producing anti-aliased, blur-free gradients that look good at various zoom levels and resolutions. It fits well into data-viz stacks where you need a responsive heat layer on top of maps, charts, or custom canvases, and it is compact enough to embed in dashboards and demos without heavy dependencies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    VarScan

    VarScan

    Variant detection in next-generation sequencing data

    Variant detection in massively parallel sequencing. For one sample, calls SNPs, indels, and consensus genotypes. For tumor-normal pairs, further classifies each variant as Germline, Somatic, or LOH, and also detects somatic copy number changes. THE LATEST VERSION IS AVAILABLE ON GITHUB
    Downloads: 10 This Week
    Last Update:
    See Project
  • 20
    JsonRpc-Cpp is an OpenSource implementation of JSON-RPC protocol in C++. JSON-RPC is a lightweight remote procedure call protocol similar to XML-RPC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    SynchedIn

    To synch and encrypt all sorts of data across different sources

    SynchedIn is a portable utility created to enforce realtime or latent synchronization of data (generally files) between different data storage sources. You can create as many synch configs as you please. Each synch config specifies a number of data sources, a number of schedulers. Each scheduler determines when the data sources must be checked for synchronization. Each data storage source represents a hierarchical (file system ...) or other type of data storage medium. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MSCViewer

    MSCViewer

    A tool for visualization and analysis of logs as sequence diagrams

    MSCViewer is a tool intended for debugging of control flows in concurrent, distributed systems. The tool loads logs generated by various entities in the system and visualize a sequence diagram chart for events and interactions. The diagram is fully interactive: entity can be added/removed from the diagram and shuffled; events can be filtered, searched, highlighted and annotated with comments. MSCViewer features integration with a Python interpreter which allows writing Python scripts...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PROPER is a package for visual evaluation of ranking classifiers for biological big data mining studies in the mathematical language MATLAB. It is an efficient tool for optimization and comparison of the state-of-the-art ranking classifiers by generating over 20 different high quality two- and three-dimensional performance curves.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    SecSy: Security-oriented Log Synthesis

    SecSy: Security-oriented Log Synthesis

    Tool for synthesizing business process logs.

    SecSy is a tool for security-oriented log synthesis. Besides basic synthesis properties (cases per day, office hours, randomized activity duration, ...), its detailed parameter setting for simulating business processes also allows to specify data usage (objects + access modality), actors for process activities and access control constraints for task/object permissions. It is also capable of enforcing/violating specific security properties on process traces, such as SoD/BoD, Unauthorized...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Pyxplot

    Graph Plotting Package and Scientific Scripting Language

    Pyxplot is a scientific scripting language, graph plotting tool and vector graphics suite. It runs in a command-line environment under Linux, Mac OS X, and other POSIX systems.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo