Showing 888 open source projects for "data quality"

  • 1
    MsgViewer
    MsgViewer is an email-viewer utility for .msg e-mail messages, implemented in pure Java. It works on Windows, Linux, and Mac platforms, and also provides a Java API to read mail messages (.msg files) programmatically.
    Downloads: 124 This Week
  • 2
    A Transport Stream analyser for the Brazilian D-TV system (SBTVD / ISDB-Tb). Its GUI shows the SI/PSI structure of the stream in a tree view and bitrate statistics for each ES, and it provides specific Closed Caption, EPG and DSMCC carousel decoding, plus more.
    Downloads: 3 This Week
  • 3
    dotCODES_Source_Control_for_VS

    The dotCODES Source Control Maintenance Mainframe (SCM2)

    The dotCODES Source Control Maintenance Mainframe for Visual Studio is an administrator console application for developing dotCODES components. Built upon a Python foundation, the program is used to create data center routines (Unix packages) and maintain enterprise cloud services (CGI scripts/Apache) by means of building dotCODES runtimes and deploying them to and from the client server.
    Downloads: 0 This Week
  • 4

    PNGwriter

    Easy plotting of images pixel by pixel!

    Frustrated because you want to output a high-quality image from your C++ program but haven't found an easy, direct and fast way? Try PNGwriter. Written with scientific simulations in mind, PNGwriter allows you to plot to a PNG image pixel by pixel. * * *README* * * This is the last PNGwriter release that will be hosted on SourceForge.net. Please use the following site: https://github.com/pngwriter/pngwriter/releases to find all versions of PNGwriter including and later than...
    Downloads: 0 This Week
  • 5
    ...Prokaryotic Proteogenomic Tool (PPT) is developed as part of the GenoSuite framework. PPT finds novel translations in prokaryotic genomes and depends on mass-spectrometry-based proteomics data and the related genome sequence. Some key features of the tool are: (1) Configured for 4 open source algorithms to perform database searching for peptide identification. Any combination of the 4 algorithms can be chosen. (2) Lists the genome-search-specific peptides (novel peptides). (3) Spectral matches can be visualized for the quality assessment...
    Downloads: 0 This Week
  • 6
    Reconcile: Web Credibility Evaluation

    Robust online credibility evaluation of web content

    ...Our dataset contains 15750 evaluations of 5543 pages from 2041 participants, with some additional information about website characteristics and basic demographic features of users. If you would like to gain access to this dataset or learn more about its structure, please contact our administrator (konrad@pjwstk.edu.pl), stating your name, academic degree and university affiliation.
    Downloads: 0 This Week
  • 7
    WebGL Heatmap

    A high performance WebGL/JS heatmap display library

    ...Because most work happens on the GPU, it scales well as the number of points grows and remains interactive for animations, live telemetry, or dynamic overlays. The library focuses on visual quality, producing anti-aliased, blur-free gradients that look good at various zoom levels and resolutions. It fits well into data-viz stacks where you need a responsive heat layer on top of maps, charts, or custom canvases, and it is compact enough to embed in dashboards and demos without heavy dependencies.
    Downloads: 0 This Week
  • 8
    VarScan

    Variant detection in next-generation sequencing data

    Variant detection in massively parallel sequencing. For one sample, calls SNPs, indels, and consensus genotypes. For tumor-normal pairs, further classifies each variant as Germline, Somatic, or LOH, and also detects somatic copy number changes. THE LATEST VERSION IS AVAILABLE ON GITHUB
    Downloads: 10 This Week
  • 9
    JsonRpc-Cpp is an open-source implementation of the JSON-RPC protocol in C++. JSON-RPC is a lightweight remote procedure call protocol similar to XML-RPC.
    Downloads: 0 This Week
  • 10

    HTQC

    Quality control and filtration for Illumina sequencing data

    A toolkit including a statistics tool for Illumina high-throughput sequencing data, and filtration tools for sequence quality, length, tail quality, etc.
    Downloads: 2 This Week
  • 11

    SynchedIn

    To synch and encrypt all sorts of data across different sources

    SynchedIn is a portable utility created to enforce realtime or latent synchronization of data (generally files) between different data storage sources. You can create as many synch configs as you please. Each synch config specifies a number of data sources and a number of schedulers. Each scheduler determines when the data sources must be checked for synchronization. Each data storage source represents a hierarchical (file system ...) or other type of data storage medium. ...
    Downloads: 0 This Week
  • 12
    MSCViewer

    A tool for visualization and analysis of logs as sequence diagrams

    MSCViewer is a tool intended for debugging control flows in concurrent, distributed systems. The tool loads logs generated by various entities in the system and visualizes events and interactions as a sequence diagram. The diagram is fully interactive: entities can be added to or removed from the diagram and shuffled; events can be filtered, searched, highlighted and annotated with comments. MSCViewer features integration with a Python interpreter which allows writing Python scripts...
    Downloads: 0 This Week
  • 13
    PROPER is a MATLAB package for visual evaluation of ranking classifiers in biological big data mining studies. It is an efficient tool for optimizing and comparing state-of-the-art ranking classifiers by generating over 20 different high-quality two- and three-dimensional performance curves.
    Downloads: 0 This Week
  • 14
    SecSy: Security-oriented Log Synthesis

    Tool for synthesizing business process logs.

    SecSy is a tool for security-oriented log synthesis. Besides basic synthesis properties (cases per day, office hours, randomized activity duration, ...), its detailed parameter settings for simulating business processes also allow specifying data usage (objects + access modality), actors for process activities and access control constraints for task/object permissions. It is also capable of enforcing/violating specific security properties on process traces, such as SoD/BoD, Unauthorized...
    Downloads: 0 This Week
  • 15

    Pyxplot

    Graph Plotting Package and Scientific Scripting Language

    Pyxplot is a scientific scripting language, graph plotting tool and vector graphics suite. It runs in a command-line environment under Linux, Mac OS X, and other POSIX systems.
    Downloads: 0 This Week
  • 16
    LightProfiler

    Profiler for Oracle extended SQL trace files

    LightProfiler is an application for performance analysis of Oracle databases. It generates a detailed resource profile for extended SQL trace files (10046 event), containing information about response time consumption (by events, by cursors, etc.), data file usage, error analysis (SQL, PL/SQL) and much more. It also contains tools for additional processing of trace files (extracting session data, splitting files) and for management of database sessions (disconnecting, tracing, monitor...
    Downloads: 0 This Week
  • 17
    Agilefant

    The scalable open source tool for lean and agile project management

    Agilefant is a simple but powerful web-based tool for lean and agile project management. It is the most versatile open source solution of its kind today, with a rapidly growing user base of more than 10k users around the world in thousands of organizations. Based on more than ten years of solid scientific research within real-world organizations, Agilefant's unique model scales from personal to-do lists to large-scale multi-team development and long-term product and portfolio...
    Downloads: 0 This Week
  • 18
    Roomba

    A Node.js tool to examine the correctness of Open Data Metadata

    Linked Open Data (LOD) has emerged as one of the largest collections of interlinked datasets on the web. Benefiting from this mine of data requires descriptive information about each dataset in the accompanying metadata. Such metadata is currently limited to a few data portals, where it is usually provided manually and therefore gives few or poor-quality insights.
    Downloads: 0 This Week
  • 19
    HTML Inspector

    HTML Inspector is a code quality tool

    HTML Inspector is a browser-based JavaScript code quality tool for checking markup against customizable rules. It was designed to sit between strict W3C validation and having no markup standards at all. The tool can detect questionable HTML patterns, enforce team conventions, and help developers catch structural problems before they become harder to maintain. Its rules are opinionated by default but can be customized or extended to match a team’s own standards. HTML Inspector runs directly...
    Downloads: 3 This Week
  • 20

    FASTQSim

    NGS data characterization and in silico read generation

    FASTQSim is a tool that provides the dual functionality of Next-Gen Sequencing dataset characterization and metagenomic data generation. FASTQSim is sequencing platform-independent, and computes distributions of read length, quality scores, indel rates, single point mutation rates, indel size, and similar statistics for any sequencing platform. To create training or testing datasets, FASTQSim has the ability to convert target sequences into in silico reads with matching error profiles. ...
    Downloads: 0 This Week
  • 21

    ZeeKMonitor

    Plot real-time data on a webpage

    This setup lets users plot any comma-separated values in real time. The plotted chart is published on a webpage. This is a good tool for showing real-time statistics. Special thanks to JFreeChart. Please see "http://www.jfree.org/lgpl.php". Technically, the client needs to broadcast (over UDP) comma-separated values to the server that processes the chart and pushes it to a webpage. The intermediate dataProcessor will capture the data. The chartProcessor will process and create images (every X...
    Downloads: 0 This Week
  • 22
    MicrobeGPS

    The Explorative Taxonomic Profiling Tool for Metagenomic Data

    MicrobeGPS is a bioinformatics tool for the analysis of metagenomic sequencing data. The goal is to profile the composition of metagenomic communities as accurately as possible and present the results to the user in a convenient manner. One main focus is reliability: the tool calculates quality metrics for the estimated candidates and allows the user to identify false candidates easily.
    Downloads: 0 This Week
  • 23

    intzip

    Fast compression of integer sets.

    ...Compression means that a suitable list of integers can be encoded to a (whenever possible) shorter list of the same integer format. An arbitrary set of integers (i.e., a collection of unique numbers where the order does not matter) may be transformed into suitable input data by way of translation and sorting. The main focus of the algorithm is to encode structural characteristics of a given list, such as coherent intervals and repeated distances. Therefore, the quality of compression depends on diversity rather than size. In most cases, IntZip is able to achieve much better compression results than general compression algorithms on such data sets.
    Downloads: 0 This Week
  • 24
    TCPPackLoader

    The program loads any TCP-based server simulating up to 10000 sockets

    The program loads your TCP-based server by simulating up to 10000 socket connections. OS: Android. The program works over GPRS and WiFi. It can help you find out how many connections your server can handle and the status of the network in use. Just enter the IP address of the server, the port of the server, and the maximum number of test socket connections. Server load is the slider position; it determines the delay (in msec) between socket connections. Position 0.0s means maximum...
    Downloads: 0 This Week
  • 25
    ospfdiscoverysnmp

    OSPF network discovery script via SNMP.

    OSPF_SNMP_v1.0.py is an OSPF network discovery script. It discovers all OSPF devices in the network without connecting to each of them or knowing all their IP addresses. The user must provide the IP and SNMP community string of a single OSPF device in the network (for example, a device in the core) and the script searches for all other OSPF routers. Prior to running the script, the user should configure an SNMPv2 community string on the device and also include the subnet...
    Downloads: 0 This Week