Showing 224 open source projects for "data quality"

View related business solutions
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    NLP Best Practices

    NLP Best Practices

    Natural Language Processing Best Practices & Examples

    In recent years, natural language processing (NLP) has seen quick growth in quality and usability, and this has helped to drive business adoption of artificial intelligence (AI) solutions. In the last few years, researchers have been applying newer deep learning methods to NLP. Data scientists started moving from traditional methods to state-of-the-art (SOTA) deep neural network (DNN) algorithms which use language models pretrained on large text corpora.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    QtiPlot
    QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 3
    Image Quality Assessment

    Image Quality Assessment

    Convolutional Neural Networks to predict aesthetic quality of images

    Image Quality Assessment is an open-source deep learning project that implements neural models for predicting the aesthetic and technical quality of digital images. The repository provides an implementation inspired by the NIMA (Neural Image Assessment) research approach, which uses convolutional neural networks trained on human-annotated datasets to estimate image quality scores.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    textgenrnn

    textgenrnn

    Easily train your own text-generating neural network

    With textgenrnn you can easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code. A modern neural network architecture that utilizes new techniques as attention-weighting and skip-embedding to accelerate training and improve model quality. Train on and generate text at either the character-level or word-level. Configure RNN size, the number of RNN layers, and whether to use bidirectional RNNs. Train on any generic input text...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 5
    NYCOpenData-Profiling-Analysis

    NYCOpenData-Profiling-Analysis

    Open Data Profiling, Quality and Analysis on NYC OpenData dataset

    Open data often comes with little or no metadata. You will profile a large collection of open data sets and derive metadata that can be used for data discovery, querying, and identification of data quality problems. For each column, identify and summarize the semantic types present in the column. These can be generic types (e.g., city, state) or collection-specific types (NYU school names, NYC agency).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    EverydayWechat

    EverydayWechat

    Python tool that automates WeChat messages, replies, & group utilities

    ...In addition to personal messaging automation, the project includes a group assistant that can respond to queries and provide useful information within chat groups. These group utilities can retrieve data such as weather conditions, calendar details, garbage classification information, movie box office statistics, delivery tracking updates, and air quality reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    MatchZoo

    MatchZoo

    Facilitating the design, comparison and sharing of deep text models

    The goal of MatchZoo is to provide a high-quality codebase for deep text matching research, such as document retrieval, question answering, conversational response ranking, and paraphrase identification. With the unified data processing pipeline, simplified model configuration and automatic hyper-parameters tunning features equipped, MatchZoo is flexible and easy to use. Preprocess your input data in three lines of code, keep track parameters to be passed into the model. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PyTorch-BigGraph

    PyTorch-BigGraph

    Generate embeddings from large-scale graph-structured data

    PyTorch-BigGraph (PBG) is a system for learning embeddings on massive graphs—think billions of nodes and edges—using partitioning and distributed training to keep memory and compute tractable. It shards entities into partitions and buckets edges so that each training pass only touches a small slice of parameters, which drastically reduces peak RAM and enables horizontal scaling across machines. PBG supports multi-relation graphs (knowledge graphs) with relation-specific scoring functions,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    abu

    abu

    Abu quantitative trading system (stocks, options, futures, bitcoin)

    ...The above system combines hundreds of seed quantitative models, such as financial time series loss model, deep pattern quality assessment model, long and short pattern combination evaluation model, long pattern stop-loss strategy model, short pattern covering strategy model, big data K-line pattern Historical portfolio fitting model, trading position mentality model, dopamine quantification model, inertial residual resistance support model, long-short swap revenge probability model, strong and weak confrontation model, trend angle change rate model, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    Coursebook

    Coursebook

    Introductory Systems Programming Textbook for University of Illinois

    Welcome to the systems programming coursebook! This repository houses a high-quality, open-source introductory systems programming textbook used by the CS 341: System Programming course at the University of Illinois at Urbana-Champaign The book assumes that you have taken a programming language course and are familiar with assembly instructions. All of the code and instruction will be in C, as it is the de-facto language of the Linux Kernel.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    OpenSeq2Seq

    OpenSeq2Seq

    Toolkit for efficient experimentation with Speech Recognition

    ...The toolkit includes ready-made models for neural machine translation, automatic speech recognition, speech synthesis, language modeling, and additional NLP tasks such as sentiment analysis. It supports multi-GPU and multi-node data-parallel training, and integrates with Horovod to scale out across large GPU clusters. Mixed-precision support (float16) is optimized for NVIDIA Volta and Turing GPUs, allowing significant speedups and memory savings without sacrificing model quality. The project comes with configuration-driven training scripts, documentation, and examples that demonstrate how to set up pipelines for tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PyX is a Python package for the creation of EPS, PS, PDF and SVG files. It combines an abstraction of the PostScript drawing model with a TeX/LaTeX interface. Complex tasks like 2d and 3d plots in publication-ready quality are built out of these primitives.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    AutotestnetServer

    Server part of autotestnet net edition (since 2.0)

    This project is used for server side of autotestnet (start from version 2.0) net edition. From v2.0, autotestnet use http to contact server(mysql) for all data storage, and a public service site was launched in pythonanywhere to support this. Before that, it contact mysql server directly by mysqltcl which need user to maintain mysql server by themselves, and, install mysqltcl is also a pain sometimes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    survol

    RDF-based framework monitoring business systems activity

    A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    matplotlib
    Matplotlib is a python library for making publication quality plots using a syntax familiar to MATLAB users. Matplotlib uses numpy for numerics. Output formats include PDF, Postscript, SVG, and PNG, as well as screen display. As of matplotlib version 1.5, we are no longer making file releases available on SourceForge. Please visit http://matplotlib.org/users/installing.html for help obtaining matplotlib.
    Leader badge
    Downloads: 57 This Week
    Last Update:
    See Project
  • 16
    pyEBSD-GUI

    pyEBSD-GUI

    A free software to post-process EBSD data.

    pyEBSD-GUI is a free software to post-process Electron Backscatter Diffraction (EBSD) data, inducing IPF mapping, image quality mapping, phase mapping etc, plotting pole figure, data segmentation, statistic analysis of the misorientation distribution etc. The software is written by python, and it supports both script and graphic user interface (GUI). pyEBSD V1.0 will be released soon.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    P3BSseq

    Parallel processing pipeline for analysis of bisulfite sequencing data

    Bisulfite sequencing (BSseq) processing is among the most cumbersome next generation sequencing (NGS) applications. Though some BSseq processing tools are available, they are scattered, require puzzling parameters and are running-time and memory-usage demanding. We have developed P3BSseq, a parallel processing pipeline for fast, accurate and automatic analysis of BSseq reads that trims, aligns, annotates, records the intermediate results, performs bisulfite conversion quality assessment,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ncPRO-seq

    ncPRO-seq

    Non-Coding RNA PROfiling from sRNA-seq

    ncPRO-seq is a tool for annotation and profiling of ncRNAs from smallRNA sequencing data. It aims to interrogate and perform detailed analysis on small RNAs derived from annotated non-coding regions in miRBase, piRBase, Rfam and repeatMasker, and regions defined by users. The ncPRO pipeline also has a module to identify regions significantly enriched with short reads that can not be classified as known ncRNA families. ############# Docker version : download and run Dockerfile (go in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SciEnPlot

    SciEnPlot

    Data Plotting and Analysis for Science and Engineering

    ... - Implemented a 3d surface plot (GLSurface) based on OpenGL (ScienPlot v1.3.2 and above) - ColorMap surface, trisurface, Pie, Polar plots, and 3D height field, 3dBar, scatter plots (under developing), and more - Column by column plotting/calculation - LaTex commands enclosed by $ symbols can be used for the labels in Graph - Accept txt(Text) and csv(Comma separated values) formatted data - Save, copy, print Graph - Use spread sheets to display data - Textboard to organize the results - Graphs in a publishable quality - Source codes based on: Python Numpy Scipy Matplotlib WxPython Visvis etc. - Special functions - Drag and drop data files - Python console is back (since v1.3.3), capable of reusing column data - Debye and Guinier models for SANS / SAX data - More apps in our Web below
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    dotCODES_Source_Control_for_VS

    dotCODES_Source_Control_for_VS

    The dotCODES Source Control Maintenance Mainframe (SCM2)

    The dotCODES Source Control Maintenance Mainframe for Visual Studio is an administrator console application for developing dotCODES components. Built upon a Python foundation, the program is used to create data center routines (Unix packages) and maintain enterprise cloud services (CGI scripts/Apache) by means of building dotCODES runtimes and deploying them to and from the client server.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MSCViewer

    MSCViewer

    A tool for visualization and analysis of logs as sequence diagrams

    MSCViewer is a tool intended for debugging of control flows in concurrent, distributed systems. The tool loads logs generated by various entities in the system and visualize a sequence diagram chart for events and interactions. The diagram is fully interactive: entity can be added/removed from the diagram and shuffled; events can be filtered, searched, highlighted and annotated with comments. MSCViewer features integration with a Python interpreter which allows writing Python scripts...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LightProfiler

    LightProfiler

    Profiler for Oracle extended SQL trace files

    LightProfiler – application for performance analysis of the Oracle databases. It generates detailed resource profile for extended SQL trace files (10046 event), containing information about consuming of response time (by events, by cursors, etc.), data files usage, error analysis (SQL, PL/SQL) and much more. Also it contain tools for additional processing of trace files (extract session data, splitting files) and for management of database's sessions (disconnecting, tracing, monitor...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PanelCheck is an easy-to-use software tool for visualization of sensory profiling data using different types of plots. The joint information from the implemented plots provide detailed insight into assessor and panel performance.
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 24
    MicrobeGPS

    MicrobeGPS

    The Explorative Taxonomic Profiling Tool for Metagenomic Data

    MicrobeGPS is a bioinformatics tool for the analysis of metagenomic sequencing data. The goal is to profile the composition of metagenomic communities as accurately as possible and present the results to the user in a convenient manner. One main focus is reliability: the tool calculates quality metrics for the estimated candidates and allows the user to identify false candidates easily.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    ospfdiscoverysnmp

    ospfdiscoverysnmp

    OSPF network discovery script via SNMP.

    OSPF_SNMP_v1.0.py is an OSPF network discovery script. It allows the discovery of all OSPF network devices in the network, without connecting to each of them or knowing all their IP addresses. The user must provide the IP and SNMP community string of a single OSPF device in the network (for example, a device in the core) and the script searches for all other OSPF routers. Prior to running the script, the user should configure SNMPv2 community string on the device and also include the subnet...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo