Showing 888 open source projects for "data quality"

  • 1
    UDPspeeder

    A Tunnel which Improves your Network Quality

    UDPspeeder is a tunneling utility focused on improving UDP traffic quality over high-latency, unstable, or packet-loss-prone links. Its core technique is Forward Error Correction, which sends redundant data so the connection can recover from packet loss without waiting for retransmission. Used by itself, it improves UDP-based connections, but it can also be paired with UDP-based VPN tools to support broader traffic types through the VPN layer.
    Downloads: 1 This Week
  • 2
    Self-learning-Computer-Science

    Resources to learn computer science in your spare time

    Self-learning Computer Science is a curated, open-source guide repository designed to help learners independently study computer science topics using high-quality university-level resources. The author (an undergraduate CS student) assembled links to courses from institutions like MIT, UC Berkeley, Stanford, etc., covering mathematics, programming, data structures/algorithms, computer architecture, machine learning, software engineering and more. It’s aimed at learners who find traditional course structures restrictive and want a flexible, self-paced path through CS, with a focus on building depth and breadth rather than shortcut exam skills. ...
    Downloads: 0 This Week
  • 3
    Rachota

    Easy-to-use personal time tracking application for Windows and Unix

    Rachota is a portable application for time-tracking your work on different projects. It runs on Windows, Linux, Solaris and Mac. Rachota displays time data in diagram form, creates customized reports or invoices and provides suggestions to increase your efficiency.
    Downloads: 5 This Week
  • 4
    Data science blogs

    A curated list of data science blogs

    Data Science Blogs is a curated repository that aggregates a wide range of high-quality blogs and resources related to data science, machine learning, and analytics into a single organized collection. It serves as a discovery platform for practitioners, researchers, and learners who want to stay updated with industry trends, techniques, and insights without manually searching for reliable sources.
    Downloads: 0 This Week
  • 5
    Lyra

    A Very Low-Bitrate Codec for Speech Compression

    ...It replaces hand-engineered codecs with learned models that capture speech characteristics more efficiently and reconstruct waveforms with a neural vocoder. The system targets mobile-class hardware, balancing latency and quality so it can run in real-time on phones. Its architecture is resilient to packet loss and jitter through framing strategies and error concealment, helping conversations remain understandable under adverse conditions. The codebase includes encoder and decoder components, along with tools for data preparation and evaluation. By pushing bitrates down to just a few kilobits per second while retaining quality, Lyra expands access to voice calls where bandwidth is scarce.
    Downloads: 0 This Week
  • 6
    GitHub Rater

    Check your GitHub rating, view results and enhance your quality

    ...The application’s frontend is built with modern web frameworks and interacts with GitHub’s API to gather real-time data, handling errors and edge cases gracefully. Its interface visualizes metrics and recommendations, making the assessment accessible even to those unfamiliar with raw GitHub statistics.
    Downloads: 0 This Week
  • 7
    SQLBucket

    Lightweight library to write, orchestrate and test your SQL ETL

    SQLBucket is a lightweight framework to help write, orchestrate and validate SQL data pipelines. It lets you set variables and introduces some control flow using the fantastic Jinja2 library. It also implements a very simple unit and integration test framework in which you validate the results of your ETL with SQL checks. With SQLBucket, you can apply TDD principles when writing data pipelines. To start working, you need to instantiate your SQLBucket core...
    Downloads: 0 This Week
  • 8
    ASH Viewer provides a graphical view of active session history data for Oracle and PostgreSQL databases
    Downloads: 22 This Week
  • 9
    Super-PDF-Editor

    World's most comprehensive, powerful, process-based PDF editor

    ...OCR works on PDF files, including scanned PDFs, and on image files in multiple formats. Automatic and manual image enhancement improve OCR accuracy and quality. Supports 165+ languages with three language data sets. Multiple languages can be used at once. International languages: 127 languages in High, Medium, and Fast quality. Supported inputs: scanned images (jpg, png, gif, tiff, bmp), multi-page TIFF and GIF, and scanned PDFs.
    Downloads: 7 This Week
  • 10
    Data Science Collected Resources

    Carefully curated resource links for data science in one place

    ...Its goal is to provide learners and practitioners with easy access to high-quality resources related to data science tools, programming languages, cloud platforms, and machine learning techniques. The repository includes links to materials discussing topics such as artificial intelligence research, AWS infrastructure, machine learning algorithms, and data analysis tools. It also contains supplementary documents like cheat sheets and machine learning notes that help readers review important concepts quickly.
    Downloads: 0 This Week
  • 11
    EnCodec

    State-of-the-art deep learning based audio codec

    ...Unlike traditional codecs (like MP3 or Opus), EnCodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can operate in real time and supports variable bandwidths, bitrates, and multi-band audio. EnCodec has applications in speech and music compression, generative modeling, and efficient data transmission for communication systems. The repository includes pretrained checkpoints, PyTorch inference code, and examples for integrating EnCodec as a module in downstream generative or streaming systems.
    Downloads: 1 This Week
  • 12
    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    ...OCR works on PDF files, including scanned PDFs, and on image files in multiple formats. Automatic and manual image enhancement improve OCR accuracy and quality. Supports 165+ languages with three language data sets. Multiple languages can be used at once. International languages: 127 languages in High, Medium, and Fast quality. Supported inputs: scanned images (jpg, png, gif, tiff, bmp), multi-page TIFF and GIF, and scanned PDFs.
    Downloads: 4 This Week
  • 13
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high-quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 14
    Penumbra

    Penumbra Color Theme

    ...The palette consists of nine nearly symmetric base colors, which are used to build the main light and dark themes, along with two additional high-contrast dark variants tailored for people with mild to moderate visual impairments. Its design focuses on functionality first, while maintaining an aesthetic quality that draws from familiar natural tones. Beyond its use in text editors and terminal environments, Penumbra’s carefully structured accent palettes are also suited for encoding information in data visualizations, where perceptual uniformity and hue differentiability are critical.
    Downloads: 4 This Week
  • 15
    Lingua

    The most accurate natural language detection library for Java

    Its task is simple: It tells you which language some provided textual data is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages.
    Downloads: 0 This Week
  • 16
    CityHash

    Automatically exported from Google code CityHash

    ...The library emphasizes hashing performance and uniformity rather than cryptographic security, making it ideal for use in data structures like hash tables and distributed systems requiring rapid key lookups. CityHash has been rigorously tested using tools like SMHasher to ensure high-quality mixing and collision resistance across a wide range of inputs. Its speed and portability have made it a popular choice for developers needing dependable, lightweight hash functions.
    Downloads: 0 This Week
  • 17
    Grow.dev

    A declarative website generator designed for high-quality websites

    Grow.dev is a static site generator optimized for building highly interactive, localized microsites. Grow.dev focuses on providing optimal workflows and developer ergonomics for creating projects that are highly maintainable in the long term. Grow.dev encourages a strong but simple separation of content and presentation and makes maintaining content in different locales and environments a snap.
    Downloads: 0 This Week
  • 18
    DBFeeder

    Highly Customizable Test Data Generator

    DBFeeder is a great tool for generating synthetic test data for Oracle databases, and it is ideal for companies that want to outsource development. Thanks to its original approach, the data is highly customizable and even satisfies the tables' primary and foreign key constraints.
    Downloads: 0 This Week
  • 19
    WaveRNN

    WaveRNN Vocoder + TTS

    WaveRNN is a PyTorch implementation of DeepMind’s WaveRNN vocoder, bundled with a Tacotron-style TTS front end to form a complete text-to-speech stack. As a vocoder, WaveRNN models raw audio with a compact recurrent neural network that can generate high-quality waveforms more efficiently than many traditional autoregressive models. The repository includes scripts and code for preprocessing datasets such as LJSpeech, training Tacotron to produce mel spectrograms, training WaveRNN on those spectrograms (with optional GTA data), and finally generating audio. A quick_start.py script allows users to immediately synthesize example sentences from a pretrained model and inspect both generated audio and attention plots. ...
    Downloads: 0 This Week
  • 20
    D-List

    An extensive list subsystem, written in C

    D-List is an extensive, production-quality, and robust list management subsystem written in C that can be used as a shared library or compiled directly into your project. D-List provides access to 3 classes of lists, and allows extensive manipulation and customization of these lists. D-List lists can also be easily incorporated as primitives into larger tree-structured or nested data management systems.
    Downloads: 0 This Week
  • 21
    Display, log and upload data to the internet from your Oregon Scientific or AcuRite 02032C weather station. The Arduino WxShield is also supported. See project web site for FAQs. Use Open Discussion for support requests.
    Downloads: 16 This Week
  • 22

    EZStacking

    EZStacking is a Jupyter notebook generator for machine learning

    EZStacking is a Jupyter notebook generator for supervised learning problems using Scikit-Learn pipelines and stacked generalization. EZStacking handles classification and regression problems for structured data. It can also be viewed as a development tool, because a notebook generated with EZStacking contains an exploratory data analysis (EDA) used to assess data quality, a modelling step that produces a reduced-size stacked estimator, and a server that returns a prediction, a measure of the quality of the input data, and the execution time.
    Downloads: 0 This Week
  • 23
    QuickPlot

    Simple user interface for gnuplot aimed at reflectometry data

    A graphical user interface for gnuplot for creating publication-quality figures very quickly. It supports templates for fast formatting of graphics, different plot styles, insets, and axis and label options. One important feature is storing metadata in png and pdf files that can be used to reload any graph saved with QuickPlot.
    Downloads: 1 This Week
  • 24
    Starlight.jl

    A greedy game engine for greedy programmers

    Welcome to the documentation for Starlight.jl, a greedy application framework for greedy developers. Its primary use case is video games, but the power of Julia, SDL2, Vulkan, and the Bullet Physics SDK can be leveraged to make just about anything you want. With a focus on flexibility and code quality, Starlight aims to be such a framework. It includes a suite of components and integrations that make it particularly well-suited for video games, so it is not a stretch to call it a "game...
    Downloads: 0 This Week
  • 25
    QuickRedis

    QuickRedis is a free-forever Redis GUI tool

    QuickRedis is a free-forever Redis desktop manager. It supports direct connection, sentinel, and cluster modes, multiple languages, and hundreds of millions of keys, and has an amazing UI. It runs on Windows, Mac OS X, and Linux.
    Downloads: 16 This Week