Showing 888 open source projects for "data quality"

  • 1
    GLM-4-32B-0414

    Open Multilingual Multimodal Chat LMs

    ...It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced with reinforcement learning and human preference alignment for improved instruction-following and function calling. Variants like GLM-Z1-32B-0414 offer deep reasoning and advanced mathematical problem-solving, while GLM-Z1-Rumination-32B-0414 specializes in long-form, complex research-style writing using scaled reinforcement learning and external search tools. ...
    Downloads: 1 This Week
  • 2
    Improved Diffusion

    Release for Improved Denoising Diffusion Probabilistic Models

    improved-diffusion is an open source implementation of diffusion probabilistic models created by OpenAI. These models, also known as score-based generative models, are a class of generative models that have shown strong performance in producing high-quality synthetic data such as images. The repository provides code for training and sampling diffusion models with improved techniques that enhance stability, efficiency, and output fidelity. It includes scripts for setting up training runs, generating samples, and reproducing results from OpenAI’s research on diffusion-based generation. ...
    Downloads: 2 This Week
  • 3
    Open Crypto Tracker

    Bitcoin Alts portfolio tracker, email / text / alexa / telegram alerts

    100% FREE / open source / PRIVATE cryptocurrency portfolio tracker. Email / text / alexa / telegram price alerts, price charts, mining calcs, leverage / gain / loss / balance stats, news feeds +more. Privately track Bitcoin / Ethereum / unlimited cryptocurrencies. Customize as many assets / markets / alerts / charts as you want. Over 50 Exchanges / 40 Trading Pairs Supported (exchanges / pairings list at bottom of README.txt): https://tinyurl.com/ct-readme Nearly Unlimited Assets...
    Downloads: 4 This Week
  • 4
    grafx - Extensible Java Graphics Package

    Extensible Line-drawing & Polygon-filling Rasterization Libs for Java

    Line-drawing and polygon-filling rasterization algorithms in Java. Simple 2D package and more complex 3D package with Z-buffering. Texture mapping not supported at the moment. Color interpolation for lines, Gouraud shading for polygons fully supported. Floating point vertex coordinates, not integer. Pixel-level aliasing, not endpoint aliasing. Doesn't use Bresenham line style! Works with all Java versions. No external dependencies. Suitable for study in academic institutions,...
    Downloads: 0 This Week
  • 5

    Clipstitch

    Utility to make home movies from your digital camera files

    Full documentation: download clipstitchX.Y.html. Clipstitch makes movies from your camera (or phone) video files. FFmpeg is a professional-quality, free, open-source program for video editing that can perform a huge number of operations and handle every data format! This ability comes at a cost: its commands look complex and are difficult to use and remember. Clipstitch runs as a front end to ffmpeg so that you use only the subset of ffmpeg commands necessary for making a home movie from your digital camera, puts them in easier-to-read form, and internally combines multiple ffmpeg commands to do certain tasks. ...
    Downloads: 1 This Week
  • 6

    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
  • 7
    LabRPS

    Random phenomena generator

    ...LabRPS is mainly intended to help researchers in related fields quickly implement new simulation methods programmatically in their research, building on existing work; to help engineers numerically generate random phenomena in a more realistic way; and to help students and newcomers to the field learn quickly. It is highly modular and is integrated with a data visualizer that can generate different types of publication-quality 2D and 3D plots (such as line, scatter, bar, pie
    Downloads: 0 This Week
  • 8

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
  • 9
    Ada industrial control widget library

    A GUI library for designing gauges, meters, oscilloscopes, plotters

    A library that is intended for designing high-quality industrial control widgets for Ada applications. The widgets are composed of transparent layers drawn by cairo. The widgets are fully scalable graphics. A time controlled refresh policy is supported for real-time and heavy-duty applications. The library supports caching graphical operations and stream I/O for serialization and deserialization. Ready-to-use gauge, meter, oscilloscope widgets are provided as samples as well as an editor...
    Downloads: 0 This Week
  • 10
    Neiki's Gallery

    Vanilla JavaScript image gallery & lightbox

    Neiki's Gallery is a lightweight, production-ready image gallery and lightbox library built with vanilla JavaScript and CSS. It requires no dependencies and can be integrated with a single <script> tag, with automatic initialization out of the box. It provides a highly customizable experience for modern web projects, combining performance, flexibility, and rich UI interactions. Designed for both developers and end users, it supports responsive layouts, advanced lightbox features, touch...
    Downloads: 0 This Week
  • 11
    utzoo

    Usenet archive of the 1980s

    Contains the UTZOO Usenet archive spanning February 1981 to June 1991. Tapes are in A News format at first and B News format later.
    Downloads: 1 This Week
  • 12
    LaTeXML

    A TeX and LaTeX to XML/HTML/ePub/MathML translator

    LaTeXML is a tool that converts LaTeX documents into structured formats like HTML, MathML, and ePub. Unlike traditional TeX-to-PDF processors, LaTeXML preserves semantic content, making it suitable for web publishing, accessibility, and content reuse. It supports a wide range of LaTeX packages and is designed to enable high-quality rendering of mathematical and scientific documents.
    Downloads: 1 This Week
  • 13
    dbt-re-data

    re_data - fix data issues before your users & CEO would discover them

    re_data is an open-source data reliability framework for the modern data stack. Currently, re_data focuses on observing the dbt project (together with the underlying data warehouse - Postgres, BigQuery, Snowflake, Redshift). Data transformations in re_data are implemented and exposed as models & macros in this dbt package. Gather all relevant outputs about your data in one place using our cloud. Invite your team and debug it easily from there. Go back in time, and see your past metadata. Set up...
    Downloads: 0 This Week
  • 14
    eBook

    LaTeX book with examples, open-source eBook

    LaTeX can be a challenging system to learn, but with this book of varied examples, it becomes interesting and easier than ever. This comprehensive guide covers many of the essentials and is written in a clear and concise style, showing the result alongside the code. Plus, it's #opensource and freely available, creating a collaborative learning environment. In this book I try to reveal how you can find the necessary pieces of TeX code by looking only at finished work. If you want to create...
    Downloads: 0 This Week
  • 15
    FileHashler

    cross-platform (Java) file encryption and archiving tool

    Java-based API (library) and console utility for cross-platform file and directory encryption and archiving. Encrypts single files up to 9.2 EB in length. Directories can be encrypted as sets of single files in a single action. Uses Twofish (CBC) and SHA-256. Security level is strong and benefits from the PWS3 (Password Safe) encrypted file format. The level can be lowered to a 128-bit key length (normally 256).
    Downloads: 0 This Week
  • 16
    Bert-VITS2

    VITS2 backbone with multilingual-bert

    ...It provides emotional modeling through “emo embeddings,” allowing voices to be conditioned on different affective states during synthesis. Releases include optimizations for Japanese and English alignment, expanded training data, spec caching and pre-generation tools, as well as ONNX export for more lightweight inference deployments.
    Downloads: 2 This Week
  • 17
    re_data

    re_data - fix data issues before your users & CEO would discover them

    re_data is an open-source data reliability framework for the modern data stack. Currently, re_data focuses on observing the dbt project (together with underlying data warehouse - Postgres, BigQuery, Snowflake, Redshift). Gather all relevant outputs about your data in one place using our cloud. Invite your team and debug it easily from there. Go back in time, and see your past metadata. Set up Slack notifications to always know when a new report is produced or an existing one got updated.
    Downloads: 0 This Week
  • 18
    Atlas aims to provide high quality maps of the world for use in FlightGear, an open source flight simulator.
    Downloads: 4 This Week
  • 19
    Adala

    Adala: Autonomous DAta (Labeling) Agent framework

    Adala is a data-centric AI framework focused on dataset curation, annotation, and validation. It helps AI teams manage high-quality training datasets by providing tools for data auditing, error detection, and quality assessment.
    Downloads: 0 This Week
  • 20
    Resemble Enhance

    AI powered speech denoising and enhancement

    ...The models are trained on high-quality speech data, which helps the tool produce cleaner output than basic filtering alone. Its main value is giving developers and audio creators an open tool for upgrading imperfect speech recordings.
    Downloads: 1 This Week
  • 21
    axflow

    The TypeScript framework for AI development

    ...Its core SDK enables developers to integrate language model capabilities into web applications while maintaining strong modular design principles. Additional components support data ingestion, evaluation, and model interaction workflows that are commonly required when building production AI systems. For example, the framework includes modules for connecting application data to language models, evaluating the quality of model outputs, and building streaming user interfaces. Because each component can be used independently, developers can adopt Axflow incrementally rather than committing to a monolithic framework. ...
    Downloads: 0 This Week
  • 22
    DPM-Solver

    Fast ODE Solver for Diffusion Probabilistic Model Sampling

    DPM-Solver is a machine learning research implementation focused on accelerating the sampling process in diffusion probabilistic models used for generative AI tasks. Diffusion models are powerful generative systems capable of producing high-quality images and other data, but traditional sampling methods often require hundreds or thousands of computational steps. The project introduces a specialized numerical solver designed to approximate the diffusion process using a small number of high-order integration steps. By reformulating the sampling problem as the solution of a diffusion-related ordinary differential equation, the solver can produce high-quality samples much more efficiently. ...
    Downloads: 0 This Week
  • 23
    SolexaQA is software that calculates quality statistics and creates visual representations of data quality for second-generation sequencing data.
    Downloads: 2 This Week
  • 24
    YiVal

    Your Automatic Prompt Engineering Assistant for GenAI Applications

    ...It focuses on experimentation and optimization by allowing users to test multiple prompt variations, configurations, and model parameters in parallel, then evaluate their outputs using structured metrics and scoring systems. The platform is particularly useful in production environments where prompt quality directly impacts user experience, as it provides a repeatable and data-driven approach to refining prompts rather than relying on manual trial and error. YiVal supports integration with various LLM providers and can orchestrate experiments across different models, making it adaptable to evolving AI ecosystems. It also includes evaluation pipelines that help quantify output quality based on criteria such as accuracy, coherence, or task-specific benchmarks.
    Downloads: 0 This Week
  • 25

    Knobjex Info Manager

    Tool for PIM, mind-mapping, quality-management, knowledge-base.

    Also available for knowledge enthusiasts: https://github.com/some-avail/freekwensie Knobjex 5.01 partially released; Windows setup forthcoming; go to the "Files" tab to download. Added shortcut keys (chapter 5.3), improved dark theme support, fixed bugs. No database changes. Knobjex (short for Knowledge Objects) is an information manager. It has many potential uses, such as calendar, task list and sticky notes. Knobjex can also handle more advanced use cases such as...
    Downloads: 1 This Week