Showing 26 open source projects for "data quality"

  • 1
    Book2_Beauty-of-Data-Visualization

    Machine Learning, Criticism and Correction

    ...By combining theory with hands-on plotting exercises, the book helps readers build both analytical and presentation skills. Overall, it is intended as a foundational guide for anyone seeking to produce professional-quality data visualizations.
    Downloads: 0 This Week
  • 2
    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 2 This Week
  • 3
    Scientific Visualization

    An open-access book on scientific visualization using Python

    The Scientific Visualization book is a freely available open-access textbook that introduces how to produce effective scientific visualizations using Python, focusing especially on leveraging the popular plotting library Matplotlib (and related tools). It goes beyond simple plotting tutorials and emphasizes design principles: how to choose colors, layout subplots, annotate graphs, and present data in a way that is both accurate and visually compelling. As such, it serves as a guide for researchers, data scientists, and academic authors who need to create publication-quality figures or explanatory graphics, rather than quick exploratory plots. It includes extensive examples that demonstrate best practices — for instance handling multiple subplots, combining line plots with scatter/density overlays, or rendering high-resolution vector graphics for print.
    Downloads: 0 This Week
  • 4
    Pinyin

    A high-quality solution for converting Chinese to Pinyin

    A Chinese-to-Pinyin conversion tool based on the CC-CEDICT dictionary that converts polyphonic characters more accurately. Memory type: suitable for servers with ample memory; advantage: fast conversion. Small memory type (default): suitable for environments with tight memory; advantage: small memory footprint, though conversion is slower than the memory type. I/O type: suitable for virtual machines with strict memory restrictions. Advantages: very minimal...
    Downloads: 0 This Week
  • 5
    List of Free Learning Resources

    Freely available programming books

    ...Maintained by the community, it organizes materials by topic, language, and skill level, making it easy to discover learning resources. The repository includes content on software development, computer science, data science, and more. It is continuously updated with new resources contributed by developers worldwide. The project emphasizes accessibility and open education, providing high-quality materials without cost. It serves as a central hub for self-learners and professionals alike. Its structured organization makes it a widely used reference for learning programming.
    Downloads: 1 This Week
  • 6
    LLM Course

    Course to get into Large Language Models (LLMs)

    ...The materials also cover inference optimization and quantization to make serving LLMs feasible on commodity GPUs or even CPUs, which is crucial for side projects and startups. Evaluation is treated as a first-class topic, with examples of automatic and human-in-the-loop methods to catch regressions and verify quality beyond simple loss values. By the end, students have a mental model and a practical toolkit for iterating on datasets, training configs, etc.
    Downloads: 0 This Week
  • 7
    AI Researcher

    An autonomous AI researcher

    ...Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they communicate through a task-management interface that ensures progress tracking and iterative refinement. The system emphasizes modularity, so teams can swap in new reasoning modules, data retrieval strategies, or domain knowledge bases depending on the research topic. Through self-supervised feedback loops, agents adjust their strategies based on prior outcomes, improving both the quality and relevance of results over time.
    Downloads: 0 This Week
  • 8
    MathGL

    A library for scientific data visualization

    A free cross-platform library of fast C++ routines for plotting up to 3-ranged data. It can export to bitmap and vector EPS/SVG files. There are window interfaces (GLUT/FLTK/Qt) and console tools. MathGL can be used from C/Fortran/Python/Octave/Lua.
    Downloads: 53 This Week
  • 9
    Statistics101 - Resampling Statistics

    Use simulation to perform statistical analyses.

    Statistics101 is an Integrated Development Environment (IDE) that uses a simple, powerful language called “Resampling Stats” to develop Monte Carlo programs to analyze and solve statistical problems. The original Resampling Stats language and computer program were developed by Dr. Julian Simon (https://www.juliansimon.com/) and Peter Bruce (https://www.scientificamerican.com/author/peter-bruce/) as a new way to teach Statistics to social science students. Of course, social science students...
    Downloads: 12 This Week
  • 10

    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
  • 11
    eBook

    LaTeX book with examples, open-source eBook

    LaTeX can be a challenging system to learn, but with this book of varied examples, it becomes interesting and easier than ever. This comprehensive guide covers many of the essentials and is written in a clear and concise style, showing the result next to the code. Plus, it's #opensource and freely available, creating a collaborative learning environment. In this book I try to show how you can find the necessary pieces of TeX code by looking only at already finished work. If you want to create...
    Downloads: 0 This Week
  • 12

    linSmith

    Smith chart intended for educational use

    A Smith charting program. You can enter either discrete components or transmission lines, see the results on screen, and/or generate PostScript output. Component values can be changed numerically or using scrollbars.
    Downloads: 2 This Week
  • 13
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high-quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 14
    TemaTres: controlled vocabulary server

    Manage, Publish and Share Ontologies, Taxonomies, Thesauri, Glossaries

    Web application for managing formal representations of knowledge, thesauri, taxonomies and multilingual vocabularies. For the latest version of the code: https://github.com/tematres/TemaTres-Vocabulary-Server
    Downloads: 6 This Week
  • 15
    TESTIMAGES

    Testing images for scientific purposes

    ...Please cite the following papers when using any image in this archive: * ASUNI N, GIACHETTI A, "TESTIMAGES: A Large Data Archive For Display and Algorithm Testing", Journal of Graphics Tools, Volume 17, Issue 4, 2015, pages 113-125, DOI:10.1080/2165347X.2015.1024298 * ASUNI N, GIACHETTI A, "TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms", STAG - Smart Tools & Apps for Graphics Conference, 2014.
    Downloads: 169 This Week
  • 16
    RNAseq Tutorial

    Informatics for RNA-seq: A web resource for analysis on the cloud

    rnaseq_tutorial is a tutorial and educational resource created by the Griffith Lab that guides users through the steps of RNA-seq data analysis. It includes working pipelines for alignment, differential expression, alternative splicing, visualization, and interpretation. It is designed to run in the cloud or local environments, providing introductory material on file formats, reference genomes / annotation, QC, mapping, quantifying expression, visualizing results, etc. The version in that...
    Downloads: 0 This Week
  • 17
    Scripture

    Biblical Data-Research Tool

    The real is finite; the irreal is infinite. The unreal is bigger than infinite. Real is the physical; irreal is the possible. Unreal is the incoherent, useless, meaningless, insane. All religions offend me: when someone tries to sell me their delusions, when someone says I can't have morals without religion, when someone doesn't hold others to their level just because of an Iron Age book, when they refuse to understand why faith is harmful, when they kill and hate for their imaginary gods....
    Downloads: 0 This Week
  • 18
    Semtinel is an application framework for the semi-automatic creation, maintenance and analysis of hierarchical concept schemes (thesaurus, classification or ontology). Semtinel supports the development of new analysis methods and visualization techniques.
    Downloads: 0 This Week
  • 19
    Pamvotis is a wireless LAN simulator for all the current physical layer extensions of the IEEE 802.11 standard (IEEE 802.11 a/b/g) and for the IEEE 802.11e draft for Quality of Service in WLANs.
    Downloads: 0 This Week
  • 20
    This program provides graphical visualizations of various engineering concepts, such as antenna radiation patterns (3D full and sectional views), electron bunching in a two-cavity klystron amplifier, and many more.
    Downloads: 0 This Week
  • 21
    Cinege aims to be the ultimate library (management) system, one day. Currently, it is a very nice tool for regexp-based filtering of bibliographic and item data and for finding various bugs in your existing database.
    Downloads: 0 This Week
  • 22
    Yabman is a tool for managing bibliographic references. Its key features are a quality user interface, a carefully designed data model, and sophisticated three-state hierarchical reference labeling. It is currently usable but in a pre-alpha stage.
    Downloads: 0 This Week
  • 23
    PP3 creates celestial charts. It generates resolution-independent sky maps of very high graphical quality. They can be used, for example, as illustrations in books or on web pages. Databases are included, but you may use your own data if you wish.
    Downloads: 8 This Week
  • 24
    Example Code Manager is an Eclipse plugin for managing sample code and sample data from repositories around the world. It mainly targets Subversion repositories, but support for flat-file or CVS repositories is planned.
    Downloads: 0 This Week
  • 25
    A Web 2.0 environment for collaborative and extensible design of object interconnection diagrams such as UML, networks, circuits and other graph-based diagrams. Client: web-based (AJAX / Dojo Toolkit). Server: JEE application (Struts Framework).
    Downloads: 0 This Week