Showing 22 open source projects for "data quality"

  • 1
    Book2_Beauty-of-Data-Visualization

    Machine Learning, Criticism and Correction

    ...By combining theory with hands-on plotting exercises, the book helps readers build both analytical and presentation skills. Overall, it is intended as a foundational guide for anyone seeking to produce professional-quality data visualizations.
    Downloads: 0 This Week
  • 2
    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 3 This Week
  • 3
    Scientific Visualization

    An open access book on scientific visualization using python

    The Scientific Visualization book is a freely available open-access textbook that introduces how to produce effective scientific visualizations using Python, focusing especially on leveraging the popular plotting library Matplotlib (and related tools). It goes beyond simple plotting tutorials and emphasizes design principles: how to choose colors, layout subplots, annotate graphs, and present data in a way that is both accurate and visually compelling. As such, it serves as a guide for researchers, data scientists, and academic authors who need to create publication-quality figures or explanatory graphics, rather than quick exploratory plots. It includes extensive examples that demonstrate best practices — for instance handling multiple subplots, combining line plots with scatter/density overlays, or rendering high-resolution vector graphics for print.
    Downloads: 0 This Week
  • 4
    Pinyin

    A high-quality solution for converting Chinese to Pinyin

    A Chinese-to-Pinyin conversion tool based on the CC-CEDICT dictionary, with more accurate handling of polyphonic characters. Memory type: suited to servers with plenty of memory; its advantage is fast conversion. Small memory type (default): suited to memory-constrained environments; its advantage is a small memory footprint, though conversion is slower than with the memory type. I/O type: suited to virtual machines with strict memory restrictions. Advantages: very minimal...
    Downloads: 0 This Week
  • 5
    List of Free Learning Resources

    Freely available programming books

    ...Maintained by the community, it organizes materials by topic, language, and skill level, making it easy to discover learning resources. The repository includes content on software development, computer science, data science, and more. It is continuously updated with new resources contributed by developers worldwide. The project emphasizes accessibility and open education, providing high-quality materials without cost. It serves as a central hub for self-learners and professionals alike. Its structured organization makes it a widely used reference for learning programming.
    Downloads: 1 This Week
  • 6
    LLM Course

    Course to get into Large Language Models (LLMs)

    ...The materials also cover inference optimization and quantization to make serving LLMs feasible on commodity GPUs or even CPUs, which is crucial for side projects and startups. Evaluation is treated as a first-class topic, with examples of automatic and human-in-the-loop methods to catch regressions and verify quality beyond simple loss values. By the end, students have a mental model and a practical toolkit for iterating on datasets, training configs, etc.
    Downloads: 0 This Week
  • 7
    AI Researcher

    An autonomous AI researcher

    ...Each agent operates with clear roles — such as researcher, analyst, and summarizer — and they communicate through a task-management interface that ensures progress tracking and iterative refinement. The system emphasizes modularity, so teams can swap in new reasoning modules, data retrieval strategies, or domain knowledge bases depending on the research topic. Through self-supervised feedback loops, agents adjust their strategies based on prior outcomes, improving both the quality and relevance of results over time.
    Downloads: 0 This Week
  • 8
    MathGL

    A library for scientific data visualization

    A free cross-platform library of fast C++ routines for plotting up to 3-ranged data. It can export to bitmap and vector EPS/SVG files. There are window interfaces (GLUT/FLTK/Qt) and console tools. MathGL can be used from C/Fortran/Python/Octave/Lua.
    Downloads: 53 This Week
  • 9
    Statistics101 - Resampling Statistics

    Use simulation to perform statistical analyses.

    Statistics101 is an Integrated Development Environment (IDE) that uses a simple, powerful language called “Resampling Stats” to develop Monte Carlo programs to analyze and solve statistical problems. The original Resampling Stats language and computer program were developed by Dr. Julian Simon (https://www.juliansimon.com/) and Peter Bruce (https://www.scientificamerican.com/author/peter-bruce/) as a new way to teach Statistics to social science students. Of course, social science students...
    Downloads: 11 This Week
  • 10
    eBook

    LaTeX book with examples, open-source eBook

    LaTeX can be a challenging system to learn, but this book of varied examples makes it more interesting and easier to pick up. The guide covers many of the essentials in a clear and concise style, showing each result alongside its code. It is also open source and freely available, which encourages collaborative learning. In this book I try to show how you can find the pieces of TeX code you need by looking only at finished work. If you want to create...
    Downloads: 0 This Week
  • 11
    AllenNLP

    An open-source NLP research library, built on PyTorch

    AllenNLP makes it easy to design and evaluate new deep learning models for nearly any NLP problem, along with the infrastructure to easily run them in the cloud or on your laptop. AllenNLP includes reference implementations of high quality models for both core NLP problems (e.g. semantic role labeling) and NLP applications (e.g. textual entailment). AllenNLP supports loading "plugins" dynamically. A plugin is just a Python package that provides custom registered classes or additional...
    Downloads: 0 This Week
  • 12
    TemaTres: controlled vocabulary server

    Manage, Publish and Share Ontologies, Taxonomies, Thesauri, Glossaries

    Web application for managing formal representations of knowledge, thesauri, taxonomies, and multilingual vocabularies. For the latest version of the code: https://github.com/tematres/TemaTres-Vocabulary-Server
    Downloads: 6 This Week
  • 13
    TESTIMAGES

    Testing images for scientific purposes

    ...Please cite the following papers when using any image in this archive: * ASUNI N, GIACHETTI A, "TESTIMAGES: A Large Data Archive For Display and Algorithm Testing", Journal of Graphics Tools, Volume 17, Issue 4, 2015, pages 113-125, DOI:10.1080/2165347X.2015.1024298 * ASUNI N, GIACHETTI A, "TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms", STAG - Smart Tools & Apps for Graphics Conference, 2014.
    Downloads: 169 This Week
  • 14
    Scripture

    Biblical Data-Research Tool

    The real is finite; the irreal is infinite. The unreal is bigger than infinite. Real is the physical; irreal is the possible. Unreal is the incoherent, useless, meaningless, insane. All religions offend me: when someone tries to sell me their delusions; when someone says I can't have morals without religion; when someone doesn't hold others to their level just because of an Iron Age book; when they refuse to understand why faith is harmful; when they kill and hate for their imaginary gods....
    Downloads: 0 This Week
  • 15
    Semtinel is an application framework for the semi-automatic creation, maintenance and analysis of hierarchical concept schemes (thesaurus, classification or ontology). Semtinel supports the development of new analysis methods and visualization techniques
    Downloads: 0 This Week
  • 16
    Pamvotis is a wireless LAN simulator for all the current physical layer extensions of the IEEE 802.11 standard (IEEE 802.11a/b/g) and for the IEEE 802.11e draft for Quality of Service in WLANs.
    Downloads: 0 This Week
  • 17
    This program is a graphical visualization of various engineering concepts, such as antenna radiation patterns (3D full and sectional views), electron bunching in a two-cavity klystron amplifier, and many more.
    Downloads: 0 This Week
  • 18
    Cinege aims to be the ultimate library (management) system, one day. Currently, it is a very nice tool for regexp-based filtering of bibliographic and item data and for finding various bugs in your existing database.
    Downloads: 0 This Week
  • 19
    Yabman is a tool for managing bibliographic references. Its key features are a quality user interface, a carefully designed data model, and sophisticated three-state hierarchical reference labeling. It is currently usable but in a pre-alpha stage.
    Downloads: 0 This Week
  • 20
    Example Code Manager is an Eclipse plugin for managing sample code and sample data from repositories around the world. It mainly targets Subversion repositories, but support for flat-file and CVS repositories is planned.
    Downloads: 0 This Week
  • 21
    Web 2.0 environment for collaborative and extensible design of object interconnection diagrams such as UML, networks, circuits, and other graph-based diagrams. Client: web-based (AJAX / Dojo Toolkit). Server: JEE application (Struts Framework).
    Downloads: 0 This Week
  • 22
    TouchFlowPro

    Elite typing engine with fatigue-aware analytics for professionals.

    TouchFlow Pro is a high-performance typing training platform designed for Medical, Legal, and Tech professionals. Breaking the "OK Plateau" requires more than games; it requires data-driven drills, kinesthetic memory encoding, and real-time fatigue monitoring. Our engine tracks your burst-to-drift ratio to detect cognitive fatigue before your error rate spikes, ensuring every minute of practice is high-quality. Perfect for specialists who rely on their hands for their livelihood.
    Downloads: 0 This Week