Python binding to the Apache Tika™ REST services
Clean Jupyter notebooks of outputs, metadata, and empty cells
Metadata and data identification tool and Python library
Open Source Document Management System for Digital Archives
A high-quality tool for convert PDF to Markdown and JSON
A minimalist command line knowledge base manager
A cross-platform command-line utility that creates projects
CKAN is an open-source DMS for powering data hubs
A modular, high performance, headless e-commerce platform
Yahoo! Finance market data downloader
Production-ready data processing made easy and shareable
An orchestration platform for the development, production
Dataset Management Framework, a Python library and a CLI tool to build
A feature-rich event management system
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Create HTML profiling reports from pandas DataFrame objects
Unified metadata lake for data & AI assets.
re_data - fix data issues before your users & CEO would discover them
Backup and Recovery Manager for PostgreSQL
Open-source metadata collector based on ODD Specification
Open-source GCP metadata collector based on ODD Specification
Simple user interface for gnuplot aimed for reflectometry data
ASCII Art Phase of the Moon (Python version)
We are building an open database of COVID-19 cases with chest X-ray
Open Data Profiling, Quality and Analysis on NYC OpenData dataset