A nimble options backtesting library for Python
Clean Jupyter notebooks of outputs, metadata, and empty cells
Scale your Pandas workflows by changing a single line of code
Making DAG construction easier
Synthetic data generators for structured and unstructured text
A python wrapper for Alpha Vantage API for financial data.
Always know what to expect from your data
Project structure for doing and sharing data science work
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
A dedicated app for collecting thousands of POI for OpenStreetMap
airda(Air Data Agent
The standard data-centric AI package for data quality and ML
A simple forecasting package
AutoGluon: AutoML for Image, Text, and Tabular Data
Data science on data without acquiring a copy
Create HTML profiling reports from pandas DataFrame objects
Train machine learning models within Docker containers
Best practices on recommendation systems
Automatically find issues in image datasets
An orchestration platform for the development, production
Python binding to the Apache Tika™ REST services
Python module that helps you build complex pipelines of batch jobs
A curated list of insanely awesome libraries, packages and resources
Benchmarking synthetic data generation methods