The Data Productivity Toolkit is a collection of linux command-line tools designed to facilitate the analysis of text-based data sets. Modeled after the general linux pipeline tools such as awk, grep, and sed, the kit provides powerful tools for selecting/combining data, performing statistics, and visualizing results. The tools are all written in python and in many instances provide a command-line API to basic python and numpy/scipy/matplotlib routines.
Follow Data Productivity Toolkit
Other Useful Business Software
Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
Rate This Project
Login To Rate This Project
User Reviews
-
I am the author of this project, so of course I give it five stars. :) In all seriousness though, in the seven years or so since I wrote this, the data tooling for python has improved immensely. I have written a far more capable project to replace this one. It is called Pandashells and you can find it on github. If you like this project, you'll really like pandashells
-
very good project, thanks!