The Data Productivity Toolkit is a collection of linux command-line tools designed to facilitate the analysis of text-based data sets. Modeled after the general linux pipeline tools such as awk, grep, and sed, the kit provides powerful tools for selecting/combining data, performing statistics, and visualizing results. The tools are all written in python and in many instances provide a command-line API to basic python and numpy/scipy/matplotlib routines.
Follow Data Productivity Toolkit
You Might Also Like
Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
Rate This Project
Login To Rate This Project
User Reviews
-
I am the author of this project, so of course I give it five stars. :) In all seriousness though, in the seven years or so since I wrote this, the data tooling for python has improved immensely. I have written a far more capable project to replace this one. It is called Pandashells and you can find it on github. If you like this project, you'll really like pandashells
-
very good project, thanks!