Best practices on recommendation systems
A Distributed RESTful Search Engine
Benchmarking synthetic data generation methods
PDP-OmniSim simulating parallel and distributed processing systems
Performance and Productivity at Scale
Data and Text Mining Software for Everyone
DSTK - DataScience ToolKit for All of Us
Virtual Box VDI of SliTaz Linux with Savuka installed and configured
Data Vault loading automation using Pentaho Data Integration.