A Distributed RESTful Search Engine
Serial/TCP terminal: ANSI color, logging, HEX input, & XLSX docs.
Search engine and data mining applications and ClueWeb datasets.
Educational Python web scraping case collection for many sites
DSTK - DataScience ToolKit for All of Us
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Graphical tool for visualizing changes in web pages
Squid log data warehouse
Log collector for FortiGate units (v4 MR3)
Integrated to system status data based on the HDFS
Hadoop, Hbase, HBase Web Client, Flume based log analytics system
China School Bus Data Analysis model
keyword search engine for semi-structured data (Tables, lists,...)
Log file parser and viewer