A Distributed RESTful Search Engine
Frouros is an open-source Python library for drift detection
Serial/TCP terminal: ANSI color, logging, HEX input, & XLSX docs.
Distributed web crawler admin platform for spiders management
Search engine and data mining applications and ClueWeb datasets.
Educational Python web scraping case collection for many sites
DSTK - DataScience ToolKit for All of Us
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Squid log data warehouse
Log collector for FortiGate units (v4 MR3)
Integrated to system status data based on the HDFS
Hadoop, Hbase, HBase Web Client, Flume based log analytics system
China School Bus Data Analysis model
keyword search engine for semi-structured data (Tables, lists,...)