Search engine and data mining applications and ClueWeb datasets
DSTK - DataScience ToolKit for All of Us
Open-source, flexible Business Process Management (BPM) in Java
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Development platform for SLUG projects
Keyword search engine for semi-structured data (tables, lists, ...)
Log file parser and viewer