Search engine and data mining applications, and ClueWeb datasets.
You can do more!
The IRC's Talking Robot
Contao Open Source CMS (fka TYPOlight)
Simple but advanced should not be an oxymoron.
This is a base PHP framework for making simple websites with clean URLs