JActor is a high-performance actor framework designed to make vertical scalability easy. It achieves its speed in part by executing synchronously in almost all cases, though it is easy to force asynchronous processing when needed.
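The idea of running synchronously by default and going asynchronous only on request can be sketched in plain Java. This is a minimal illustration built on the standard library and does not use JActor's actual API; the class `SyncFirstDispatcher`, its `send` method, and the `forceAsync` flag are hypothetical names chosen for this example.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Function;

// Hypothetical illustration; none of these names come from JActor.
final class SyncFirstDispatcher {
    // Worker thread used only when the caller forces asynchronous processing.
    private final ExecutorService worker = Executors.newSingleThreadExecutor();

    // Runs the handler on the caller's thread unless forceAsync is set.
    <Q, R> CompletableFuture<R> send(Q request, Function<Q, R> handler, boolean forceAsync) {
        if (!forceAsync) {
            // Synchronous path: no queueing and no thread hand-off.
            return CompletableFuture.completedFuture(handler.apply(request));
        }
        // Asynchronous path: hand the request to the worker thread.
        return CompletableFuture.supplyAsync(() -> handler.apply(request), worker);
    }

    // Stops the worker thread so the JVM can exit.
    void shutdown() {
        worker.shutdown();
    }

    public static void main(String[] args) {
        SyncFirstDispatcher dispatcher = new SyncFirstDispatcher();
        Function<String, String> whoRanMe =
                msg -> msg + " handled on " + Thread.currentThread().getName();
        // First call runs on the calling thread, second on the worker thread.
        System.out.println(dispatcher.send("ping", whoRanMe, false).join());
        System.out.println(dispatcher.send("ping", whoRanMe, true).join());
        dispatcher.shutdown();
    }
}
```

In this sketch the synchronous path avoids a thread hand-off, which is one plausible reason a synchronous-first design can be fast; JActor's own mechanism may differ.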
JActor is divided into several GitHub projects:
* JActor - https://github.com/laforge49/JActor
* JID - https://github.com/laforge49/JID
* JFile - https://github.com/laforge49/JFile
* JANetty - https://github.com/kumarshantanu/JANetty
Web-as-corpus tools in Java.
* Simple crawler (with Nutch and Heritrix integration)
* HTML cleaner to remove boilerplate code
* Language recognition
* Corpus builder