Stanford CoreNLP, a Java suite of core NLP tools
Python binding to the Apache Tika™ REST services
Simple cross-platform application to cut and join any text file.
Software tool to subtract the lines of one text file from another.
Simple application to remove duplicate and empty lines from text files.
Download websites as e-books: PDF, TXT, EPUB.