Unicode XML TEI text analysis platform
We describe a simple XML format to share text documents and annotation
JSON based text search Java Project
Classify any two TXT documents, no training required - JAVA
Non-disjoint groupping of Documents based on word sequence approach