TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in.
DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en
TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources.
A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources.
CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on.
Here you...
Analyze text. Diagonal read subject, predicate, obj. Search other pdf.
Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX Application, runs with Oracle Java Runtime Environment version 8 that is including JavaFX.
NLP extensions:
- Divide sentences in subclauses: segmentation.
- Divide plain text: subject, predicate, object.
- Count words: stemming.
- Search for similar content: pdf's.
Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic...