Program wordTabulator is intended for text analysis. It can generate index of word elements extracted from defined text set. Word elements may be words, N-grams or phrases (syntagmes). The program can process texts as in ordinary 1-byte encoding (ANSI), as in multibyte UTF-8 encoding.
Features
- ANSI and UTF-8 encodings
- NCR-codes, HTML-named entities. multi-language texts
- search by regular expressions
- set operations on source texts: subtraction, intersection and union
- morphology module for Russian language
- different formats of output index; true alphabetical ordering
License
GNU General Public License version 2.0 (GPLv2)Follow wordTabulator
Other Useful Business Software
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Rate This Project
Login To Rate This Project
User Reviews
-
Цитата из лицензии "2. Исходный код является собственностью автора и не подлежит модификации путем декомпиляции двоичного кода программных модулей."