KH Coder is free software for quantitative content analysis or text data mining. It is also used in computational linguistics. You can analyze Japanese, English, French, German, Italian, Portuguese and Spanish text with KH Coder.
KH Coder provides various kinds of search and statistical analysis functions using back-end tools such as Stanford POS Tagger, Snowball stemmer, MySQL and R.
- Words: Frequency List
- Words: Searching
- Words: KWIC Concordance
- Words: Collocation Stats
- Words: Correspondence Analysis
- Words: Multi-Dimensional Scaling
- Words: Co-Occurrence Network
- Words: Hierarchical Cluster Analysis
- Categories: Developing Your Own Categories or Dictionaries
- Categories: Frequency List
- Categories: Cross Tabulation
- Categories: Correspondence Analysis
- Categories: Multi-Dimensional Scaling
- Categories: Co-Occurrence Network
- Categories: Hierarchical Cluster Analysis
- Documents: Searching
- Documents: Clustering
- Documents: Naive Bayes classifier
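To give a rough sense of what two of the word-level features above produce, here is a minimal Python sketch of a frequency list and a KWIC (keyword in context) concordance. It is not KH Coder's code and does not use its back-end tools. The function names `tokenize`, `frequency_list` and `kwic`, the regex tokenizer, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into simple word tokens (illustrative only)."""
    return re.findall(r"[a-z']+", text.lower())

def frequency_list(tokens, top=10):
    """Return the `top` most frequent tokens with their counts."""
    return Counter(tokens).most_common(top)

def kwic(tokens, keyword, width=3):
    """Return (left context, keyword, right context) for every occurrence of keyword."""
    hits = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits

# Hypothetical sample text, not taken from any real corpus.
sample = "The cat sat on the mat. The dog sat on the log."
tokens = tokenize(sample)

print(frequency_list(tokens, top=3))
for left, kw, right in kwic(tokens, "sat", width=2):
    print(f"{left:>12} [{kw}] {right}")
```

A real analysis would add part-of-speech tagging and stemming before counting, which is the role the listing assigns to the Stanford POS Tagger and the Snowball stemmer.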
This may work well for Japanese, but it absolutely chokes on EVERY English-language txt file, no matter what encoding you try. The error will either be a 'BOM' error, or it will exclaim, 'There's something wrong with the POS tagger!' In addition, the interface is clumsy and clunky, straight out of 1998. The whole program has a shaky feel, as if it's about to freeze at any second... and then it does.
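If the 'BOM' error mentioned above comes from a UTF-8 byte order mark at the start of the text file, one thing to try is removing the mark before loading the file. The Python sketch below does only that. Whether it resolves the error in KH Coder is an assumption, not something confirmed here, and the helper names `strip_utf8_bom` and `strip_bom_from_file` are made up for this example.

```python
import codecs

def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 byte order mark, if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data

def strip_bom_from_file(src_path: str, dst_path: str) -> None:
    """Copy src_path to dst_path, dropping a leading UTF-8 BOM (hypothetical helper)."""
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(strip_utf8_bom(data))
```

Writing to a separate output file keeps the original untouched in case the mark was not the cause.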
Such good software for text analysis. The author is also very receptive to questions.
Fascinating and unique piece of software. Not the easiest to install and use, but very rewarding once you get the hang of it.
Works fast. Can manage pretty massive data volume. Highly recommended.