KH Coder is a free software for quantitative content analysis or text data mining. It is also utilized for computational linguistics. You can analyze Japanese, English, French, German, Italian, Portuguese and Spanish text with KH Coder. Also, Catalan, Chinese (simplified), Korean, Russian and Slovenian language data can be analyzed with the latest alpha release (Version 3).
KH Coder provides various kinds of search and statistical analysis functions using back-end tools such as Stanford POS Tagger, FreeLing, Snowball stemmer, MySQL and R.
- Words: Frequency List
- Words: Searching
- Words: KWIC Concordance
- Words: Collocation Stats
- Words: Correspondence Analysis
- Words: Multi-Dimensional Scaling
- Words: Co-Occurrence Network
- Words: Hierarchical Cluster Analysis
- Categories: Developing Your Own Categories or Dictionaries
- Categories: Frequency List
- Categories: Cross Tabulation
- Categories: Correspondence Analysis
- Categories: Multi-Dimensional Scaling
- Categories: Co-Occurrence Network
- Categories: Hierarchical Cluster Analysis
- Documents: Searching
- Documents: Clustering
- Documents: Naive Bayes classifier
This may work well for Japanese, but it absolutely chokes on EVERY English-language txt file, no matter what encoding you try. The error will either be a 'BOM' error, or it will exclaim, 'There's something wrong with the POS tagger!' In addition, the interface is clumsy and clunky, straight out of 1998. The whole program has a shaky feel, as if it's about to freeze at any second... and then it does.