KH Coder is a free software for quantitative content analysis or text data mining. It is also utilized for computational linguistics. You can analyze Japanese, English, French, German, Italian, Portuguese and Spanish text with KH Coder. Also, Catalan, Chinese (simplified), Korean, Russian and Slovenian language data can be analyzed with the latest alpha version.
KH Coder provides various kinds of search and statistical analysis functions using back-end tools such as Stanford POS Tagger, FreeLing, Snowball stemmer, MySQL and R.
- Words: Frequency List
- Words: Searching
- Words: KWIC Concordance
- Words: Collocation Stats
- Words: Correspondence Analysis
- Words: Multi-Dimensional Scaling
- Words: Co-Occurrence Network
- Words: Hierarchical Cluster Analysis
- Categories: Developing Your Own Categories or Dictionaries
- Categories: Frequency List
- Categories: Cross Tabulation
- Categories: Correspondence Analysis
- Categories: Multi-Dimensional Scaling
- Categories: Co-Occurrence Network
- Categories: Hierarchical Cluster Analysis
- Documents: Searching
- Documents: Clustering
- Documents: Naive Bayes classifier
This is an excellent software, particularly for Japanese, a language that is not readily covered by most text analytics softwares, even highly priced. The interface is quite old-fashioned but that does not matter as far as the functionality is here. Some complained about BOM errors, but this is quite noemal since the engine would not anallyze the byte order mark (a code preceding texts, proper to Unicode). Now, this is not exctly a newbie's toy. it requires a minimal knowledge in text analysis and the logic it is designed. It makes use of R modules which is doing most of the analytical job. The drawback is that it may be stuck if you are trying to analyze a very very long text. Also, the not-free Mac installer is disturbing. it does not provide any auto update function, so that you have to go to the "webstore" ech and every time you need or want to apgrade it to the latest version. They shoud make the instaler much more user-friendly, since they are asking for money for that, even if it is not too expensive...