This program is made to find the semantic similarities between the sentences, according to categories of their words. It is an enhancement of the Vector-Space analysis found withing the Classifier4j, which does not take into account the semantic meanings of the words. Furthermore, the Vector-Space analysis of the Classifier4J does not work well with the short sentences, while this enhancement does. A new dictionary of categories based on the EOWL list of words was developed, while the categories for each word from the DISCO's semantics were calculated.
The result is a tool that is more stable and several gigabytes smaller than DISCO, yet more powerful than the Classifier4j's Vector-Space analysis.
PLEASE BE AWARE: There are 98,376 words in a collection, and each word has the unique directory and a TXT file, which is great for JAVA's speed.
However, this may cause some of the developing environments such as Eclipse or Netbeans to slow-down, freeze or crash due to overload.
Be the first to post a review of Calculate Semantic Similarity!