SenseRelate uses measures of semantic similarity to perform word sense disambiguation. AllWords assigns a sense to each word in a text, TargetWord assigns a sense to a given target word, and WordToSet assigns a word the sense that is most related to a given set of words.
We are pleased to announce the release of version 0.19 of WordNet::SenseRelate::AllWords. This system allows you to assign meanings to words in context based on information found in the lexical database WordNet. This release includes a number of significant changes that improve overall performance. Please check out the details at http://senserelate.sourceforge.net
NAME

CHANGES - Revision history for WordNet::SenseRelate::AllWords

DESCRIPTION

0.19
Date : May 27, 2009

1) Added --backoff option to wsd.pl. This option backs off to WordNet sense 1 if the measure can't assign any sense.

2) semcor-reformat.pl previously used querySense, so the reformatted text contained the first word in the synset instead of the original word. It no longer uses querySense, and the formatted text contains the original semcor lemmas.

3) Added scripts utils/extract-semcor-plaintext.pl, utils/extract-semcor-contentwords.pl and utils/convert-PENN-to-WN.pl.

   utils/extract-semcor-plaintext.pl extracts plain text from a semcor formatted file. The text contains function words, content words as well as punctuation marks. This text is used for part-of-speech tagging.

   utils/extract-semcor-contentwords.pl extracts content words given an answer file (typically a plain text file extracted using extract-semcor-plaintext.pl which has been tagged using a part-of-speech tagger) and a key file extracted using the extract-semcor-plaintext.pl --key option.

   utils/convert-PENN-to-WN.pl takes Penn Treebank tagged text (format: word PENNPOS per line) and converts it to WordNet tagged text.

4) Added default config files web/cgi-bin/allwords/user_data/lesk-stoplist.conf and web/cgi-bin/allwords/user_data/vector-stoplist.conf to the web folder. This is important because the lesk and vector measures give unexpected results if the default stoplists are not used.

5) Changed scorer2-format.pl to format the string only if it is in word#pos#sense format.
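To make items 1, 3 and 5 more concrete, here is a minimal Python sketch of the three ideas: mapping Penn Treebank tags to WordNet parts of speech, backing off to sense 1, and checking for the word#pos#sense format. It is an illustration only and not the code of the distributed Perl scripts. All function names are hypothetical, the tag mapping is the commonly used coarse one (NN* to n, VB* to v, JJ* to a, RB* to r), and the word#pos output form for tagged text is an assumption.

```python
import re

# Assumed coarse mapping from Penn Treebank tag prefixes to WordNet POS letters.
# The actual table used by utils/convert-PENN-to-WN.pl may differ.
PENN_PREFIX_TO_WN = {"NN": "n", "VB": "v", "JJ": "a", "RB": "r"}


def penn_to_wordnet(tag):
    """Return the WordNet POS letter for a Penn tag, or None for function words."""
    return PENN_PREFIX_TO_WN.get(tag[:2].upper())


def convert_line(line):
    """Convert one 'word PENNPOS' line to 'word#pos' (assumed output form).

    Words whose tag has no WordNet counterpart are returned unchanged;
    lines that are not exactly two fields yield None.
    """
    parts = line.split()
    if len(parts) != 2:
        return None
    word, tag = parts
    pos = penn_to_wordnet(tag)
    return f"{word}#{pos}" if pos else word


# Matches tokens such as 'bank#n#2': word, WordNet POS letter, sense number.
SENSE_TAGGED = re.compile(r"^[^#\s]+#[nvar]#\d+$")


def is_sense_tagged(token):
    """True only if the token is in word#pos#sense format (cf. item 5)."""
    return bool(SENSE_TAGGED.match(token))


def with_backoff(word, pos, assigned_sense):
    """Use the assigned sense if there is one, else back off to sense 1 (cf. item 1)."""
    sense = assigned_sense if assigned_sense is not None else 1
    return f"{word}#{pos}#{sense}"


if __name__ == "__main__":
    for line in ["The DT", "banks NNS", "flooded VBD", "quickly RB"]:
        print(convert_line(line))
    print(with_backoff("bank", "n", None))
```

Backing off to sense 1 is a common baseline because WordNet generally lists the most frequent sense of a word first, so it is a reasonable default when a measure returns no score.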
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use