This project will produce a set of machine measures of text document similarity. A measure of document similarity quantifies the degree to which two text documents are related.


http://text-similarity.sourceforge.net





Separate each tag with a space.

Release Date:

2008-11-15

Topics:

License:

Ratings and Reviews

Be the first to post a text review of Text::Similarity. Rate and review a project by clicking thumbs up or thumbs down in the right column.

Project Feed

  • Text-Similarity v0.07 released!

    We are pleased to announce the release of version 0.07 of Text-Similarity. This package allows you to measure the similarity between two text documents using a variety of measures. This release includes a single fix to a test case that had caused problems with Windows installations. Please see the change log for more details. http://text-similarity.sourceforge.net

    posted by tpederse 358 days ago

  • text-similarity Text-Similarity-0.07 file released: Text-Similarity-0.07.tar.gz

    0.07 Released November 14, 2008 (all changes by TDP) * Changed test case that was tripping up Windows. In Linux these are treated as being the same (when order doesn't matter) but this is not the case in Windows. 'sir winston churchill' 'winston churchill SIR!!!' The case has been changed to : 'sir winston churchill' 'winston churchill sir'

    posted 358 days ago

  • File released: /text-similarity/Text-Similarity-0.07/Text-Similarity-0.07.tar.gz

    posted 358 days ago

  • Code committed

    tpederse committed patchset 110 of module Text-Similarity to the Text::Similarity CVS repository, changing 1 files

    posted by tpederse 359 days ago

  • Code committed

    tpederse committed patchset 109 of module Text-Similarity to the Text::Similarity CVS repository, changing 4 files

    posted by tpederse 359 days ago

  • Text-Similarity version 0.06 released!

    We are pleased to announce the release of version 0.06 of Text-Similarity! This package now includes support for the lesk measure and the Dice Coefficient. Please see http://text-similarity.sourceforge.net for additional details.

    posted by tpederse 582 days ago

  • text-similarity Text-Similarity-0.06 file released: Text-Similarity-0.06.tar.gz

    0.06 Released April 6, 2008 (all changes by TDP) * Added Dice coefficient to Overlaps.pm output. Dice is equivalent to F-measure, but formulated slightly differently so could be useful to catch errors. * Modified Overlaps method to provide lesk text matching score, that is the sum of the squared lengths of all phrasal matches (optionally normalized by the product of the lengths of the strings). It provides both Raw lesk and lesk (the normalized form) when run in verbose mode. * Reogranized some documentation to make it more clear that Overlaps is just one possible way of measuring similarity, and that other methods can and should be added. * Renamed text_compare.pl as the more natural and fitting text_similarity.pl

    posted 582 days ago

  • File released: /text-similarity/Text-Similarity-0.06/Text-Similarity-0.06.tar.gz

    posted 582 days ago

  • Code committed

    tpederse committed patchset 108 of module Text-Similarity to the Text::Similarity CVS repository, changing 4 files

    posted by tpederse 582 days ago

  • Code committed

    tpederse committed patchset 107 of module Text-Similarity to the Text::Similarity CVS repository, changing 1 files

    posted by tpederse 582 days ago

Rate and Review

Be the first person to add a text review.

Would you recommend this project?






<

Related Projects

Text::Similarity Actions

Thanks for your rating!

Would you also like to write a review?





Skip Review