This project is an approach to extracting synonyms and extending WordNet with the synonyms found.
The Python application is implemented as a pipeline: a web corpus reader feeds several workers (tokenizers, lemmatizers, ...), and a result writer completes the chain.
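
The pipeline layout described above could look roughly like the following. This is only a minimal sketch, not the project's actual code: the names `read_corpus`, `tokenize`, `lemmatize` and `run_pipeline` are hypothetical, the corpus is a plain list of strings rather than real web pages, and the lemmatizer is a toy stand-in that merely strips a trailing "s".

```python
from typing import Callable, Iterable, Iterator, List

# A stage consumes a stream of strings and yields a new stream of strings.
Stage = Callable[[Iterable[str]], Iterator[str]]


def read_corpus(pages: Iterable[str]) -> Iterator[str]:
    """Hypothetical corpus reader: here it just passes through in-memory texts."""
    for page in pages:
        yield page


def tokenize(texts: Iterable[str]) -> Iterator[str]:
    """Split each text on whitespace, strip punctuation, and lowercase."""
    for text in texts:
        for raw in text.split():
            token = raw.strip(".,;:!?()\"'").lower()
            if token:
                yield token


def lemmatize(tokens: Iterable[str]) -> Iterator[str]:
    """Toy stand-in for a lemmatizer: drops a trailing 's' from longer words."""
    for token in tokens:
        if token.endswith("s") and len(token) > 3:
            yield token[:-1]
        else:
            yield token


def run_pipeline(source: Iterable[str], stages: List[Stage]) -> List[str]:
    """Chain the stages lazily and collect the final stream for the writer."""
    stream: Iterable[str] = source
    for stage in stages:
        stream = stage(stream)
    return list(stream)


if __name__ == "__main__":
    pages = ["Cars are fast.", "A car."]
    print(run_pipeline(read_corpus(pages), [tokenize, lemmatize]))
```

Because every stage is a generator, texts flow through the workers one item at a time, and adding or reordering workers only changes the list passed to `run_pipeline`.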

In contrast to state-of-the-art approaches, this implementation works on single words found on the web, which serves as the corpus, and translates them into other languages. If the translations of different source words intersect, the source words are assumed to be synonymous.
Finally, the matches are written to a proprietary file format together with WordNet synsets. (Note: the result writer currently uses a very trivial method for placing the matches into WordNet; this will be modified in the near future.)
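
The translation-intersection idea can be illustrated with a short sketch. It assumes the translations are already available as a mapping from each source word to a set of translated words; the function name `find_synonym_candidates`, the `min_shared` threshold and the sample data are hypothetical and not taken from the project.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def find_synonym_candidates(
    translations: Dict[str, Set[str]], min_shared: int = 1
) -> List[Tuple[str, str, Set[str]]]:
    """Return word pairs whose translation sets share at least min_shared items."""
    candidates = []
    # Sorting the source words makes the order of the reported pairs deterministic.
    for first, second in combinations(sorted(translations), 2):
        shared = translations[first] & translations[second]
        if len(shared) >= min_shared:
            candidates.append((first, second, shared))
    return candidates


if __name__ == "__main__":
    # Hypothetical English-to-German translations, for illustration only.
    sample = {
        "car": {"Auto", "Wagen"},
        "automobile": {"Auto", "Automobil"},
        "tree": {"Baum"},
    }
    for first, second, shared in find_synonym_candidates(sample):
        print(first, second, sorted(shared))
```

Raising `min_shared` is one possible way to make the check stricter, since a single shared translation may also come from an ambiguous word rather than from true synonymy.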

Features

  • Extracts synonym pairs from the web
  • Inserts the pairs found into WordNet synsets

Categories

Linguistics

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Intended Audience

Science/Research, Education

User Interface

Command-line

Programming Language

Python

Related Categories

Python Linguistics Software

Registered

2012-09-12