This project is an approach to synonym extraction that extends WordNet with the synonyms it finds.
The Python application is implemented as a pipeline: a web corpus reader feeds several workers (tokenizers, lemmatizers, ...), and a result writer completes the chain.
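
The reader-workers-writer chain can be sketched with Python generators. This is only an illustration, not the project's code: the names read_corpus, tokenize, lemmatize and run_pipeline are hypothetical, an in-memory list stands in for the web corpus, and the lemmatizer is a toy suffix rule rather than a real lemmatizer.

```python
import re
from typing import Callable, Iterable, Iterator, List

# A worker consumes a stream of strings and yields a new stream.
Worker = Callable[[Iterable[str]], Iterator[str]]


def read_corpus(documents: Iterable[str]) -> Iterator[str]:
    """Stand-in for the web corpus reader: yields raw document texts."""
    for document in documents:
        yield document


def tokenize(texts: Iterable[str]) -> Iterator[str]:
    """Split each text into lowercase alphabetic tokens."""
    for text in texts:
        for token in re.findall(r"[A-Za-z]+", text):
            yield token.lower()


def lemmatize(tokens: Iterable[str]) -> Iterator[str]:
    """Toy lemmatizer (assumption): strip a plural 's' from longer words."""
    for token in tokens:
        if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
            yield token[:-1]
        else:
            yield token


def run_pipeline(source: Iterable[str], workers: List[Worker]) -> List[str]:
    """Chain the workers in order and collect what the last one emits."""
    stream: Iterable[str] = source
    for worker in workers:
        stream = worker(stream)
    return list(stream)


if __name__ == "__main__":
    words = run_pipeline(read_corpus(["Cars and houses"]), [tokenize, lemmatize])
    print(words)
```

Because every stage is a generator, documents flow through the chain one item at a time, and a new worker can be added by appending it to the list.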

In contrast to state-of-the-art approaches, this implementation takes single words from the web, which serves as the corpus, and translates them into other languages. If the translations of different source words intersect, the source words are assumed to be synonymous.
Finally, the matches are written to a proprietary file format together with their WordNet synsets. (Note: the result writer currently uses a very trivial method for placing the matches into WordNet; this will be modified in the near future.)
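
The intersection test on translations can be illustrated as follows. This is a minimal sketch under stated assumptions: the translation table is hand-written toy data (no translation service is called), and find_synonym_pairs is a hypothetical helper name, not part of the project.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple

# Toy data (assumption): source words mapped to their German translations.
TOY_TRANSLATIONS: Dict[str, Set[str]] = {
    "car": {"Auto", "Wagen"},
    "automobile": {"Auto", "Kraftfahrzeug"},
    "tree": {"Baum"},
}


def find_synonym_pairs(
    translations: Dict[str, Set[str]],
) -> List[Tuple[str, str, Set[str]]]:
    """Return word pairs whose translation sets share at least one entry."""
    pairs = []
    # Visit every unordered pair of source words once, in alphabetical order.
    for first, second in combinations(sorted(translations), 2):
        shared = translations[first] & translations[second]
        if shared:
            pairs.append((first, second, shared))
    return pairs


if __name__ == "__main__":
    for first, second, shared in find_synonym_pairs(TOY_TRANSLATIONS):
        print(first, second, sorted(shared))
```

Here "car" and "automobile" are paired because both translate to "Auto", while "tree" shares no translation with either word. Comparing all pairs is quadratic in the number of source words, so a larger vocabulary would likely need an inverted index from translation to source words instead.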

Features

  • Extracts synonym pairs from the web
  • Inserts found pairs into WordNet synsets

Categories

Linguistics

License

GNU General Public License version 3.0 (GPLv3)

WebSynonymExtractor Web Site

Additional Project Details

Intended Audience

Education, Science/Research

User Interface

Command-line

Programming Language

Python

Related Categories

Python Linguistics Software

Registered

2012-09-12