This project is an approach for synonym extraction and extending WordNet by the so found synonyms.
The python application is realised as a kind of pipe that starts with a web-corpus-reader which is followed by several workers (tokenizers, lemmatizers, ...) and finally completed by a result writer.

In contrast to the state of the art approaches, this implementation is based on single words found in the web used as a corpus and translated to other languages. If translations of different source words intersect, it is assumed that the source words are synonymous.
Finally, the matches are written into a proprietary file format in conjunction with WordNet synsets (note currently the result writer uses a very trivial method for placing the matches into WordNet and will be modified in the near future)

Features

  • Extracts synonym pairs from the web
  • Inserts found pairs to WordNet synsets

Project Activity

See All Activity >

Categories

Linguistics

License

GNU General Public License version 3.0 (GPLv3)

Follow WebSynonymExtractor

WebSynonymExtractor Web Site

You Might Also Like
Achieve perfect load balancing with a flexible Open Source Load Balancer Icon
Achieve perfect load balancing with a flexible Open Source Load Balancer

Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

Boost application security and continuity with SKUDONET ADC, our Open Source Load Balancer, that maximizes IT infrastructure flexibility. Additionally, save up to $470 K per incident with AI and SKUDONET solutions, further enhancing your organization’s risk management and cost-efficiency strategies.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of WebSynonymExtractor!

Additional Project Details

Intended Audience

Science/Research, Education

User Interface

Command-line

Programming Language

Python

Related Categories

Python Linguistics Software

Registered

2012-09-12