Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools.
Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB.
Download 'uplug-main' first and then add other packages.
License
GNU General Public License version 3.0 (GPLv3)Follow Uplug corpus tools
Other Useful Business Software
Discover HPCC Systems - the truly open source big data solution that allows you to quickly process, analyze and understand large data sets, even data stored in massive, mixed-schema data lakes. Designed by data scientists, HPCC systems is a complete integrated solution from data ingestion and data processing to data delivery. The free online introductory courses and a robust developer community allow you to get started quickly.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Uplug corpus tools!