**CODE MOVED TO GITHUB: https://github.com/bitextor **

Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow Bitextor

Bitextor Web Site

Other Useful Business Software
$300 in Free Credit Towards Top Cloud Services Icon
$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
Get Started
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Bitextor!

Additional Project Details

Operating Systems

Linux

Intended Audience

Science/Research

User Interface

Command-line

Programming Language

C++

Related Categories

C++ XML Software, C++ HTML XHTML, C++ Information Analysis Software

Registered

2006-03-16