**CODE MOVED TO GITHUB: https://github.com/bitextor **

Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.

Project Activity

See All Activity >

License

GNU General Public License version 2.0 (GPLv2)

Follow Bitextor

Bitextor Web Site

Other Useful Business Software
Stop vibe-debugging. Icon
Stop vibe-debugging.

Plug Claude into your app's actual errors.

AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
Free 30 days.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Bitextor!

Additional Project Details

Operating Systems

Linux

Intended Audience

Science/Research

User Interface

Command-line

Programming Language

C++

Related Categories

C++ XML Software, C++ HTML XHTML, C++ Information Analysis Software

Registered

2006-03-16