Statistical machine translation support toolbox to extract, filter, align and transform text data from multilingual documents into parallel training corpora.
- Workflow manager for parallel text data
- Reusable plug-in architecture
- Modular, configuration-driven, filters
- Media filter graph metaphor
- GIZA++, Moses Decoder, Joshua Decoder compatibility
- extract-tmx-corpus compatibility
Be the first to post a review of CorpusFiltergraph!