LanguageTool requires Lucene 5.2.1 and we can't have two different versions, so for [#970] we must upgrade our tokenizers to 5.2.1 as well.
Items of note:
Version API is no longer available, so it is no longer possible to specify a "tokenizer behavior" as in the past. The pulldown menus in Project Properties as well as the command line switches have been removed.LuceneGermanTokenizer had an older behavior specified as default; this behavior has been reimplemented as a custom analyzer.LuceneChineseTokenizer (this was never any better than the language-agnostic behavior)Snowball*TokenizersTinySegmenterTokenizer (this was always worse than LuceneJapaneseTokenizer)LuceneKoreanTokenizer (this was always broken and is incompatible with Lucene 5)
Implemented in the released version 4.0 of OmegaT.
Didier