Adding an analyzer that splits ligatured letters (æ, œ...) into their components (ae, oe...) would be a plus for linguistically rich contents. It could arguably part of diacritics processing.
Log in to post a comment.