Aspell global config overriding wordlist datafile setting
Status: Beta
Brought to you by:
kevina
When aspell creates a wordlist from a cwl file encoded differently than the default encoding, set in /etc/aspell/aspell.config, the information about the encoding, declared in the .dat file, is overridden by the configuration file setting.
The results are that on a system with the encoding set to utf-8 in the global configuration file, the command
/usr/bin/word-list-compress d < is.cwl | /usr/bin/aspell --lang=is create master ./is.rws
acting on a iso-8859-1 .cwl file, returns an error of the form:
Warning: The string "Barðadóttir" is invalid. Invalid UTF-8 sequence at position 4. Skipping string.
for each non-utf-8 string in it.
A workaround is to remove the configuration setting from the configuration file.
Logged In: YES
user_id=6591
Originator: NO
I will try to have this fixed for Aspell 0.61. If the changes are minor I might backport the fix to the Aspell 0.60 branch.
This issue has moved to GitHub: https://github.com/GNUAspell/aspell/issues/404