Menu

#250 hunspell truncate a text more than 8190 characters

v1.0 (example)
open
nobody
None
5
2014-06-27
2014-06-27
No

Hi everyone

Hunspell will truncate a text with the length more than 8190 characters. If the text is more than 8190, the hunspell output will have a blank line at the position of 8190, indicating that it has completed processing a text. This caused a problem because it might truncate a real-word around 8190 and regards it as misspell. Not only that, when it processes the remaining of the text (that is, the text after position 8190), the offset of a misspell will be reset. It would count from the beginning of the remaining text, not from the beginning of the original text.

The example attached shows the hunspell detects a misspell at 8189 but a word is actually being truncated. After that, it prints a blank line, indicating it has completed the process. After this, if it detects another misspell, the offset will be starting over again.

Thanks!

1 Attachments

Discussion