Hi Ellen,

I'm no expert, but creating a good spellchecker is not always as straightforward as one might think.

It's sometimes good to omit words that are technically correct, because when they show up in text they are usually a mistake, a typo for some other word. This increases the spellchecker's "detection rate" (its ability to detect actual misspellings). Of course, at the same time it increases the spellchecker's "false positive" rate (flagging a correctly spelled word as misspelled), so one has to weigh the advantages against the disadvantages and make a decision.
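
To make the trade-off concrete, here is a rough sketch in Python. The sample data and the rates() helper are made up for illustration and have nothing to do with how Hunspell works internally; the point is only that dropping a word from the dictionary can raise both rates at once.

```python
def rates(dictionary, samples):
    """Return (detection_rate, false_positive_rate) for a word list.

    samples is a list of (typed, intended) pairs. A pair is a real
    misspelling when typed != intended.
    """
    errors = [typed for typed, intended in samples if typed != intended]
    correct = [typed for typed, intended in samples if typed == intended]
    # A misspelling is detected when the typed form is not in the dictionary.
    detected = sum(1 for typed in errors if typed not in dictionary)
    # A false positive is a correctly typed word that gets flagged anyway.
    false_pos = sum(1 for typed in correct if typed not in dictionary)
    return detected / len(errors), false_pos / len(correct)


# Hypothetical samples: "i" is usually a mistyped "I", but once it is intended.
samples = [
    ("i", "I"), ("i", "I"), ("i", "I"), ("teh", "the"),   # real typos
    ("i", "i"), ("I", "I"), ("the", "the"), ("cat", "cat"),  # typed as intended
]

with_i = {"i", "I", "the", "cat"}
without_i = {"I", "the", "cat"}

print(rates(with_i, samples))     # (0.25, 0.0)
print(rates(without_i, samples))  # (1.0, 0.25)
```

With these invented numbers, leaving "i" out catches every typo but also flags the one legitimate lower case "i".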

Consider the single letter "i". Sure, it could be accepted by the spellchecker, but this would mean that it wouldn't recognize the very common typo (lower case i as pronoun) and suggest the correct capitalized form "I". In this case it's far more beneficial to mark lower case i as incorrect.
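
As a toy illustration of the "i" case (again in Python, and only my own simplified model of a case-sensitive word list, not Hunspell's actual lookup or suggestion algorithm): if only "I" is in the list, the lower case form gets flagged and the capitalized form can be offered as a suggestion.

```python
def check(word, dictionary):
    """Return (is_correct, suggestions) using a case-sensitive lookup."""
    if word in dictionary:
        return True, []
    # Offer differently cased variants that the dictionary does contain.
    suggestions = []
    for candidate in (word.upper(), word.capitalize()):
        if candidate in dictionary and candidate not in suggestions:
            suggestions.append(candidate)
    return False, suggestions


# Hypothetical word list that deliberately omits lower case "i".
words = {"I", "a", "the"}

print(check("i", words))  # (False, ['I'])
print(check("I", words))  # (True, [])
```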

There are ways around this in other spelling engines (such as autocorrecting, giving the user a warning through a green underline etc.), but I don't think there's a good solution to this using Hunspell only.

It's also good to use statistics to rule out words that are uncommon or jargon. Perhaps that's why "vi" isn't in the dictionary file?
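
I don't know how the EN_GB word list was actually compiled, so the following is only a guess at what such a statistical filter could look like: count how often each word occurs in some corpus and keep those above a threshold. The token list and the min_count value are invented for the example.

```python
from collections import Counter


def frequent_words(tokens, min_count):
    """Keep only the words that occur at least min_count times."""
    counts = Counter(tokens)
    return {word for word, n in counts.items() if n >= min_count}


# Hypothetical corpus in which "vi" is rare compared to everyday words.
tokens = ["the", "the", "the", "is", "is", "vi"]

print(sorted(frequent_words(tokens, min_count=2)))  # ['is', 'the']
```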

/Sahand

On 26 Jun 2013 12:39, "Ellen Johnson" <E.Johnson@pilz.ie> wrote:

I see some inconsistencies in the results using the EN_GB dictionary.

For example, some single-letter words are not marked as misspelled: b, c, d, e, f, g, h, k, l, n, p, q, r, s, t, v, x, y, z.
But others are: i, j, m, o, u, w.

Some two-letter words are not marked as misspelled: cc, iv.
But vi is marked as misspelled.

Is this a bug?
If so, I would like to raise it, but I don't see how to do that from the web-site.


Ellen Johnson
Software Development

PILZ Ireland Industrial Automation,
Business & Technology Park, Model Farm Road, Cork, Ireland.
Telephone +353 (021)  434-6535
E-Mail:
mailto:e.johnson@pilz.ie, Internet: http://www.pilz.ie

This email is intended solely for the use of the named addressee(s). Any unauthorised disclosure, copying or distribution of the confidential information contained therein, or the taking of any action based on it, is prohibited. The sender disclaims any liability for the integrity of this email. Legally binding declarations must be in written form.


_______________________________________________
Hunspell-devel mailing list
Hunspell-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/hunspell-devel