I am working for Morphological Hunspell Dictionary for Persian Language. I have faced some problems. Fortunately I could find some workarounds for my issues. but just for one issue still I am working. Let me explain it here.
Verbs in Persian Language have three main parts: Prefix, Stem and Suffix. (PFX+STM+SFX)
The problem is between prefix and stem (or stem and suffix) can be a space, zero-with-non-joiner, or none of them.
look at here. to make it simple I just show you a simple verb with prefix and stem here:
As you can see
Number one is combination of prefix and stem without space.
Number two is combination of prefix and stem with zero-with-non-joiner.
Number three is combination of prefix and stem with a space. (it is a common mistake in written persian language)
I want to know if there is any way to replace space with zwnj (number 2) or remove space (number 1)?
regards,
mostafa
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Please move this discussion to GitHub and make the project in SF read-only.
On 09/22/2015 12:46 PM, Mostafa wrote:
Hi,
I hope I can explain my issue correclty.
I am working for Morphological Hunspell Dictionary for Persian Language.
I have faced some problems. Fortunately I could find some workarounds
for my issues. but just for one issue still I am working. Let me explain
it here.
Verbs in Persian Language have three main parts: Prefix, Stem and
Suffix. (PFX+STM+SFX)
The problem is between prefix and stem (or stem and suffix) can be a
space, zero-with-non-joiner, or none of them.
look at here. to make it simple I just show you a simple verb with
prefix and stem here:
As you can see
Number one is combination of prefix and stem without space.
Number two is combination of prefix and stem with zero-with-non-joiner.
Number three is combination of prefix and stem with a space. (it is a
common mistake in written persian language)
I want to know if there is any way to replace space with zwnj (number 2)
or remove space (number 1)?
I am working for Morphological Hunspell Dictionary for Persian Language.
I have faced some problems. Fortunately I could find some workarounds
for my issues. but just for one issue still I am working. Let me explain
it here.
Verbs in Persian Language have three main parts: Prefix, Stem and
Suffix. (PFX+STM+SFX)
The problem is between prefix and stem (or stem and suffix) can be a
space, zero-with-non-joiner, or none of them.
look at here. to make it simple I just show you a simple verb with
prefix and stem here:
As you can see
Number one is combination of prefix and stem without space.
Number two is combination of prefix and stem with zero-with-non-joiner.
Number three is combination of prefix and stem with a space. (it is a
common mistake in written persian language)
I want to know if there is any way to replace space with zwnj (number 2)
or remove space (number 1)?
Hunspell can check words that contain a space, if you feed them as one
string. Problem is that a text editor does not know what are
collocations and what not, so from there all is passed to Hunspell with
never a space in a word. Se the challenge is more in the location where
you want to do spell checking and not Hunspell, IMHO.
But let me clear my idea with one specified example.
As I understood from the manual (hunspell manual) there is an amazing example:
It can suggest "a lot" (with space) instead of "alot", simplily by this line:
REP alot a_lot
So, my question is: Is there any way for vise versa? I mean to suggest "alot" instead of "a lot";
REP a_lot alot (?)
Regards
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
I hope I can explain my issue correclty.
I am working for Morphological Hunspell Dictionary for Persian Language. I have faced some problems. Fortunately I could find some workarounds for my issues. but just for one issue still I am working. Let me explain it here.
Verbs in Persian Language have three main parts: Prefix, Stem and Suffix. (PFX+STM+SFX)
The problem is between prefix and stem (or stem and suffix) can be a space, zero-with-non-joiner, or none of them.
look at here. to make it simple I just show you a simple verb with prefix and stem here:
As you can see
Number one is combination of prefix and stem without space.
Number two is combination of prefix and stem with zero-with-non-joiner.
Number three is combination of prefix and stem with a space. (it is a common mistake in written persian language)
I want to know if there is any way to replace space with zwnj (number 2) or remove space (number 1)?
regards,
mostafa
Please move this discussion to GitHub and make the project in SF read-only.
On 09/22/2015 12:46 PM, Mostafa wrote:
On 09/22/2015 12:46 PM, Mostafa wrote:
Hunspell can check words that contain a space, if you feed them as one
string. Problem is that a text editor does not know what are
collocations and what not, so from there all is passed to Hunspell with
never a space in a word. Se the challenge is more in the location where
you want to do spell checking and not Hunspell, IMHO.
Dear Pande,
Thanks for your reply.I think I get your point.
But let me clear my idea with one specified example.
As I understood from the manual (hunspell manual) there is an amazing example:
It can suggest "a lot" (with space) instead of "alot", simplily by this line:
REP alot a_lot
So, my question is: Is there any way for vise versa? I mean to suggest "alot" instead of "a lot";
REP a_lot alot (?)
Regards