Hi,
Sorry its taken a while to get back to this. Part of my reluctance to answer this is that the question is a vague and the answer potentially long.
There are separate models for each type of processing, sentence detection, pos-tagging, etc. You would need to figure out which types of processing you need, find training data, and then build your models off of this training data. Check out the "Training the Tools" section of the README for pointer to the classes used for training specific models.
Most of them already support specifying an encoding which you would need for non-ascii text. Hope this helps...Tom
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I want to build a model for the Italian.
What steps should I do? There is a guide that you can advise me?
Thanks!
Antonio Bruno
Hi,
Sorry its taken a while to get back to this. Part of my reluctance to answer this is that the question is a vague and the answer potentially long.
There are separate models for each type of processing, sentence detection, pos-tagging, etc. You would need to figure out which types of processing you need, find training data, and then build your models off of this training data. Check out the "Training the Tools" section of the README for pointer to the classes used for training specific models.
Most of them already support specifying an encoding which you would need for non-ascii text. Hope this helps...Tom