From: Finn G. L. <fi...@gr...> - 2009-11-11 10:11:22
Hi,

I am new to this mailing list and to LanguageTool. My name is Finn Gruwier Larsen, and I am a Danish open source developer. We have an initial implementation of LanguageTool for Danish, but it needs a lot more work to become useful.

I am curious to know a bit more about how LanguageTool works. According to the documentation, LT first tokenizes the text (divides it into words and sentences) and then analyzes word class and inflection. How does it perform the word class and inflection analysis? I know that Hunspell can do that. Does LT use the Hunspell API here, or does it have its own analysis mechanism?

One of the reasons I am interested in this is that I think tokenization and word class and inflection analysis are interesting in a broader perspective: not just for grammar checking, but also for other kinds of text analysis as well as for language learning tools. I have made some extensions for OOo in these areas, and I wonder if some parts of LT could be reused there.

Regards,
Finn Gruwier Larsen