From: Josep M. F. <jos...@gm...> - 2011-01-05 08:09:16
|
Hi Jimmy. Thanks a lot for the prompt response. >> I was trying to convert an Apertium formatted dictionary into a format >> usable by Freeling. From what I saw in >> <http://wiki.apertium.org/wiki/Freeling> the tool to do that is a script .... <snip> > 'es-tags.parole.txt' will not work. 'oldca' is not part of Apertium, > per se, and uses its own tagset. You will need to supply a mapping of > that tagset to Parole to get usable output. OK, I guess I'll have to ask Mikel Forcada for that tagset. > It's right there, in the file: > # > # This is a conversion script for the format outputted by lt-expand to the > # format accepted by the freeling indexdict program. > # > # Input is an expanded Apertium dictionary: > # > # tadoù:tad<n><m><sg> > # > # Output is a Freeling dictionary: > # > # tadoù tag NCMPV0 > # > # To convert the Apertium tagset into a PAROLE-compatible tagset, a file > # with the parole tag and Apertium tag list is used. The two are separated > # by a tab: > # > # NCMPV0 <n><m><sg> OK, you are right, there is documentation in the comments of the script. I actually had opened the script in my editor and I had had a look at it but with the rush I had not seen the reference to the options (-l, -n etc.) which is what I was wondering about. Sorry. I opened the new version you sent me now and I see that the options are different: Options: -n Skip Proper Names -l Skip entries containing spaces -p Use existing lt-expand output -h Displays this help message In the previous version there was an -m option, wasn't there? OK, I'll try to get the tagset for the dictionary I'm trying to work with and then try the new version of dix-to-maco.py you sent me (thanks!). Josep M. |