I'm trying out Uplug on my own server. I have just installed everything, but I'm experiencing some problems with the following step:
3a) Basic pre-processing (text -> xml)
(a text in Swedish and English, encoded in ISO-8859-1 (latin1))
/cvs/uplug systems/pre/basic -ci 'iso-8859-1' -in 1988sv.txt > 1988sv.xml
/cvs/uplug systems/pre/basic -ci 'iso-8859-1' -in 1988en.txt > 1988en.xml
Look at 1988sv.xml and 1988en.xml! Both files are (hopefully
tokenised and marked with basic XML-tags)
I just get an empty file:
anneta@anneta-Latitude-D420:/etc/uplug$ systems/pre/basic -ci 'iso-8859-1' -in /home/anneta/corpus/1988en.txt > /home/anneta/corpus/1988en.xml
Could you provide some more information about the messages you get when running the command? It's hard to tell what went wrong.
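One possible way to collect that information is to keep standard output, standard error and the exit status apart, so that any diagnostics are not lost when the output is redirected into the XML file. The following is only a minimal sketch: `run_logged` is a hypothetical helper name, and the `.err` file name is an assumption, neither is part of Uplug.

```shell
# Hypothetical helper (not part of Uplug): run a command and keep
# stdout, stderr and the exit status separate for later inspection.
# Usage: run_logged OUT_FILE ERR_FILE command [args...]
run_logged() {
    out_file=$1
    err_file=$2
    shift 2
    # stdout goes to the output file, stderr to its own log file
    "$@" > "$out_file" 2> "$err_file"
    rc=$?
    echo "exit status: $rc"
    echo "stdout bytes: $(wc -c < "$out_file" | tr -d ' ')"
    echo "stderr bytes: $(wc -c < "$err_file" | tr -d ' ')"
    return "$rc"
}

# Example with the command from this thread (run from the Uplug directory;
# the 1988en.err file name is an assumption):
# run_logged /home/anneta/corpus/1988en.xml /home/anneta/corpus/1988en.err \
#     systems/pre/basic -ci 'iso-8859-1' -in /home/anneta/corpus/1988en.txt
```

If the error log is also empty, the reported exit status at least shows whether the command considered itself successful.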
By the way, I'm currently working on an update and I will move the code to Bitbucket. There is already a new version at https://bitbucket.org/tiedemann/uplug which you may want to test. Download the package unplug-main-0.03. There will also be language-specific add-ons, which I have just started to upload. Any feedback is welcome!
Thanks for answering!
I did not get any message at all. That's all: just an empty file.
I'm going to test the new one and I'll keep you informed!