Activity for TBXTools

  • Bahgat Ahmed posted a comment on ticket #6

    Thank you for your answer, Dr. Antoni. But how can I use regular-expression wildcards to shorten or group patterns? Could you provide a simple example of how to use them, since they aren't mentioned in the documentation? I look forward to your response. Best regards, Bahgat Ahmed

  • Antoni Oliver posted a comment on ticket #6

    Hello: In the patterns you should use the same tags as your tagger. If your tagger uses PPER, you should use PPER in the POS patterns. Remember that you can use wildcards from regular expressions to shorten or group patterns. Best regards, Antoni Antoni Oliver González Estudis d'Arts i Humanitats Director del màster en Traducció i tecnologies aoliverg@uoc.edu ResearchGate https://www.researchgate.net/profile/Antoni_Oliver2 / Twitter https://twitter.com/aoliverg?lang=en / Linkedin https://www.linkedin.com/in/antonioliver/...
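As a rough illustration of the wildcard idea in the comment above, the sketch below uses Python's standard `re` module to show how one regular expression can stand in for several POS tags. It is not TBXTools code and does not reproduce the syntax of the TBXTools pattern files. The helper `tags_match` and the Penn Treebank-style tags are assumptions made for this example only; PPER comes from the thread.

```python
import re

def tags_match(pattern, tags):
    """Return True if each tag fully matches the regex in the same position."""
    return len(pattern) == len(tags) and all(
        re.fullmatch(p, t) for p, t in zip(pattern, tags)
    )

# Hypothetical Penn Treebank-style tags; replace them with your tagger's own.
adj_noun = ["JJ", "NN.*"]       # "NN.*" covers NN, NNS, NNP, NNPS
noun_noun = ["NN.*", "NN.*"]    # any noun followed by any noun
pron_or_noun = ["PPER|NN.*"]    # alternation groups two categories

result_a = tags_match(adj_noun, ["JJ", "NNS"])   # adjective + plural noun
result_b = tags_match(noun_noun, ["NN", "VBZ"])  # second tag is a verb
result_c = tags_match(pron_or_noun, ["PPER"])    # pronoun tag from the thread
```

Here `result_a` and `result_c` are true and `result_b` is false. Without the wildcard, the adjective-noun case would need one pattern per noun tag.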

  • Bahgat Ahmed modified a comment on ticket #6

    Thank you for your answer, Dr. Antoni. I am very sorry for my late follow-up question. Do you mean that if the tagger uses a different tagset from the ones mentioned in your code (the TBXTools, Freeling, or CoNLL formats), the POS patterns must be modified? For example, if the tagger tags personal pronouns as "PPER" while the corresponding CoNLL tag is "PRP", should I replace every "PPER" tag with "PRP" for TBXTools to work correctly? So he|he|PPER must become ---> he|he|PRP ?...

  • Bahgat Ahmed posted a comment on ticket #6

    Thank you for your answer, Dr. Antoni. I am very sorry for my late follow-up question. Do you mean that if the tagger uses a different tagset from the ones mentioned in your code (the TBXTools, Freeling, or CoNLL formats), the POS patterns must be modified? For example, if the tagger tags personal pronouns as "PPER" while the corresponding CoNLL tag is "PRP", should I replace every "PPER" tag with "PRP" for TBXTools to work correctly? So he|he|PPER must become ---> he|he|PRP ? Thank...

  • Antoni Oliver posted a comment on ticket #6

    Hello: You can use any tagger, BUT the POS patterns may need to be changed if your tagger uses a different tagset. The tagged corpus should follow the format described: each token is represented as word_form|lemma|tag, and tokens are separated by spaces. Remember that we have moved our repository to Github: https://github.com/aoliverg/TBXTools Best regards, Antoni Oliver Antoni Oliver González Estudis d'Arts i Humanitats Director del màster en Traducció i tecnologies aoliverg@uoc.edu...
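The token format described above (word_form|lemma|tag, with tokens separated by spaces) can be checked with a few lines of Python. This is a minimal sketch and not part of TBXTools: the names `Token` and `parse_tagged_line` are hypothetical, and the sample line reuses the he|he|PRP example from the thread. It assumes no field contains a space or a `|` character.

```python
from typing import List, NamedTuple

class Token(NamedTuple):
    form: str
    lemma: str
    tag: str

def parse_tagged_line(line: str) -> List[Token]:
    """Split a tagged line into tokens, rejecting malformed chunks."""
    tokens = []
    for chunk in line.split():
        parts = chunk.split("|")
        if len(parts) != 3:
            raise ValueError(f"expected word_form|lemma|tag, got {chunk!r}")
        tokens.append(Token(*parts))
    return tokens

# Sample line built from the example in the thread plus one invented token.
tokens = parse_tagged_line("he|he|PRP runs|run|VBZ")
tags = [t.tag for t in tokens]
```

Running a check like this over a corpus before loading it is one way to catch tokens where the tagger output was not converted cleanly.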

  • Bahgat Ahmed modified a comment on ticket #6

    Thank you very much for your answer, Dr. Antoni. I have another question, please. Here are the details: I did what you said. Moreover, I have experimented with different taggers and lemmatizers. I tested them against your ready-made sample tagged corpus "corpus-control-JRC-tagged-eng.txt", and I compared the ratio of true to false terms and the ratio of false terms to all extracted terms. I used the terminology file you provide, "JRC-control-evaluation-terms2g3g-eng.txt", to get the true terms...

  • Bahgat Ahmed posted a comment on ticket #6

    Thank you very much for your answer, Dr. Antoni. I have another question, please. Here are the details: I did what you said. Moreover, I have experimented with different taggers and lemmatizers. I tested them against your ready-made sample tagged corpus "corpus-control-JRC-tagged-eng.txt", and I compared the ratio of true to false terms and the ratio of false terms to all extracted terms. I used the terminology file you provide, "JRC-control-evaluation-terms2g3g-eng.txt", to get the true terms...

  • Antoni Oliver posted a comment on ticket #6

    Hello: Sorry for the delay in my answer. The Freeling API connects to Freeling to tag the text and writes the output in this special format. You can use any tagger and adapt its output to the same format. Please note that POS tags may differ from one tagger to another, so the POS patterns should be changed accordingly. Please also remember that the project has moved to Github, so the latest versions will be available only there: https://github.com/aoliverg/TBXTools Best regards, Antoni
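Adapting another tagger's output, as suggested above, mostly means joining each token's fields with `|` and the tokens with spaces. The sketch below assumes the tagger yields (word form, lemma, tag) triples, which is an assumption and not something the thread specifies. The function name `to_tagged_line` and the sample triples are hypothetical.

```python
from typing import Iterable, Tuple

def to_tagged_line(triples: Iterable[Tuple[str, str, str]]) -> str:
    """Join (form, lemma, tag) triples into one word_form|lemma|tag line."""
    chunks = []
    for form, lemma, tag in triples:
        for field in (form, lemma, tag):
            # A space or "|" inside a field would make the line ambiguous.
            if not field or " " in field or "|" in field:
                raise ValueError(f"field not representable: {field!r}")
        chunks.append(f"{form}|{lemma}|{tag}")
    return " ".join(chunks)

# Invented sample output from some tagger; the tags depend on its tagset.
line = to_tagged_line([("the", "the", "DT"), ("banks", "bank", "NNS")])
```

The tags are written out unchanged, so the POS patterns still have to use the tagger's own tagset.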

  • Bahgat Ahmed created ticket #6

    Freeling API functionalities

  • Mohamed Hady posted a comment on ticket #5

    Thanks Antoni, much appreciated

  • Antoni Oliver posted a comment on ticket #5

    Hello: Sorry for the delay in my answer. I'm currently working on the new version, and I'm moving the repository to Github: https://github.com/aoliverg/TBXTools New versions and the documentation will follow in the coming weeks. Best regards, Antoni Antoni Oliver González Estudis d'Arts i Humanitats Director del màster en Traducció i tecnologies aoliverg@uoc.edu ResearchGate https://www.researchgate.net/profile/Antoni_Oliver2 / Twitter https://twitter.com/aoliverg?lang=en / Linkedin https://www.linkedin.com/in/antonioliver/...

  • Mohamed Hady created ticket #5

    New release

  • Antoni Oliver posted a comment on ticket #3

    Hello: Did you also install the Freeling API? If you experience problems with the connection between TBXTools and Freeling, you can tag your corpus with Freeling, adapt the format of the corpus, and load the tagged corpus directly into TBXTools. I'm afraid I'm not able to help with the current version of TBXTools. I'm about to release a new version, and it will be fully documented. Best regards, Antoni Antoni Oliver González Estudis d'Arts i Humanitats Director del màster en Traducció...

  • Mohamed Hady created ticket #3

    Windows Freeling issue

  • Mohamed Hady modified a comment on ticket #2

    Hello Antoni: I hope you had a wonderful holiday. I already have my own Python 3 environment installed on my computer: Anaconda, which has its own editor, Jupyter Notebook, and can run any Python file.

  • Mohamed Hady posted a comment on ticket #2

    Hello Antoni: I hope you had a wonderful holiday. I already have several interpreters installed on my computer, such as Python 3 and Jupyter Notebook.

  • Antoni Oliver posted a comment on ticket #2

    Hello Mohamed: Sorry for the delay in my answer. I'm on holiday until January 7th. Do you have a Python interpreter installed on your computer? You need a Python 3 interpreter, which you can download for free from www.python.org. Best regards, Antoni Antoni Oliver González Estudis d'Arts i Humanitats Director del màster en Traducció i tecnologies aoliverg@uoc.edu ResearchGate https://www.researchgate.net/profile/Antoni_Oliver2 / Twitter https://twitter.com/aoliverg?lang=en / Linkedin https://www.linkedin.com/in/antonioliver/...

  • Mohamed Hady posted a comment on ticket #2

    Any help, please?

  • Mohamed Hady created ticket #2

    Start using TBXTools

  • TBXTools released /talk-TBXTools-code_examples.zip

  • TBXTools released /2019-01-01-v0.2/TBXTools.py

  • TBXTools released /2019-01-01-v0.2/GNU-GPL.txt

  • TBXTools released /2019-01-01-v0.1/TBXTools.py

  • TBXTools released /2019-01-01-v0.1/GNU-GPL.txt

  • TBXTools released /2015-09-07-v1.0/example/statistical.py

  • TBXTools released /2015-09-07-v1.0/example/statistical.py

  • TBXTools released /2015-09-07-v1.0/example/ECB-1000-tagged-eng.txt

  • TBXTools released /2015-09-07-v1.0/example/linguistic.py

  • TBXTools released /2015-09-07-v1.0/example/patterns-bigrams-eng.txt

  • TBXTools released /2015-09-07-v1.0/example/TBXTools.py

  • TBXTools released /2015-09-07-v1.0/example/experiments-bigrams-lemes-estadistic-byfreq-recall.py

  • TBXTools released /2015-09-07-v1.0/example/stop-eng.txt

  • TBXTools released /2015-09-07-v1.0/example/ECB-termes-bigrams-lemes.txt

  • TBXTools released /2015-09-07-v1.0/example/morphopatterns-eng.txt

  • TBXTools released /2015-09-07-v1.0/example/TBXTools.py

  • TBXTools released /2015-09-07-v1.0/example/ECB-1000-eng.txt

  • TBXTools released /2015-09-07-v1.0/TBXTools.py

  • Antoni Oliver modified a wiki page

    Home

  • TBXTools released /2014-08-18/TBXTools.py