We describe a simple XML format to share text documents and annotation
A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented.
Project files contain:
- simple code to hold/read/write data and perform sample processing.
- BioC-formatted corpora
- BioC tools that work with BioC corpora
BioC goals
- simplicity
- interoperability
- broad use
- reuse
There should be little investment required to learn to use a format or a software module to process that format. We are...
TBLTools is a set of GATE processing resources that implements the Fast Transformation Based Learning Algorithm. You can train it to learn rules for NLP tasks such as Named Entity Recognition and Shallow parsing.