
CALBC (Collaborative Annotation of a Large Biomedical Corpus) is a European Support Action addressing the automatic generation of a very large, community-wide shared text corpus annotated with biomedical entities. We propose to create a broadly scoped and diversely annotated corpus (about one million Medline immunology-related abstracts annotated with different semantic types) by automatically integrating the annotations from different named entity recognition systems.
More information is available at: http://www.calbc.eu/
This project provides the tools to analyse and exploit CALBC data as Java objects.
The library is available here: https://sourceforge.net/projects/calbc/files/
Instructions for using it are here: https://sourceforge.net/p/calbc/home/how-to/
The source code, including classes for conversion into OWL, is available here: https://sourceforge.net/projects/calbc/files/
Further information and support: http://www.ebi.ac.uk/Rebholz/ or http://www.samuelcroset.com/
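
To give a rough idea of what integrating the annotations from different named entity recognition systems can involve, here is a minimal sketch in Java (16 or later). It is not the CALBC library's API and not necessarily the method the project uses: the class `AnnotationVoting`, the record `Annotation`, the method `merge`, the `minVotes` threshold and the semantic type labels are all hypothetical names chosen for illustration. The sketch assumes a simple agreement rule, where an annotation is kept only if enough systems propose exactly the same span and semantic type.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: none of these names come from the CALBC library.
public class AnnotationVoting {

    /** A hypothetical annotation: a character span in the text plus a semantic type. */
    public record Annotation(int start, int end, String semanticType) {}

    /**
     * Keeps every annotation proposed by at least minVotes systems.
     * Each inner list is the output of one system for the same text.
     * Agreement here means identical span and identical semantic type (an assumption).
     */
    public static List<Annotation> merge(List<List<Annotation>> systems, int minVotes) {
        Map<Annotation, Integer> votes = new LinkedHashMap<>();
        for (List<Annotation> output : systems) {
            // A system votes at most once for a given annotation.
            for (Annotation a : new LinkedHashSet<>(output)) {
                votes.merge(a, 1, Integer::sum);
            }
        }
        List<Annotation> agreed = new ArrayList<>();
        for (Map.Entry<Annotation, Integer> entry : votes.entrySet()) {
            if (entry.getValue() >= minVotes) {
                agreed.add(entry.getKey());
            }
        }
        return agreed;
    }

    public static void main(String[] args) {
        // Three hypothetical systems annotating the same abstract.
        List<Annotation> systemA = List.of(
                new Annotation(0, 5, "protein"), new Annotation(10, 18, "disease"));
        List<Annotation> systemB = List.of(new Annotation(0, 5, "protein"));
        List<Annotation> systemC = List.of(
                new Annotation(0, 5, "protein"), new Annotation(10, 18, "species"));

        // Keep annotations that at least two of the three systems agree on.
        for (Annotation a : merge(List.of(systemA, systemB, systemC), 2)) {
            System.out.println(a);
        }
    }
}
```

Exact-match voting is the simplest possible agreement rule. A real harmonisation step would probably also need to handle overlapping spans and differing boundaries, which this sketch deliberately leaves out.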