Currently we have made available four abbreviation definition corpora for biomedical literature, described in Islamaj Dogan et al. 2014:
The BioC-PMC corpus, which has been described in Islamaj Dogan et al., can be downloaded from this FTP website.
The corpus is separated into 4 files, and can be found in both ASCII and UNICODE versions.
The "pmc.key" file details all the tags that are used in this corpus. ... read more
BioC SWIG implementation can be found in this package:
BioC_SWIG.
This includes BioC-SWIG-Python and BioC-SWIG-Perl.
More details on what is included in the package, installation and sample test programs are found in the ReadMe file.
BioC Java implementation can be found in this package:
BioC_Java_1.0.
In addition
BioCPreprocessingJavaPipeline contains the BioC adaptation of the Stanford tool set for basic NLP preprocessing. ... read more
BioC C++ implementation has currently two versions in the download section:
BioC_C++_1.0 and
BioC_C++_1.1
Details on what is included in each package, installation and sample test programs are found in the respective ReadMe files of each package. ... read more