We describe a simple XML format to share text documents and annotation

Add a Review
14 Downloads (This Week)
Last Update:
Download go_bioc_1.0.tar.gz
Browse All Files



A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented.

Project files contain:

- simple code to hold/read/write data and perform sample processing.
- BioC-formatted corpora
- BioC tools that work with BioC corpora

BioC goals

- simplicity
- interoperability
- broad use
- reuse

There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.

BioC Web Site


  • simple XML format

Update Notifications

Write a Review

User Reviews

Be the first to post a review of BioC!

Additional Project Details

Intended Audience


Programming Language

Python, Perl, C++, Ruby, Java


Screenshots can attract more users to your project.
Features can attract more users to your project.

Icons must be PNG, GIF, or JPEG and less than 1 MiB in size. They will be displayed as 48x48 images.