Does anyone know what data was manually annotated and used to train the named entity and coreference models? I'm running experiments on particular sections of the Penn Treebank (23 and 24), and would like to avoid using tools that were trained on these sections, for obvious reasons.
Thanks!
Matt Gerber
*bump*
Is anyone familiar with the issue I raised?
Hi,
It's trained on Penn Treebank sections 00 and 01. Hope this helps…Tom
Thanks, Tom! It does help.