Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2024-04-19 | 1.3 kB | |
v4.5.7 - Constituency to Dependency Converter Upgrades source code.tar.gz | 2024-04-19 | 51.5 MB | |
v4.5.7 - Constituency to Dependency Converter Upgrades source code.zip | 2024-04-19 | 53.0 MB | |
Totals: 3 Items | 104.5 MB | 0 |
UD converter upgrades
Inspired by https://github.com/UniversalDependencies/docs/issues/717, although the work is not finished
- Add an option to use the PTBCorrector, which fixes many (although not all) incorrect POS tags https://github.com/stanfordnlp/CoreNLP/commit/5e57eaba40897ee93b69ed3f11bda511f6b427d8
- Treat
sort of
the same askind of
https://github.com/stanfordnlp/CoreNLP/commit/bc4acf11d165c4185121ff501c26b354a05a2477 en masse
is flat https://github.com/stanfordnlp/CoreNLP/commit/cb338cd57fdcd9ef0fc1aa1fe2fa563d578fea15dinna
is an MWT https://github.com/stanfordnlp/CoreNLP/commit/1dd746cfea4f82e3b1c161bcc95c457f0d8a2618- Use
AUX
as the POS in the converter when appropriate https://github.com/stanfordnlp/CoreNLP/commit/30f2f8e7d92492a152dd5fc8b85327860b44cc2a - Fix (heh)
all but
andwhether or not
https://github.com/stanfordnlp/CoreNLP/commit/25136768ee22e5431051d756c4c63c41af00de99 - Dependency
dep
->ccomp
for frontedsay
verbs https://github.com/stanfordnlp/CoreNLP/commit/a76a854ce249ae028eec010b1a48d68748d59a61
Parser evaluation improvements
- Include the F1 scores of each tree when scoring a constituency dataset https://github.com/stanfordnlp/CoreNLP/commit/2725b06fa96400e9c25e314b5e16b18720764ab2