Name | Modified | Size | Downloads / Week |
---|---|---|---|
Parent folder | |||
README.md | 2024-02-01 | 2.1 kB | |
v4.5.6_ Lemmatizer _ Tokenizer bugfixes source code.tar.gz | 2024-02-01 | 50.8 MB | |
v4.5.6_ Lemmatizer _ Tokenizer bugfixes source code.zip | 2024-02-01 | 52.4 MB | |
Totals: 3 Items | 103.2 MB | 0 |
English Lemmatizer upgrades
- enroll, appall as American spellings, instead of enrol & appal. de- as a verb prefix, blog and xfer as double letter exceptions https://github.com/stanfordnlp/CoreNLP/commit/8adcbfe67fc47f952b773f2eb2afe3a0af73889e
- cowritten https://github.com/stanfordnlp/CoreNLP/commit/2dd08da9de9de381d241c228f15723123a438600
- elder / eldest https://github.com/stanfordnlp/CoreNLP/commit/9b5bec8919bbba9b359eb77171b2ab65828612c7
- Yazidi as a demonym https://github.com/stanfordnlp/CoreNLP/commit/2852da8b1e5feb465083f81f98d14dcc7a71ca0d
Tokenizer upgrades
-
number as a single thing after an abbreviation https://github.com/stanfordnlp/CoreNLP/issues/1396 https://github.com/stanfordnlp/CoreNLP/commit/ad37f2acfa8f9216e1d83e15e93476d9a63b900c
UD Processing upgrades
- 'twas and 'tis as MWT in the UD converter https://github.com/stanfordnlp/CoreNLP/commit/b9f19a67c0f0de1bae99e0d35dbcb8f94dbc0026
- Sort morpho features in alphabetical order when writing out UD https://github.com/stanfordnlp/CoreNLP/commit/f77a9b414cb28c9429a7f100b76272355bf78bb6
Other Bugfixes
- Crash when deleting the endpoints of an
IntervalTree
https://github.com/stanfordnlp/CoreNLP/issues/1405 https://github.com/stanfordnlp/CoreNLP/commit/6d17c2390bcf745f919134a5725629783086f712 - Find and remove extraneous uses of
yield
, which became a keyword: https://github.com/stanfordnlp/CoreNLP/commit/e5c9d443984e1f7434f588e07e0e3212c33f8841 https://github.com/stanfordnlp/CoreNLP/commit/b084233fd6d5da6474d27c6d6832fd35b3a9cb8b
Minor API change
- Updating the text on a CoreLabel no longer wipes out the Lemma https://github.com/stanfordnlp/CoreNLP/commit/c03522b59be2a41beca71817b63af904ed1db729
- Update to more recent Jakarta Servlet https://github.com/stanfordnlp/CoreNLP/commit/8a671fd065c9fcf68f3b19cd2bb9d7e1c79a4e3e
Ssurgeon
- UpdateMorphoFeatures edit https://github.com/stanfordnlp/CoreNLP/commit/27c6703b5f99470c60e0a2fc199cda4af8e5f386
- Lemmatize operation (only works on English) https://github.com/stanfordnlp/CoreNLP/commit/c26b25e118dbc688c9dbc113f8eaa8f44875d8ab