Name | Modified | Size | Downloads / Week |
---|---|---|---|
README.md | 2019-09-07 | 225 Bytes | |
v1.0.4 source code.tar.gz | 2019-09-07 | 852.3 kB | |
v1.0.4 source code.zip | 2019-09-07 | 871.9 kB | |
Totals: 3 Items | | 1.7 MB | 1 |

Changelog:
- Pages that belong to Module and TimedText namespaces are now ignored while creating DumpDB
- Improved normalization rules of entity titles of Wikipedia links [#36] [#38]
- Fixed Jieba tokenizer [#27]
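
As a rough illustration of the first changelog item, the sketch below skips pages whose titles fall in the Module or TimedText namespaces. It is not the project's actual code: the names `IGNORED_NAMESPACES`, `is_ignored_page`, and `filter_pages` are hypothetical, and it assumes namespaces appear as English `Name:` title prefixes, which may not hold for localized dumps.

```python
# Hypothetical sketch of namespace filtering; not taken from the project source.
# Assumption: the namespace is visible as an English "Name:" prefix on the title.
IGNORED_NAMESPACES = ("Module:", "TimedText:")


def is_ignored_page(title: str) -> bool:
    """Return True if the page title belongs to an ignored namespace."""
    # str.startswith accepts a tuple and matches any of its prefixes.
    return title.startswith(IGNORED_NAMESPACES)


def filter_pages(titles):
    """Keep only the titles that are outside the ignored namespaces."""
    return [title for title in titles if not is_ignored_page(title)]


titles = [
    "Python (programming language)",
    "Module:Citation/CS1",
    "TimedText:Example.ogv.en.srt",
    "Tokyo",
]
kept = filter_pages(titles)
```

Matching on the trailing colon keeps ordinary articles such as "Modular arithmetic" from being dropped by accident.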