| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| keeping up with the evolving python ecosystem source code.tar.gz | 2020-03-01 | 2.0 MB | |
| keeping up with the evolving python ecosystem source code.zip | 2020-03-01 | 2.2 MB | |
| README.md | 2020-03-01 | 1.5 kB | |
| Totals: 3 Items | 4.2 MB | 0 | |
New:
- Added a logo to textacy's documentation and social preview :page_with_curl:
- Added type hints throughout the code base, for more expressive type indicators in docstrings and for static type checkers used by developers to code more effectively (PR [#289])
- Added a preprocessing function to normalize sequences of repeating characters (Issue [#275])
Changed:
- Improved core
Corpusfunctionality using recent additions to spacy (PR [#285]) - Re-implemented
Corpus.save()andCorpus.load()using spacy's newDocBinclass, which resolved a few bugs/issues (Issue [#254]) - Added
n_processarg toCorpus.add()to set the number of parallel processes used when adding many items to a corpus, following spacy's updates tonlp.pipe()(Issue [#277]) - Bumped minimum spaCy version from 2.0.12 => 2.2.0, accordingly
- Added handling for zero-width whitespaces into
normalize_whitespace()function (Issue [#278]) - Improved a couple rough spots in package administration:
- Moved package setup information into a declarative configuration file, in an attempt to keep up with evolving best practices for Python packaging
- Simplified the configuration and interoperability of sphinx + github pages for generating package documentation
Fixed:
- Fixed typo in ConceptNet docstring (Issue [#280])
- Trained and distributed a
LangIdentifiermodel usingscikit-learn==0.22, to prevent ambiguous errors when trying to load a file that didn't exist (Issues [#291], [#292])