A full spaCy pipeline and models for scientific/biomedical documents
The Classical Language Toolkit
Chinese XLNet pre-trained model
Resources, corpora, and tools for Chinese natural language processing
PyTorch original implementation of Cross-lingual Language Model
Tools to download and cleanup Common Crawl data
Natural Language Processing Best Practices & Examples
A Chinese information extraction tool
Text categorization, arabic language processing, language modeling
We describe a simple XML format to share text documents and annotation
TextBlob is a Python library for processing textual data