Curated list of datasets and tools for post-training
Powerful search library, best suited for computer-aided translation
The most comprehensive database of Chinese poetry
Question answering dataset in "Teaching Machines to Read & Comprehend"
TextBlob is a Python library for processing textual data