Text categorization, arabic language processing, language modeling
DeepMind's Tacotron-2 Tensorflow implementation
Powerful search library, best suited for computer-aided translation
43 Queries for Arabic Information Retrieval Collection
An open source system for Arabic corpora processing
front end to Hipparchia corpora: searching, browsing, concordances, texts, dictionaries, parsing
The most comprehensive database of Chinese poetry
Beautiful visualizations of how language differs among document types
Arabic business and management corpus
natural language corpora search engine
We describe a simple XML format to share text documents and annotation
Question answering dataset in "Teaching Machines to Read & Comprehend"
Dialogue Similarity
A tool for evaluating automatic terminology extraction.
GloVe model for distributed word representation
Hadoop framework for scalable processing of large web corpora
cross-languages resources
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Python, NLTK-based package for shallow parsing of Brazilian Portuguese
An Arabic Corpora Processing Tool