Editing large language models within 10 seconds
Repo for external large-scale work
Classical piano MIDI dataset
Reading Wikipedia to Answer Open-Domain Questions
PyTorch original implementation of Cross-lingual Language Model
PyTorch implementation of SimCLR: A Simple Framework
Tools to download and cleanup Common Crawl data
A recommender system for discovering GitHub repos
Natural Language Processing Best Practices & Examples
A Chinese information extraction tool
Phrase-Based & Neural Unsupervised Machine Translation
Text categorization, arabic language processing, language modeling
DeepMind's Tacotron-2 Tensorflow implementation
Beautiful visualizations of how language differs among document types
natural language corpora search engine
We describe a simple XML format to share text documents and annotation
Question answering dataset in "Teaching Machines to Read & Comprehend"
THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/
Python, NLTK-based package for shallow parsing of Brazilian Portuguese
A tool for exporting Wikipedia data
TextBlob is a Python library for processing textual data
a synonym extractor based on web-corpora and a multilingual translator