Modular Suite of NLP Tools
A tool for learning vector representations of words and entities
Evaluation code for various unsupervised automated metrics
Tools to download and clean up Common Crawl data
A 50-million-token corpus of Classical Arabic
The open-source virtual assistant for Ubuntu-based Linux distributions
Basic Utilities for PyTorch Natural Language Processing (NLP)
Text categorization, Arabic language processing, language modeling
Named-entity recognition using neural networks
Library to scrape and clean web pages to create massive datasets
Open-source tool for Arabic text readability
NLP tool for statistical analysis of words, sentences, documents
This project presents a new corpus for news text analysis in Persian
A corpus that could be of help for researchers working on Arabic NLP
English-Khmer Automatic Statistical Machine Translation (SMT)
CRFSharp is a .NET (C#) implementation of Conditional Random Fields