A 50 million tokens corpus of Classical Arabic.
Converting text to a structured representation
Probabilistic Noising of Natural Language
OWL/DL ontologies for linguistic annotations
Resources for speech processing in Brazilian Portuguese
Named-entity recognition using neural networks
Library to scrape and clean web pages to create massive datasets
TextRank implementation for Python 3
Safe Harbor Deidentification for medical documents
A smart search engine for medical documents
Text Analytics Platform
Text categorization, arabic language processing, language modeling
Implementation of research papers on Deep Learning+ NLP+ CV in Python
Analyze text. Diagonal read subject, predicate, obj. Search other pdf.
Predicting Organic Reactions using Neural Networks.
AiLearning, data analysis plus machine learning practice
Chatbot in 200 lines of code using TensorLayer
Parallel Optimization Library for Java
This repository contains a new generative model of chatbot
SOA infrastracture initially developed by NICT Language Grid Project
text file quick lemmater
Natural Language Processing (NLP) for the Masses