Code for "Improving Language Understanding by Generative Pre-Training"
Elasticsearch to Pandas dataframe or CSV
SOA infrastructure initially developed by the NICT Language Grid Project
Web-based OCR for the Firefox browser and PC
We describe a simple XML format to share text documents and annotations
KDE service menu for PDF files
Edit the OCR text layer of DjVu documents in a web browser
MedicalRecords is an integrated medical information system.
The complete DjVu solution, with OCR technology (Arabic, English).
Read DjVu documents, with OCR technology (Arabic, English), small size
NLP tool for statistical analysis of words, sentences, documents
JSON-based text search Java project
Consilium – user-defined sentence suggestion tool.
Classify any two TXT documents, no training required - Java
Community-based linguistic annotation work on clinical documents.
Non-disjoint grouping of documents based on a word-sequence approach
Command-line multiclass email and text filter