The goal of this project is to investigate optimal ways to do genre classification for the ten indigenous South African languages. Funded by Dept of Arts and Culture of the SA Government.
http://www.trifonius.co.za/projects/genre-classification
Safe Harbor Deidentification for medical documents
Phalanx - Deidentify
Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
This project concerns the development of human language technology resources, based on the approach to share or recycle resources between closely related language.
http://gerhard.pro/closely-related-languages/
Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
A database of linguistic annotation of medical text (from MEDLINE), including corpora used with ABGene, BioCreative I and II, and the MedPost training corpus.