linux command free download

Safe Harbor Deidentification

Safe Harbor Deidentification for medical documents

Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.

Downloads: 0 This Week

Last Update: 2019-09-10

See Project

ElixirFM

Functional Arabic Morphology

ElixirFM is a high-level implementation of Functional Arabic Morphology. The core of ElixirFM is written in Haskell, while interfaces in Python and Perl support lexicon editing and other interactions. http://github.com/otakar-smrz/elixir-fm

1 Review

Downloads: 0 This Week

Last Update: 2016-06-28

See Project

Encode Arabic

Encode Arabic provides tools for encoding and decoding Arabic in Haskell, Python, Perl, or LaTeX. Interprets the ArabTeX notation to generate original orthography or phonetic transcription. Supports Buckwalter and other romanizations. Converts legacy byte encodings into Unicode. http://github.com/otakar-smrz/encode-arabic

1 Review

Downloads: 0 This Week

Last Update: 2016-06-28

See Project

Resources for Closely Related Languages

This project concerns the development of human language technology resources, based on the approach to share or recycle resources between closely related language. http://gerhard.pro/closely-related-languages/

Downloads: 0 This Week

Last Update: 2015-12-29

See Project

Lingala NLP

This project is devoted to the development of natural language processing tools and resources for the Lingala language, which is spoken by tens of millions of people in central Africa.

Downloads: 2 This Week

Last Update: 2014-11-13

See Project

Automatic Compound Processing (AuCoPro)

Automatic compound splitting and semantic analysis of compounds

The central problem to be addressed in this project concerns a multidisciplinary (linguistics and computational linguistics) investigation into sharing of knowledge and resources between closely-related languages, specifically relating to the automatic processing of compounds. Specifically, we will explore the possibility to create new knowledge about closely-related languages, and efficiently develop additional, more advanced resources for (a) compound segmentation; and (b) the semantic...

Downloads: 0 This Week

Last Update: 2015-07-28

See Project

Perstem

Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.

1 Review

Downloads: 0 This Week

Last Update: 2016-11-23

See Project

Uplug corpus tools

Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools. Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB. Download 'uplug-main' first and then add other packages.

Downloads: 0 This Week

Last Update: 2013-04-29

See Project

BioEvent

This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted. The method details are explained in the...

Downloads: 0 This Week

Last Update: 2013-04-25

See Project

Simple Semantic Classifier

The Simple Semantic Classifier classifies short chunks of natural language text into broad semantic classes that correspond to the OBO ontologies provided as input.

Downloads: 0 This Week

Last Update: 2013-04-11

See Project

Perl Turing Machine

Sample turing machine for educational purposes.

Downloads: 0 This Week

Last Update: 2013-04-09

See Project

Wikipedia for language research

This project tries to make Spanish Wikipedia a useful resource for the language research community.

Downloads: 0 This Week

Last Update: 2013-04-17

See Project

Aramorpher

Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).

Downloads: 1 This Week

Last Update: 2016-08-09

See Project

stance

Stance is a perl script for generating random sentences in Dutch, which can be used as translation exercises for students of Dutch. In its finished version, it should be able to generate only gramatically correct sentences.

Downloads: 0 This Week

Last Update: 2012-12-12

See Project

Search Results for "linux command"

Showing 14 open source projects for "linux command"

Safe Harbor Deidentification

ElixirFM

Encode Arabic

Resources for Closely Related Languages

Lingala NLP

Automatic Compound Processing (AuCoPro)

Perstem

Uplug corpus tools

BioEvent

Simple Semantic Classifier

Perl Turing Machine

Wikipedia for language research

Aramorpher

stance

Search Results for "linux command"

Showing 14 open source projects for "linux command"

Safe Harbor Deidentification

ElixirFM

Encode Arabic

Resources for Closely Related Languages

Lingala NLP

Automatic Compound Processing (AuCoPro)

Perstem

Uplug corpus tools

BioEvent

Simple Semantic Classifier

Perl Turing Machine

Wikipedia for language research

Aramorpher

stance

Related Searches

Related Categories