ghost 4 linux free download

WordCount

Count frequency of single, 2-word and 3-word clusters in a text

The program can read a text file and count the occurrences of single words and clusters of 2 and 3 words. The resulting list will be sorted in descending order (highest frequency on top).

Downloads: 3 This Week

Last Update: 2025-02-01

See Project

Arabic Corpus

Text categorization, arabic language processing, language modeling

The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...

Downloads: 16 This Week

Last Update: 2019-03-05

See Project

Presage

the intelligent predictive text entry platform

Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...

3 Reviews

Downloads: 229 This Week

Last Update: 2018-10-11

See Project

Helsinki Finite-State Technology

The Helsinki Finite-State Transducer toolkit is intended for processing natural language morphologies. The toolkit is demonstrated by wide-coverage implementations of a number of languages of varying morphological complexity.

Downloads: 0 This Week

Last Update: 2017-09-14

See Project

Arramooz Alwaseet Arabic Dictionary

Arramooz Alwaseet Open Arabic Dictionary for morphological analyze. To be useful for Arabic language processing. This dictionary is derived from the Ayaspell Arabic spell checker.

Downloads: 2 This Week

Last Update: 2016-12-22

See Project

poliqarp2

natural language corpora search engine

This project aims at building an efficient indexer and search engine for natural language corpora with multilevel annotations.

Downloads: 0 This Week

Last Update: 2016-12-19

See Project

BioC

We describe a simple XML format to share text documents and annotation

A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are...

Downloads: 3 This Week

Last Update: 2016-08-08

See Project

ACOPOST - a collection of POS taggers

Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-26

See Project

Speech Research Tools

Software for speech research. It includes programs and libraries for signal processing, along with general purpose scientific libraries. Most of the code is in Python, with C/C++ supporting code. Also, contains code releases corresponding to publishe

Downloads: 0 This Week

Last Update: 2015-12-13

See Project

Alfanous

Quran Search Engine

Alfanous (The Lantern - الفانوس ) is an Arabic search engine API provide the simple and advanced search in the Holy Quran , more features and many interfaces...

2 Reviews

Downloads: 0 This Week

Last Update: 2019-07-20

See Project

Aelius Brazilian Portuguese POS-Tagger

Python, NLTK-based package for shallow parsing of Brazilian Portuguese

Aelius is an ongoing open source project aiming at developing a suite of Python, NLTK-based modules and interfaces to external freely available tools for shallow parsing of Brazilian Portuguese. It also includes language resources such as language models, sample texts, and gold standards. Presently, Aelius already offers facilities for POS-tagging and chunking corpora and outputting annotations in different formats, such as in XML in the TEI P5 encoding scheme.

1 Review

Downloads: 0 This Week

Last Update: 2014-11-03

See Project

Mishkal: Arabic Text Vocalization

Arabic Text Vocalization system

Automatic system of vocalization of arabic text.

5 Reviews

Downloads: 33 This Week

Last Update: 2017-10-29

See Project

t2t-pipe

automatic alignment pipeline for parallel treebanks

The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of python scripts, co-ordinating the process of automatic alignment of parallel treebanks from plain text files with a single call from a unix command line. Supported Languages: DE, FR, EN

Downloads: 0 This Week

Last Update: 2014-01-07

See Project

Donatus Parsing Tools for Portuguese

Donatus is an on-going project consisting of Python, NLTK-based tools and grammars for deep parsing and syntactical annotation of Brazilian Portuguese corpora. It includes a user-friendly graphical user interface for building syntactic parsers with the NLTK, providing some additional functionalities.

Downloads: 0 This Week

Last Update: 2016-08-28

See Project

Corpus redundancy manager

Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.

Downloads: 0 This Week

Last Update: 2014-06-30

See Project

Sylli

Sylli is a universal syllabifier. Developed for Italian, it can easily be adapted to any language that is claimed to respect the SSP. Sylli divides timit, strings, files and directories into syllables.

Downloads: 0 This Week

Last Update: 2012-10-15

See Project

WordNetLMF

WordNetLMF converts WordNet (http://wordnet.princeton.edu/) lexicographer files into KYOTO-LMF, the LMF dialect used in the KYOTO project (http://www.kyoto-project.eu/).

Downloads: 0 This Week

Last Update: 2013-04-11

See Project

Varro

The Varro toolkit is a system for identifying and frequently recurring unordered subtrees in semi-structured data. It is mostly for linguistics but has applications in semi-structured data mining too.

Downloads: 0 This Week

Last Update: 2015-06-04

See Project

py-translate

Moved to Github: http://github.com/tremby/py-translate

1 Review

Downloads: 0 This Week

Last Update: 2015-11-25

See Project

Open Source Linguistics

A public repository of open source scripts and small programs related to linguistics and language.

Downloads: 0 This Week

Last Update: 2014-08-22

See Project

Search Results for "ghost 4 linux"

Showing 20 open source projects for "ghost 4 linux"

WordCount

Arabic Corpus

Presage

Helsinki Finite-State Technology

Arramooz Alwaseet Arabic Dictionary

poliqarp2

BioC

ACOPOST - a collection of POS taggers

Speech Research Tools

Alfanous

Aelius Brazilian Portuguese POS-Tagger

Mishkal: Arabic Text Vocalization

t2t-pipe

Donatus Parsing Tools for Portuguese

Corpus redundancy manager

Sylli

WordNetLMF

Varro

py-translate

Open Source Linguistics

Search Results for "ghost 4 linux"

Showing 20 open source projects for "ghost 4 linux"

WordCount

Arabic Corpus

Presage

Helsinki Finite-State Technology

Arramooz Alwaseet Arabic Dictionary

poliqarp2

BioC

ACOPOST - a collection of POS taggers

Speech Research Tools

Alfanous

Aelius Brazilian Portuguese POS-Tagger

Mishkal: Arabic Text Vocalization

t2t-pipe

Donatus Parsing Tools for Portuguese

Corpus redundancy manager

Sylli

WordNetLMF

Varro

py-translate

Open Source Linguistics

Related Searches

Related Categories