git:/git.code.sf.net/p/docfetcher/code free download

Live Transcribe Speech Engine

Live Transcribe is an Android application

...The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. Deployed in accessibility contexts, it aims for dependable behavior across accents, environments, and intermittent connectivity, with graceful degradation when resources are constrained.

Downloads: 0 This Week

Last Update: 2025-10-10

See Project

BioC

We describe a simple XML format to share text documents and annotation

A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.

Downloads: 4 This Week

Last Update: 2016-08-08

See Project

Drug Extraction

Drug name extraction

...Using CONLL-Evaluation: processed 32065 tokens with 3656 phrases; found: 3251 phrases; correct: 2786. accuracy: 95.25%; precision: 85.70%; recall: 76.20%; FB1: 80.67 Using GATE Corpus Benchmark: Strict: P: 0.65 R: 0.73 F1: 0.69 Lenient: P: 0.74 R: 0.84 F1: 0.78 The details of how to reproduce evaluation, see README. To use standalone version for tagging download DrugExtractionStandalone.tar.gz from Files.

Downloads: 0 This Week

Last Update: 2015-06-12

See Project

Metalanguage And Analysis Toolkit

Downloads: 0 This Week

Last Update: 2015-05-09

See Project

LexSub

A Lexical Substitution Framework

Lexical substitution framework for supervised all-words lexical substitution using delexicalized features. For a runnable (but GPL-licensed) version of LexSub, see LexSub-GPL (sf.net/p/lexsub/lexsub-gpl)

Downloads: 0 This Week

Last Update: 2015-04-01

See Project

Stemmer Gujarati

Offline stemmer for Gujarati , which is one of 22 Indian languages.

...There has been lot of significant work in the development and evaluation of stemmer for non-Indian languages, but very less or no significant work has been done on Indian front especially for Gujarati language.The code of this stemmer is based on algorithm designed under guidance of Prof. Nikita Desai, India. It takes input file of type .txt containing Gujarati text encoded as UTF-8 and then removes stop words which are unessential. After processing rest of the words, it outputs corresponding file containing all stem words plus other details.

Downloads: 0 This Week

Last Update: 2015-04-05

See Project

TML - Text Mining Library for LSA & CMM

TML is a Java Library for LSA and extracting Concept Maps from text

TML has moved to http://www.villalon.cl/tml.html and the code to https://github.com/villalon/tml

3 Reviews

Downloads: 1 This Week

Last Update: 2013-08-05

See Project

miac-p

Code for syntactic parsing and other NLP apps.

Code for syntactic parsing and other natural language processing applications.

Downloads: 0 This Week

Last Update: 2013-02-07

See Project

jaxlr

A simple java library for text and object oriented code. Among the different available packages, there are for text analysis (levenshtein and ngram fingerprinting), a grammar framework, simple object persistence (very light and dependence free), ...

Downloads: 0 This Week

Last Update: 2014-06-04

See Project

FullFiller

Data Base Benchmarking tool

...It fills MySQL tables columns; perform customized tests; and outputs the results on CSV format. It uses Xeger, a java package for generating random text from regular expressions (http://code.google.com/p/xeger/). Xeger uses dk.brics.automaton java package developed by Anders Møller (http://cs.au.dk/~amoeller/automaton/index.html).

Downloads: 0 This Week

Last Update: 2012-06-17

See Project

BioEvent

This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted.

Downloads: 0 This Week

Last Update: 2013-04-25

See Project

Multiparse

This project is contains implementations of algorithms to integrate the output of different NLP tools (part of speech taggers, morphologies, parsers, etc.) in order to obtain more accurate, more robust and more fine-grained linguistic analyses. Note that the code is outdated, but left here for documentation purposes. Its functionality may be reimplemented within the NLP2RDF project (http://code.google.com/p/nlp2rdf).

Downloads: 0 This Week

Last Update: 2013-04-25

See Project

Porter Stemmer

Java version of Porter's Stemming algorithm

...This version extends Martin Porter's original stemming algorithm by allowing capital letters to exist in words. This version should also be plugged in wherever the old algorithm is used with few accommodations necessary. The code in this version is more readable (in my opinion) than the old version. There is a main at the bottom that shows how to use the Stemmer.

Downloads: 0 This Week

Last Update: 2015-10-07

See Project

HebMorph

Making Hebrew properly searchable by IR software. Right now, most work is being done in our mailing list (planning), and on our github repository (concept code, see below).

Downloads: 0 This Week

Last Update: 2013-04-15

See Project

Search Results for "git:/git.code.sf.net/p/docfetcher/code"

Showing 14 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

Live Transcribe Speech Engine

BioC

Drug Extraction

Metalanguage And Analysis Toolkit

LexSub

Stemmer Gujarati

TML - Text Mining Library for LSA & CMM

miac-p

jaxlr

FullFiller

BioEvent

Multiparse

Porter Stemmer

HebMorph

Search Results for "git:/git.code.sf.net/p/docfetcher/code"

Showing 14 open source projects for "git:/git.code.sf.net/p/docfetcher/code"

Live Transcribe Speech Engine

BioC

Drug Extraction

Metalanguage And Analysis Toolkit

LexSub

Stemmer Gujarati

TML - Text Mining Library for LSA & CMM

miac-p

jaxlr

FullFiller

BioEvent

Multiparse

Porter Stemmer

HebMorph

Related Searches

Related Categories