MARF is a general cross-platform framework, implemented in Java, with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition, along with sample applications (identification, NLP, etc.) that demonstrate its use.
Download search engine and directory with Rapidshare and Torrent - zoozle Download Suchmaschine
All the files that run the world-leading German download search engine, which had 500,000 unique visitors a day in 2010. These are all the tools you need to set up a clone.
Code contains:
- PHP files for zoozle
- Perl crawler for gathering new content into the database, and all the other cool tools I have...
This is a Java-based project for complex event extraction from text and co-reference resolution. Currently, the code can read the BioNLP shared task format (http://2011.bionlp-st.org/) and the i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events in a text and the parameters of each event.
The method is based on SVM, but other ML algorithms can be adopted. The method details are explained in the...
Simple monitoring utility to help optimize a (MySQL) database server. Run the utility for a while to record the queries sent to the database. Reports can be generated while the utility is running or afterwards, and are available as text or as Excel sheets.
Small XML-to-ASCII parser written in Perl as an object-oriented Perl module. It can be used to extract pure ASCII text from a given XML document. Planned for later: extracting text from web documents, or from given arrays and hashes.
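The idea of pulling plain text out of an XML document can be sketched in a few lines. The example below is only an illustration in Python using the standard library's `xml.etree.ElementTree`; it is not the Perl module's code or interface, and the function name `xml_to_text` is a hypothetical name chosen for this sketch.

```python
import xml.etree.ElementTree as ET


def xml_to_text(xml_source: str) -> str:
    """Return the text content of an XML string with all markup removed.

    Hypothetical helper for illustration; not part of the Perl module.
    """
    root = ET.fromstring(xml_source)
    # itertext() yields the text of every element in document order,
    # including whitespace-only chunks used for indentation.
    pieces = (chunk.strip() for chunk in root.itertext())
    # Drop empty chunks and join the rest with single spaces.
    return " ".join(piece for piece in pieces if piece)


if __name__ == "__main__":
    sample = "<note><to>Alice</to><body>Hello world</body></note>"
    print(xml_to_text(sample))
```

Joining the chunks with a single space keeps the text of neighbouring elements from running together, at the cost of sometimes adding a space where inline markup sat inside a sentence.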
plaindb is a plain-text database package. It comprises the file format and a
set of modules in Perl, Tcl, and Python to access the databases, plus a basic
shell interface.
Higher-level interfaces are also included for local (GUI) and web
access.