Search Results for "base-files"

Showing 20 open source projects for "base-files"

1

Korean Analyzer Rhino

Parsing Korean words by morpheme and part-of-speech

RHINO parses Korean words by morpheme and part of speech. Its dictionaries are based on the Korean Modern Tagged Corpus (about 12 million phrases), which was created by the Korean government, so it covers many combinations of stems and endings. Its newly developed Dynamic Dictionary Technology lets words react to their context; in effect, the dictionary is a programmed database. For more information, see the files in the help folder.

Downloads: 23 This Week

Last Update: 2020-10-11
2

SimpleLemmatizer

This program is for text lemmatization

It lemmatizes texts based on a supplied model. The base model is for Slovak texts and was created from the Slovak National Corpus, copyright Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences.

Downloads: 0 This Week

Last Update: 2020-03-22
3

TIES

A smart search engine for medical documents

TIES (Text Information Extraction System) is a clinical text search engine that uses natural language processing techniques to extract medical concepts from free-text clinical reports. It provides secure, de-identified access to this information and has built-in collaboration tools and honest broker functionality. It is licensed for academic use under the BSD license. For commercial use, please contact Nexi at http://nexihub.com *** NOTICE: this software and forum are no longer...

1 Review

Downloads: 2 This Week

Last Update: 2019-09-09
4

HermeneutiX

Your graphical tool for Syntactic/Semantic Structure Analysis of texts

HermeneutiX is a tool for diagramming syntactic and semantic structures of complex (not necessarily foreign-language) texts (e.g. Bible or other historical excerpts). HermeneutiX is now part of SciToS (the scientific tool set). Starting with version 2.0.0, HermeneutiX can be found on GitHub.
Please check out the release summary: https://github.com/scientific-tool-set/scitos/releases
For an introduction, check out this video: https://youtu.be/uQjewyG0Ad8
PS: To run a Java...

Downloads: 1 This Week

Last Update: 2017-09-28
5

BioC

We describe a simple XML format to share text documents and annotations

A minimalist approach to sharing text documents and data annotations. Allows a large number of different annotations to be represented.

Project files contain:
- simple code to hold/read/write data and perform sample processing
- BioC-formatted corpora
- BioC tools that work with BioC corpora

BioC goals:
- simplicity
- interoperability
- broad use
- reuse

There should be little investment required to learn to use a format or a software module to process that format. We...

Downloads: 3 This Week

Last Update: 2016-08-08
6

Drug Extraction

Drug name extraction

... indicates the presence of the drug name in the DrugBank.
Using CONLL-Evaluation: processed 32065 tokens with 3656 phrases; found: 3251 phrases; correct: 2786. accuracy: 95.25%; precision: 85.70%; recall: 76.20%; FB1: 80.67
Using GATE Corpus Benchmark:
Strict: P: 0.65 R: 0.73 F1: 0.69
Lenient: P: 0.74 R: 0.84 F1: 0.78
For details on how to reproduce the evaluation, see the README. To use the standalone version for tagging, download DrugExtractionStandalone.tar.gz from Files.

Downloads: 0 This Week

Last Update: 2015-06-12
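
The CoNLL-style precision, recall, and FB1 figures quoted for Drug Extraction can be reproduced from the phrase counts given in the description. The following is a minimal sketch, assuming the standard definitions of precision, recall, and F1; the function name `prf1` is ours for illustration and is not part of the project.

```python
def prf1(gold: int, found: int, correct: int) -> tuple[float, float, float]:
    """Return (precision, recall, F1) as percentages from phrase counts.

    gold    -- phrases in the reference annotation
    found   -- phrases the tagger proposed
    correct -- proposed phrases that match the reference
    """
    precision = correct / found
    recall = correct / gold
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return 100 * precision, 100 * recall, 100 * f1


# Counts taken from the project description above.
p, r, f = prf1(gold=3656, found=3251, correct=2786)
print(f"precision: {p:.2f}%; recall: {r:.2f}%; FB1: {f:.2f}")
# prints "precision: 85.70%; recall: 76.20%; FB1: 80.67"
```

The reported accuracy (95.25%) is a per-token figure and cannot be recomputed from the phrase counts alone.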
7

ISO GrAF

Experimental Java library for reading and writing GrAF/XML files.

The Graph Annotation Framework (GrAF) models linguistic annotations using a data model based on graph theory and algorithms. The GrAF standard is a work product of ISO TC37SC4 Working Group 1. This Java library is NOT part of the GrAF standard, and standoff annotation files produced by the library may not be GrAF compliant.

Downloads: 1 This Week

Last Update: 2015-03-07
8

DCTFinder

Extract the title and creation time from a web page.

... time recognition. DCTFinder is released under the CeCILL free software license agreement. The system is described in the following paper (see 'Files' section): Xavier Tannier. "Extracting News Web Page Creation Time with DCTFinder". Proceedings of the 9th Language Resources and Evaluation Conference. Reykjavik, Iceland.

Downloads: 0 This Week

Last Update: 2016-10-21
9

Mechaglot, Calculate Semantic Similarity

Calculate semantic similarity for any human and human-like languages

WARNING: There are too many false positives! This is an alpha release; expect many things to improve, including the algorithms. PLEASE GO TO BROWSE ALL FILES TO READ A FULL DESCRIPTION. The goal of this project is simple: input two sentences in the same language and obtain a number (from 0 to 1) denoting the similarity between them, according to semantic categories. This project models my previous project: https://sourceforge.net/projects/semantics/ Difference...

Downloads: 0 This Week

Last Update: 2014-10-07
10

FALCON - Text Search Java Project

JSON-based text search Java project

What is it?

"Falcon Search" is a Java API and tool for searching inside documents. It was originally started to search the content of PDF files under the project "HAWK Search". Searching with this tool is query-based, not word-based as in most document search tools or document readers. It also handles jumbled word order within a query and spelling mistakes. Commonly used techniques in this project are Natural Language...

Downloads: 0 This Week

Last Update: 2014-04-18
11

BioLemmatizer

Lemmatization tool for morphological analysis of biomedical literature

The BioLemmatizer is a domain-specific lemmatization tool for the morphological analysis of biomedical literature. It is tailored to the biological domain through integration of several published lexical resources related to molecular biology. It focuses on the inflectional morphology of English, including the plural form of nouns, the conjugations of verbs, and the comparative and superlative form of adjectives and adverbs. README: https://sourceforge.net/projects/biolemmatizer/files...

Downloads: 0 This Week

Last Update: 2013-10-23
12

NetBeans Dictionaries

Additional dictionary files for the NetBeans spellchecker.

Downloads: 9 This Week

Last Update: 2013-03-16
13

GramLab

The Gramlab project aims to provide companies with free, open-source software tools that can be used by developers who are not specialists in language processing. Note: the GLabCorpus Manager tool requires a Solr server to be installed. To download it and for more information, go to the Files section.

Downloads: 0 This Week

Last Update: 2016-03-10
14

FullFiller

Database benchmarking tool

A very simple tool to automate benchmarking tests on MySQL databases. It fills MySQL table columns, performs customized tests, and outputs the results in CSV format. It uses Xeger, a Java package for generating random text from regular expressions (http://code.google.com/p/xeger/). Xeger uses the dk.brics.automaton Java package developed by Anders Møller (http://cs.au.dk/~amoeller/automaton/index.html).

Downloads: 0 This Week

Last Update: 2012-06-17
15

iLastic

Query, integrate and manipulate data using natural languages.

iLastic is an open-source framework to query, integrate and manipulate any type of data in English. Extract, transform and merge information from the web, databases, files or any other data repository using a language you already know... English

Downloads: 0 This Week

Last Update: 2013-10-31
16

Ontology Creation

The program creates OWL ontology files that describe relationships between entities. The relationships are based on definitions found by searching Wikipedia articles for specific lexico-syntactic patterns.

Downloads: 0 This Week

Last Update: 2014-06-26
17

SeMap

Standardizing the existing RelEx2Frame Engine of the RelEx semantic dependency relationship extractor and adding a statistical learning AI for automatic extension of the rule base.

Downloads: 0 This Week

Last Update: 2015-07-31
18

stocleka

stocleka is a project divided into a UI and a library for cleaning user stories and converting them to ARFF files (used by Weka). It is intended mainly for research and scientific purposes.

Downloads: 0 This Week

Last Update: 2013-04-12
19

Wikipedia Concept Association Map

Wikipedia Concept Association Map (WCAM) is a new approach to textual knowledge representation and understanding. All concepts and associations are stored in a graph database for better performance and easy distribution.

Downloads: 0 This Week

Last Update: 2014-03-28
20

Nasira

Nasira is a Java library for reading text files with non-ASCII characters (e.g. documents in German, Swedish, ...). To do so, it automatically determines which character encoding (ISO-8859-1 or UTF-8) was used to encode the file, guided by user-provided hints.

Downloads: 0 This Week

Last Update: 2013-04-22
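
The idea of picking an encoding from user-provided hints can be illustrated with a short sketch. This is not Nasira's API (Nasira is a Java library, and its interface is not shown here); the function `guess_encoding` and the hint mechanism below are assumptions made for illustration. The sketch tries each candidate encoding and accepts the first one whose decoded text contains one of the strings the user expects to see.

```python
from typing import Iterable, Optional

# Candidate encodings named in the description. UTF-8 is tried first because
# ISO-8859-1 can decode any byte sequence and would otherwise always "win".
CANDIDATES = ("utf-8", "iso-8859-1")


def guess_encoding(data: bytes, hints: Iterable[str]) -> Optional[str]:
    """Return the first candidate encoding under which `data` decodes
    cleanly and contains at least one hint string, or None."""
    hints = list(hints)
    for encoding in CANDIDATES:
        try:
            text = data.decode(encoding)
        except UnicodeDecodeError:
            continue  # bytes are not valid in this encoding
        if any(hint in text for hint in hints):
            return encoding
    return None


# Hypothetical usage: the user expects German umlauts in the document.
sample = "Grüße".encode("iso-8859-1")
print(guess_encoding(sample, hints=["ü", "ß"]))  # prints "iso-8859-1"
```

A hint here is simply a character or word the user knows should occur in the correctly decoded text; a real library may use richer hints.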

© 2024 Slashdot Media. All Rights Reserved.
