Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Scientific/Engineering
Linguistics Software
Search Results

Search Results for "java open source" - Page 5

x

Sort By:

Relevance

Clear All Filters

OS

Linux 282
Windows 259
Mac 216
More...
BSD 155
ChromeOS 116
Desktop Operating Systems 11
Mobile Operating Systems 11
Embedded Operating Systems 1
Game Consoles 1
Server Operating Systems 1

Category

Scientific/Engineering 350
- Linguistics 350
  - Machine Translation 88
- Bio-Informatics 10
- HMI 11
- Information Analysis 35
- Interface Engines 1
- Mapping 1
- Mathematics 7
- Medical 3
- Molecular Science 2
- More...
- Robotics 2
- Simulation 1
Artificial Intelligence 83
Education 58
Software Development 46
Text Editors 23
Business 17
Multimedia 14
Internet 11
Formats and Protocols 10
Social sciences 9
System 8
Games 6
Religion and Philosophy 6
Communications 5
Desktop Environment 5
Database 4
Mobile 2
Security 2
Printing 1
Productivity 1

License

OSI-Approved Open Source 342
Creative Commons Attribution License 9
Other License 4
Public Domain 3

Translations

Programming Language

Status

Beta 97
Production/Stable 86
Alpha 65
Pre-Alpha 39
More...
Planning 26
Mature 7
Inactive 2

Showing 350 open source projects for "java open source"

View related business solutions

Linguistics Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Cloud tools for web scraping and data extraction
Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.

Explore 10,000+ tools
1

LINNAEUS

Entity recognition and normalization software for biomedical text

Downloads: 0 This Week

Last Update: 2016-05-05
See Project
2

Hebrew Deflector

A proram to de-inflect modern Hebrew words

Hebrew Deflector tries to guess the root, the pattern and the form of a modern Hebrew word provided by the user. It uses the existing rules of the language to do that, and displays the list of possible answers. It is not a dictionary and it doesn't know whether the word (and the listed forms of it) exists or not. It also doesn't know anything about exception to the rules.

Downloads: 0 This Week

Last Update: 2016-11-29
See Project
3

Welsh Natural Language Toolkit

WNLT is a suite of open source natural language modules for the Welsh

The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words....

Downloads: 1 This Week

Last Update: 2016-11-29
See Project
4

Multi-Lingual Vocabulary Trainer

Cross-platform application aimed at helping users to learn vocabulary from any foreign language(s). Add/Edit/Delete vocab words (w/ translation, category, sentence, notes, picture). Review (Quiz) vocabulary words.

7 Reviews

Downloads: 0 This Week

Last Update: 2016-05-03
See Project
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
5

diasim

Dialogue Similarity

Tools for calculating similarity (including lexical and syntactic) between speakers in dialogue, across standard and randomised corpora.

Downloads: 0 This Week

Last Update: 2016-03-31
See Project
6

texrex

Web corpus creation software (moved to GitHub)

This project has moved to GitHub: https://github.com/rsling/texrex https://github.com/rsling/cow

Downloads: 0 This Week

Last Update: 2016-04-20
See Project
7

bnf2xml

simple BNF parser makes xml markup of matches

bnf2xml a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by the xml labels that show context. bnf2xml is as simple to use as any text binary ie, awk(1) grep(1). bnf2xml does not require C API because it outputs simple xml labeling. README is visible on file dl page. EXAMPLE: $ echo "hi" | bnf2xml patternfile <word><alph>h</alph><alph>i</alph></word> or <gas>hydrogen iodide</gas> patternfile says how to find...

Downloads: 0 This Week

Last Update: 2016-04-08
See Project
8

ACOPOST - a collection of POS taggers

Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-26
See Project
9

Classical Arabic Corpus

A corpus contains more than 1 M distinct Arabic words.

This project has been developed as part of a master thesis named "Edit Distance Adapted to Natural Language Words". The available project consists three parts. First, the corpus gathers more than one million distinct Arab words. Second, the text files of Arabic resources. Third, the index file presents some information about these resources. Additional details about these parts are available in README file.

Downloads: 0 This Week

Last Update: 2016-01-19
See Project
Grafana: The open and composable observability platform
Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

Grafana is the open source analytics & monitoring solution for every database.

Learn More
10

Speech Research Tools

Software for speech research. It includes programs and libraries for signal processing, along with general purpose scientific libraries. Most of the code is in Python, with C/C++ supporting code. Also, contains code releases corresponding to publishe

Downloads: 0 This Week

Last Update: 2015-12-13
See Project
11

FREJ

FREJ stands for "Fuzzy Regular Expressions for Java" - it is a command-line tool and library which allow you easily compare strings with patterns disregarding nasty typos and considering several variants (like "Barack Obama", "B.H.Obama" etc.) Project sources are moved to github: https://github.com/RodionGork/FREJ

Downloads: 0 This Week

Last Update: 2015-10-17
See Project
12

Morfologik

ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/

1 Review

Downloads: 0 This Week

Last Update: 2015-09-10
See Project
13

KneeTex

KneeTex is an open–source, stand–alone application for information extraction from narrative reports that describe an MRI scan of the knee. Given an MRI report as input, the system outputs the corresponding clinical findings in the form of JavaScript Object Notation objects. The extracted information is mapped onto TRAK, an ontology that formally models knowledge relevant for the rehabilitation of knee conditions.

Downloads: 0 This Week

Last Update: 2015-09-11
See Project
14

ModelBlocks

C++ template library for modular construction of factored probabilistic time-series models, model trainers, and recognizers.

Downloads: 0 This Week

Last Update: 2016-11-27
See Project
15

Virastyar

Virastyar is an spell checker for low-resource languages

Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013).

14 Reviews

Downloads: 60 This Week

Last Update: 2020-03-05
See Project
16

mwetoolkit

THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/ The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expresisons, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be...

1 Review

Downloads: 0 This Week

Last Update: 2019-05-01
See Project
17

BANNER Named Entity Recognition System

BANNER is a named entity recognition system intended primarily for biomedical text. It uses conditional random fields as the primary recognition engine and includes a wide survey of the best techniques described in recent literature.

Downloads: 1 This Week

Last Update: 2015-07-30
See Project
18

eAlign

A parallel corpora (bitext) aligning tool. Create TMX databases

(Full support available under superalign.sourceforge.net) Aligning parallel corpora Creating TMX, csv, Tab Delimited TMs Automatic aligning of text Super fast handling of multiple files Very easy GUI handling of files under Windows CAT tool assistant

Downloads: 0 This Week

Last Update: 2016-11-26
See Project
19

SuperAlign

SuperAlign was fully updated as of 15 July 2013 and is now released under the name eAlign as well. A parallel corpora (bitext) aligning tool. Create TMX databases and align translations for Translation Memory databases. Use multiple files in multiple formats to align them with their translations. The full workflow is built in with a GUI interface. SuperAlign-eAlign uses the hunalign algorithm.

1 Review

Downloads: 0 This Week

Last Update: 2016-02-15
See Project
20

AJ-JpnRa Tool

｢AJ-JpnRa Tool｣ is Japanese text readability analysis program.

We temporarily suspend the release of the program due to a patent application. -2020.09 AJ-JpnRa Tool is Japanese text readability analysis program, is mainly ordered by the guidelines of JLPT. You can analyze Japanese-Text Readability with the length and Chinese character level of the text by using the AJ-JpnRa Tool. And Chinese character level is analyzed by the database(AJ-JpnRa Tool), which was built according to essential Chinese character education guideline of Japan elementary...

Downloads: 0 This Week

Last Update: 2020-09-29
See Project
21

AsiEs

AsiEs stands for Asistente de Escritura (writing assistant). It provides word prediction and autocomplete for fast writing. Thought for people with difficulties writing on keyboard, improves the writing speed preventing the user from pressing at most 50% of keys to write and avoids ortographic errors. Made by Fundación Teletón Uruguay (http://www.teleton.org.uy/home/)

Downloads: 0 This Week

Last Update: 2015-06-17
See Project
22

Metalanguage And Analysis Toolkit

Downloads: 0 This Week

Last Update: 2015-05-09
See Project
23

JInsect

The JINSECT toolkit is a Java-based toolkit and library that supports and demonstrates the use of n-gram graphs within Natural Language Processing applications, ranging from summarization and summary evaluation to text classiﬁcation and indexing.

3 Reviews

Downloads: 0 This Week

Last Update: 2015-08-25
See Project
24

Alfanous

Quran Search Engine

Alfanous (The Lantern - الفانوس ) is an Arabic search engine API provide the simple and advanced search in the Holy Quran , more features and many interfaces...

2 Reviews

Downloads: 2 This Week

Last Update: 2019-07-20
See Project
25

TERCpp

This tool is made to score machine translation performance with the TER metric. This code is based on Snover's algorithm.

Downloads: 0 This Week

Last Update: 2015-11-11
See Project

Previous
1
2
3
4
You're on page 5
6
7
8
9
Next

Related Searches

speech recognition in telugu language

vocabulary trainer

pos

arabic corpus

frej

morfologik

virastyar

corpus

named entity recognition source code

tmx

Related Categories

Scientific/Engineering

Artificial Intelligence

Education

Software Development

Text Editors

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: