Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "java text mining preprocessing" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Mac 47
Linux 47
Windows 46
More...
BSD 27
ChromeOS 25
Desktop Operating Systems 3
Mobile Operating Systems 1

Category

Scientific/Engineering 26
Artificial Intelligence 20
Software Development 11
Business 10
Internet 7
Text Editors 4
Database 3
Education 3
Communications 2
Formats and Protocols 2
System 2
Multimedia 1
Social sciences 1
Terminals 1

License

OSI-Approved Open Source 38
Other License 2
Creative Commons Attribution License 1
Public Domain 1

Translations

English 12
German 4
Portuguese 1
Russian 1
More...
Spanish 1

Programming Language

Java 39
C++ 3
XSL (XSLT/XPath/XSL-FO) 3
C 2
More...
Groovy 2
Perl 2
Prolog 2
Ruby 2
Fortran 1
JavaScript 1
Kotlin 1
Objective C 1
Python 1
Unix Shell 1

Status

Beta 14
Alpha 11
Production/Stable 11
Pre-Alpha 3
More...
Planning 2
Mature 2

Showing 47 open source projects for "java text mining preprocessing"

View related business solutions

Mac Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
1

text-analysis

This project aims to implement in java the following text mining techniques: Text Language Detection, Keywords and keyphrases extraction, Text Classification, Text Clustering, Single or multiple documents Summarization, Plagiarism Detection.

Downloads: 0 This Week

Last Update: 2014-05-20
See Project
2

Text Mining Commons API

This library offers an API to useful tools that can be utilized for any text mining project. More details will be mentioned in the project's future documentation

Downloads: 0 This Week

Last Update: 2016-10-23
See Project
3

Data Mining LiWeCool

DM LiWeCool is a tool for preprocessing light-weight CSV data files as Weka-compatible. It includes merging different header lines into one, editing values (encoding, categorizing, etc) and saving data as ARFF or XRFF (Weka native). It is Java-based.

Downloads: 0 This Week

Last Update: 2015-02-06
See Project
4

Contextor

Contextor is a light-weight simple-to-use Java based library to help developers and researchers working with the general concept of a resource; as examples, resources can be text resources, web resources, images and videos.

Downloads: 0 This Week

Last Update: 2013-04-18
See Project
Keep company data safe with Chrome Enterprise
Protect your business with AI policies and data loss prevention in the browser

Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.

Download Chrome
5

Wikipedia Concept Association Map

Wikipedia Concept Association Map (WCAM) is new approach for textual knowledge representation and understanding. All concepts and associations are stored in a graph database for better performance and easy distribution.

Downloads: 0 This Week

Last Update: 2014-03-28
See Project
6

PubCurator

PubCurator is a biomedical text mining platform and validation helper built on top of Eclipse RCP.

Downloads: 0 This Week

Last Update: 2013-05-03
See Project
7

ProtoSM

A tool to aid in the execution of Systematic Mapping Studies.

Downloads: 0 This Week

Last Update: 2016-09-04
See Project
8

moara

Moara is a biological text mining tool and consists of a Java library and some auxiliary MySQL databases for gene/protein training and extraction of mentions and its further normalization and disambiguation.

Downloads: 0 This Week

Last Update: 2013-04-10
See Project
9

Tools for Text Analytics

This project is a compilation of tools/libraries to help with tasks related to Text Analytics mainly in Java. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.

Downloads: 1 This Week

Last Update: 2015-11-27
See Project
Level Up Your Cyber Defense with External Threat Management
See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.

Try for Free
10

OpenDMAP

OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.

Downloads: 0 This Week

Last Update: 2013-04-30
See Project
11

FIBER

The Fiber project seeks to create a modular open source text mining tool that provides a contextual foundation for analysis in the dissemination of large quantities of text data.

Downloads: 0 This Week

Last Update: 2015-05-20
See Project
12

@Note

@Note2 is now available in www.anote-project.org @Note is a Biomedical Text Mining workbench that integrates current Biomedical Text Mining (BioTM) methods and provides biologists with intuitive tools capable of supporting their bibliographic searches and further literature curation.

Downloads: 0 This Week

Last Update: 2013-07-22
See Project
13

iDocs

iDocs is a intellectual document work flow with text mining options project.

Downloads: 0 This Week

Last Update: 2014-04-08
See Project
14

hypKNOWsys

hypKNOWsys aims at developing a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)

Downloads: 0 This Week

Last Update: 2013-04-15
See Project
15

Java Data Mining Framework

JDMF is a data mining framework written in Java. Main features include: simplicity, flexibility, many algorithms to choose from, many formats of input (e.g. XML, CSV, JDBC, Java beans) and output data (e.g. XML, plain text info, charts).

Downloads: 1 This Week

Last Update: 2013-04-19
See Project
16

Namboo KDD

This project intends to create an indexing search engine, for knowledge management. The primary object is to apply an information retrieval core. And implement a knowledge data discovery theory such as data mining algorithm, text mining.

Downloads: 0 This Week

Last Update: 2013-03-13
See Project
17

TMG - Text Mining for german documents

TMG - Text Mining for german language documents

Downloads: 0 This Week

Last Update: 2013-03-07
See Project
18

TextToOnto

The aim of TextToOnto is to support developers in the ontology construction process by applying text mining techniques. For this purpose it builds on KAON (http://kaon.semanticweb.org)

Downloads: 0 This Week

Last Update: 2015-07-31
See Project
19

txtkit

txtkit is a visual text mining tool for exploring large amounts of multilingual texts. It's an multiuser-application which mainly focuses on the process of reading and reasoning as series of decisions and events.

1 Review

Downloads: 0 This Week

Last Update: 2013-02-27
See Project
20

webExtractor

webExtractor is a Java application that is used for extracting specific content from web based HTML, XML, CSV, and free form text. The extracted data can be used for data gathering and mining purposes.

Downloads: 7 This Week

Last Update: 2014-06-26
See Project
21

GraphSpider/MPL

GraphSpider is a pattern matcher which searches parsed text in phrase-structure tree or dependency graph format for syntactic structures matching a set of patterns in MPL, a regexp-like pattern language. Applications: information extraction, text mining.

Downloads: 0 This Week

Last Update: 2013-04-19
See Project
22

reputron

reputron is a knowledge extraction engine platform that covers all aspect of text mining, relevance, indexing and querying on a corpus of text documents.

Downloads: 0 This Week

Last Update: 2015-04-08
See Project

Previous
1
You're on page 2
Next

Related Searches

text summarization

text mining

opinion mining project

arff

java text area

data mining

wordnet 2.0

html source extractor

java text mining preprocessing

xml csv

Related Categories

Scientific/Engineering

Artificial Intelligence

Software Development

Business

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2025 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: