Open Source Java Text Processing Software - Page 2

Java Text Processing Software

View 91 business solutions

Browse free open source Java Text Processing Software and projects below. Use the toggles on the left to filter open source Java Text Processing Software by OS, license, language, programming language, and project status.

  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Web Book Downloader

    Web Book Downloader

    Download websites as e-book: pdf, txt, epub.

    This application allows user to download chapters from website in 3 ways: - from table of contents; - from range: first chapter address, last chapter address; - by crawling from first chapter to n; In settings you can customize language, input(website encoding) for simplicity output is in the same encoding. If you want your language add new class into strings package, and new fields into Settings class and GUI menu(initialize method).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Early Access iText, a PDF generation library in Java
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    JODReports is a solution for generating dynamic documents and reports in Java based on the OpenDocument format (ODF). Templates can be easily composed with a word processor such as OpenOffice.org Writer. Data sources include POJOs and XML.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    GATE
    NOTE THAT THE SOURCE CODE AND ISSUE TRACKER HAVE NOW MOVED TO GITHUB. FIND US AT https://github.com/GateNLP/ GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Regular Expression Editor (RegExpEditor)

    Regular Expression Editor (RegExpEditor)

    regex as a tool, not as a problem

    Regular Expressions (aka regex, regexp) made easy. This simple tool manipulates text with regular expressions. Highlighting of regular expression results. See the real power of regex! Use Scala to do manipulate your search results even more.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    XML Differencing and Patching tools. XML based tools to mimic the functionality of traditional line based diff and patch utils, except operating on the hierarchical structure of XML.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    The Text Trix editor is an open-source, cross-platform text editor with a mind for user-friendly file and text navigation. See our new GitHub home for the latest and greatest Text Trix development: https://github.com/the4thchild/texttrix
    Downloads: 2 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    CPLed is an OpenSIPS tool for editing CPL scripts in a friendly and easy graphical way. It can be used as a standalone application or embedded in a web page as applet. It also provide CPL script transport functionalities via SIP and HTTP protocols.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    A universal suite of utilities for large corpora processing.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    A Java toolbox with commonly used Farsi Language functions. Includes functions for text manipulation, standardization, normalization, search, replace and changing words and ligatures. Fixing White space problems, Jalai date and Calendar, etc...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    SimplyHTML is an application and a java component for rich text processing. It stores documents as HTML files in combination with Cascading Style Sheets (CSS). SimplyHTML is not intended to be used as an editor for web pages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    OmegaT+ CAT Tools
    A translation tools suite for Computer-Aided Translation / Computer-Assisted Translation (CAT). A translation processor with translation memory, machine translation and project support, bitext aligner/converter, TMX validator, and others.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    AESTextCrypt

    AESTextCrypt

    Encrypt and decrypt text using AES 256 bit encryption

    AESTextCrypt is an easy-to-use open source tool for text encryption and decryption. Primarily intended for use with email, use it wherever you need to protect text from prying eyes. The encrypted text can be copy/pasted into any text-handling application (e.g. email) instead of plain text. Convenience buttons are provided for clipboard operations. AESTextCrypt uses AES-256 bit encryption which is the strongest available encryption scheme. It also employs bcrypt, which implements key-stretching and an adaptive key setup phase, the complexity (number of rounds) of which is automatically set to match the processing power of the encrypting computer. This makes it highly resistant to dictionary attack. AESTextCrypt is written in Java, so can be run on all desktop platforms - Windows, Mac and Linux.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    AurigaDoc is a java-xml based documentation tool for writing documents in xml format and converting them to other formats like HTML, DHTML, PDF, PostScript, Formating object(FO), RTF, Java Help, HTML Help, Oracle Help, etc. AurigaDoc is not an xml editor
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    Powerful text editor for fb2 books file processing with a text bugs fixing.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    J2ME Memopad is a simple MIDP application designed to allow storage and retrieval of notes. It will have the ability to search and generate a list of results, as well as categorize your memos. The basic design of the memopad is similar to the Palm.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Xelem is a compact Java-library to read and write Excel files of type SpreadsheetML. It can produce sophisticated, intricate and complex spreadsheets from within any Java program. And, since the release of xelem.2.0, it can read xml-spreadsheets.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    Mavscript

    Mavscript

    Calculations in a text document

    Mavscript allows the user to do calculations in a text document. Plain text, LaTeX and OpenOffice Writer files (.odt) are supported. The calculation is done by the algebra system Yacas (default), Jasymca or by the Java interpreter BeanShell.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Script Echo Color

    Script Echo Color

    Terminal text colorizing and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    OmegaT+ Computer Assisted Translation (CAT) tools platform that includes OmegaT+ (translation processor), bitext2tmx (aligner/TMX editor), and Validator (TMX validation).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    ant4docbook

    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB