SourceForge
Browse Enterprise Blog Help Jobs
Log In or Join

Solution Centers

Go Parallel HTML5 Windows 8 Smarter IT Big Data
Newsletters
  • Home
  • Browse
  • Text Processing
Advanced
Filters
  • OS: OS Independent (Written in an interpreted language) ×
  • License: BSD License ×
Refine your search
Translations
  • English (22)
  • Dutch (3)
  • Japanese (3)
  • Chinese (2)
  • Chinese (2)
  • French (2)
  • Korean (2)
  • Russian (2)
  • Afrikaans (1)
  • Arabic (1)
  • Brazilian Portuguese (1)
  • German (1)
  • Hungarian (1)
  • Indonesian (1)
  • Italian (1)
License
  • OSI-Approved Open Source (42)
    • GNU General Public License version 2.0 (3)
    • Apache License V2.0 (1)
    • Common Development and Distribution License (1)
    • Common Public License 1.0 (1)
    • Open Software License 3.0 (1)
    • zlib/libpng License (1)
  • Other License (1)
Programming Language
  • Java (19)
  • Python (8)
  • PHP (6)
  • C (3)
  • Perl (3)
  • Ruby (2)
  • AWK (1)
  • C# (1)
  • C++ (1)
  • Haskell (1)
  • JavaScript (1)
  • Visual Basic (1)
Status
  • 5 - Production/Stable (17)
  • 4 - Beta (11)
  • 3 - Alpha (6)
  • 2 - Pre-Alpha (4)
  • 1 - Planning (3)
  • 7 - Inactive (2)
  • 6 - Mature (1)
OS
  • Grouping and Descriptive Categories (42)
    • All 32-bit MS Windows (7)
    • All POSIX (5)
    • 32-bit MS Windows (2)
    • 32-bit MS Windows (2)
    • OS Portable (2)
    • 64-bit MS Windows (1)
    • All BSD Platforms (1)
  • Linux (42)
  • Mac (42)
  • Windows (42)
  • Modern (4)
    • Linux (4)
    • OS X (1)
    • Solaris (1)
    • Win2K (1)
    • WinXP (1)
  • Emulation and API Compatibility (1)
    • Cygwin (1)
    • Fink (1)
Freshness
  • Recently updated (9)

Text Processing

Sort By
Most Popular
  • Most Popular
  • Last Updated
  • Name
  • Rating

Showing page 1 of 2.

  • PDFBox Icon
    PDFBox

    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.

    276 weekly downloads
  • RText Icon
    RText

    RText is a customizable programmer's text editor written in Java. Some of its features include: syntax highlighting, editing multiple documents at once, printing and print preview, find/replace/find in files dialogs, undo/redo, and online help.

    93 weekly downloads
  • jPod intarsys PDF library Icon
    jPod intarsys PDF library

    jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com

    37 weekly downloads
  • FMPP - FreeMarker-based PreProcessor Icon
    FMPP - FreeMarker-based PreProcessor

    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extendable in Java to display data from any data sources (as database). Can generate complete homepages (tree of HTML-s, image

    33 weekly downloads
  • farsi-commons Icon
    farsi-commons

    A Java toolbox with commonly used Farsi Language functions. Includes functions for text manipulation, standardization, normalization, search, replace and changing words and ligatures. Fixing White space problems, Jalai date and Calendar, etc...

    75 weekly downloads
  • DocDiff: Compare text word by word Icon
    DocDiff: Compare text word by word

    DocDiff compares two text files and shows the difference.

    23 weekly downloads
  • Morfologik Icon
    Morfologik

    Polish morphological analyzer and Java libraries interfacing it. First completely open-source and comprehensive morphological tools and finite-state technology for Polish and other languages.

    63 weekly downloads
  • latex-mk Icon
    latex-mk

    LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX based documentation projects. The idea is that especially large documents, there may be many many steps required to typeset the document (export modified figures to postscr

    24 weekly downloads
  • Conversion of other file formats to PDF Icon
    Conversion of other file formats to PDF

    xtopdf: Tools to convert other formats (x) to PDF; x as in math. - solve for x :-) Currently x == {.txt, .DBF}. Others to follow. Benefits: all those of PDF (better cross-platform viewing/printing, read-only, etc.)

    7 weekly downloads
  • NaNoWriTool Icon
    NaNoWriTool

    NaNoWriTool is a text editor with features specifically geared towards NaNoWriMo, the National Novel Writing Month. It contains a live word counter, daily word count target, a timer for word wars and automatic backup feature.

    8 weekly downloads
  • wordaxe Icon
    wordaxe

    wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.

    9 weekly downloads
  • Java Text Processing Framework Icon
    Java Text Processing Framework

    A framework that allows textprocessing to be integrated into any Java application in a generic manner. It represents an approbiate abstraction of the necessary elements and offers a generic interface to the application that needs a textprocessing service.

    3 weekly downloads
  • Wiki2TEI Icon
    Wiki2TEI

    Convert wiki pages to TEI.

    3 weekly downloads
  • Markout Icon
    Markout

    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.

    2 weekly downloads
  • Notepage Icon
    Notepage

    A +featured text editor based on Java.

    2 weekly downloads
  • RCodeLeveler Icon
    RCodeLeveler

    A Ruby file parser/interpreter/preprocessor that comments lines of code based on conditions at the time the file is required. Very handy to implement debugging logs and code that has to be commented (not just dynamically switched off).

    2 weekly downloads
  • jtr Icon
    jtr

    Java library that emulates the Perl 5 "transliterate" operation on a given string. Most Perl 5 features are supported, including all the standard modifiers and most Perl escape sequences. Patterns are compiled for speed, and runtime performance is fast.

    2 weekly downloads
  • TexBeans Icon
    TexBeans

    a set LaTex plugins for Netbeans with full project management (multiple files allowed), editor, code completion (Ctrl-Space), build and view support (latex, bibtex and xdvi, linux), code injection (Alt-Enter), spellcheck, error and warning handling ....

    1 weekly downloads
  • D2AC-A2DC Icon
    D2AC-A2DC

    From March 2011, this project has moved to: rcrl.sourceforge.net

    1 weekly downloads
  • DocBook Publishing Utilities Icon
    DocBook Publishing Utilities

    The DocBook Publishing Utilities tools, which make creation and publishing of DocBook easier. The tools are: Maven plug-in to Transform HTML into XML (use after docbkx); Eclipse DocBook table editor; Eclipse wizards for initial DocBook files.

    1 weekly downloads
  • Doco Icon
    Doco

    Doco is a simple but feature rich and powerful markup language for converting text documents into highly-presentable and navigable web content.

    1 weekly downloads
  • Hierarchical Project Tree Icon
    Hierarchical Project Tree

    This tool is designed to help break a project down into smaller and smaller chunks, allowing you to go into fine detail without losing sight of the big picture. Particularly good for certain types of dyslexia.

    1 weekly downloads
  • JNotePad Icon
    JNotePad

    JNotePad is a very flexible text editor. With lots of modules, cou can create your own user-friendly editor. By choosing only the modules YOU need, you get a very productive editor.

    1 weekly downloads
  • Java Transliterator Icon
    Java Transliterator

    translit is a J2EE web application written in Java to execute convertion between different encodings.

    1 weekly downloads
  • Pyana Icon
    Pyana

    Pyana is a extension module that allows Python programs to interface with the Apache Software Foundation's Xalan XSLT transformation engine.

    1 weekly downloads
  • Back
  • 1
  • 2
  • Next

Staff Picks

  • Icon Clover EFI bootloader
  • Icon Dungeon Crawl Reference
  • Icon Data Crow
  • Icon LibreCAD
  • Icon OpenCPN
  • Icon OS4
  • Icon Pinguy OS
  • Icon PNotes
  • Icon Synfig

Top Downloaded

Powered by Dice Logo Latest Tech Jobs

  • Loading... The latest tech jobs.
See All Jobs ››
SourceForge
About Site Status @sfnet_ops
Find and Develop Software
Create a Project Software Directory Top Downloaded Projects
Community
Blog @sourceforge Job Board
Help
Site Documentation Support Request Real-Time Support
Copyright © 2013 Dice. All Rights Reserved.
SourceForge is a Dice Holdings, Inc. service.
Terms Privacy Cookies/Opt Out Advertise SourceForge.JP Big Data