SourceForge
Browse Enterprise Blog Help Jobs
Log In or Join

Solution Centers

Go Parallel HTML5 Windows 8 Smarter IT Big Data
Newsletters
  • Home
  • Browse
  • Text Processing
Advanced
Filters
  • Status: 4 - Beta ×
  • OS: Linux ×
Refine your search
Translations
  • English (179)
  • German (27)
  • Russian (16)
  • French (15)
  • Brazilian Portuguese (7)
  • Japanese (7)
  • Spanish (6)
  • Dutch (5)
  • Italian (4)
  • Chinese (3)
  • Polish (3)
  • Hungarian (2)
  • Persian (2)
  • Turkish (2)
  • Ukrainian (2)
License
  • OSI-Approved Open Source (324)
    • GNU General Public License version 2.0 (196)
    • GNU Library or Lesser General Public License version 2.0 (46)
    • BSD License (25)
    • GNU General Public License version 3.0 (21)
    • Apache License V2.0 (11)
    • MIT License (11)
    • Artistic License (5)
    • Academic Free License (3)
    • Affero GNU Public License (3)
    • Common Public License 1.0 (3)
    • Eclipse Public License (3)
    • Mozilla Public License 1.1 (3)
    • Adaptive Public License (2)
    • GNU Library or Lesser General Public License version 3.0 (2)
    • Open Software License 3.0 (2)
  • Public Domain (11)
  • Other License (4)
Programming Language
  • Java (117)
  • C++ (53)
  • C (43)
  • Python (37)
  • PHP (25)
  • Perl (22)
  • JavaScript (21)
  • Unix Shell (12)
  • C# (10)
  • Delphi/Kylix (8)
  • Pascal (5)
  • Tcl (5)
  • Ruby (3)
  • BASIC (2)
  • JSP (2)
Status
  • 5 - Production/Stable (34)
  • 3 - Alpha (9)
  • 7 - Inactive (6)
  • 1 - Planning (4)
  • 2 - Pre-Alpha (2)
  • 6 - Mature (2)
OS
  • Windows (289)
  • Grouping and Descriptive Categories (262)
    • OS Independent (147)
    • All POSIX (101)
    • All 32-bit MS Windows (50)
    • OS Portable (35)
    • 32-bit MS Windows (18)
    • All BSD Platforms (12)
    • 32-bit MS Windows (9)
    • 64-bit MS Windows (5)
  • Mac (257)
  • BSD (88)
  • Modern (77)
    • Linux (69)
    • OS X (17)
    • WinXP (16)
    • Win2K (9)
    • FreeBSD (4)
    • Vista (4)
    • NetBSD (3)
    • Windows 7 (3)
    • Solaris (2)
    • OpenBSD (1)
  • Other Operating Systems (17)
    • Other (5)
    • WinNT (4)
    • MS-DOS (2)
    • IBM AIX (1)
    • IBM OS/2 (1)
    • Microsoft Windows Server 2003 (1)
    • Win98 (1)
    • WinME (1)
Freshness
  • Recently updated (63)

Text Processing

Sort By
Most Popular
  • Most Popular
  • Last Updated
  • Name
  • Rating

Showing page 1 of 14.

  • iText®, a JAVA-PDF library Icon Enterprise
    iText®, a JAVA-PDF library

    iText is an open source Java library for PDF generation and manipulation. It can be used to create PDF documents from scratch, to convert XML to PDF (using the extra XFA Worker DLL), to fill out interactive PDF forms, to stamp new content on existing PDF documents, to split and merge existing PDF documents, and much more. Several iText engineers are actively supporting the project on the iText mailing-list itext-questions@lists.sourceforge.net.

    3,759 weekly downloads
  • LaTeX to RTF converter Icon
    LaTeX to RTF converter

    LaTeX to RTF convertor that handles equations, figures, and cross-refe

    805 weekly downloads
  • pdftohtml Icon
    pdftohtml

    Pdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.

    687 weekly downloads
  • Diffuse Icon
    Diffuse

    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.

    563 weekly downloads
  • jPDF Tweak Icon
    jPDF Tweak

    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.

    394 weekly downloads
  • Colorer Library Icon
    Colorer Library

    Colorer provides source text syntax highlighting services. It colorizes source codes in editor systems (more than 200 syntaxes). Uses powerful HRC format(XML, RE, context free grammas), allowing to support any language. Available as Eclipse plugin.

    323 weekly downloads
  • PDF Clown Icon
    PDF Clown

    Crunch PDF files the fun way!

    210 weekly downloads
  • Vrapper Icon
    Vrapper

    Vim-like editing in Eclipse

    122 weekly downloads
  • Anaphraseus Icon
    Anaphraseus

    Anaphraseus is a CAT (Computer Aided Translation) tool, OpenOffice.org 2-3 macro set similar to famous Wordfast. It works with the Wordfast Translation Memory format (*.TXT), and supports text segmentation.

    126 weekly downloads
  • omegat-plugins Icon
    omegat-plugins

    Third-party plugins for OmegaT (https://sourceforge.net/projects/omegat)

    108 weekly downloads
  • bitext2tmx CAT bitext aligner/converter Icon
    bitext2tmx CAT bitext aligner/converter

    A free computer-aided translation / computer-assisted translation (CAT) tool to align and converter bitext into TMX translation memory format to be used in other CAT tools by translators and other language professionals.

    84 weekly downloads
  • RegexKit Icon
    RegexKit

    An Objective-C Framework for Regular Expressions using the PCRE Library for Mac OS X Cocoa and GNUstep.

    84 weekly downloads
  • XML differencing and patching tools Icon
    XML differencing and patching tools

    XML Differencing and Patching tools. XML based tools to mimic the functionality of traditional line based diff and patch utils, except operating on the hierarchical structure of XML.

    201 weekly downloads
  • OmegaT+ CAT Tools Icon
    OmegaT+ CAT Tools

    A translation tools suite for Computer-Aided Translation / Computer-Assisted Translation (CAT). A translation processor with translation memory, machine translation and project support, bitext aligner/converter, TMX validator, and others.

    71 weekly downloads
  • rtf2html converter Icon
    rtf2html converter

    RTF to HTML converter for use both with your applications and as a standalone tool. Small and fast. Processes tables better than any other tool I've seen.

    40 weekly downloads
  • Banjon - Multilingual Input Method Icon
    Banjon - Multilingual Input Method

    Banjon is a context sensitive input method (transliterator) for natural Languages. It is designed to analyze the pattern and context of input character sequences and generate output characters based on a predefined map script.

    98 weekly downloads
  • Diff shell extension Icon
    Diff shell extension

    Diff-ext is an extension for filemanagers such as Windows Explorer and Nautilus that allows to launch diff/merge tools on selected files.

    27 weekly downloads
  • TextRoom Icon
    TextRoom

    Open Source Cross Platform Full Screen Rich Text Editor For Writers

    30 weekly downloads
  • DocDiff: Compare text word by word Icon
    DocDiff: Compare text word by word

    DocDiff compares two text files and shows the difference.

    23 weekly downloads
  • FarsiWeb Icon
    FarsiWeb

    General information, and a pack of tools for manipulating the Persian (Farsi) language and script, on different platforms and operating systems.

    65 weekly downloads
  • Treebeard Icon
    Treebeard

    (XSLT transformer/editor) A text editor that allows the loading and editing of an XML document and an XSLT document at the same time. It also can apply the XSLT to the XML and display the output for further editing/saving. Plugable XML and XSLT parsers

    64 weekly downloads
  • PyRTF Icon
    PyRTF

    PyRTF is a pure python module for the efficient creation of RTF documents.

    62 weekly downloads
  • regexxer Icon
    regexxer

    regexxer is a nifty GUI search/replace tool featuring Perl-style regular expressions. If you need project-wide substitution and you're tired of hacking sed command lines together, then you should definitely give regexxer a try.

    12 weekly downloads
  • My Dream Diary Icon
    My Dream Diary

    My Dream Diary is a computer diary, that allows you to create and manage descriptions of dreams. Password protection, archiving, statistics, dream signs are the basic functions of this program.

    54 weekly downloads
  • TruBlog: The true weblogging Icon
    TruBlog: The true weblogging

    Lightweight system for running a weblog. Features multiple authors, topics, Trackback, RSS (amongst others). TruBlog comes with easy installation and strong caching mechanisms, it's localisable and produces a valid XHTML. Theming is done through CSS.

    50 weekly downloads
  • Back
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • …
  • Next

Staff Picks

  • Icon Clover EFI bootloader
  • Icon Dungeon Crawl Reference
  • Icon Data Crow
  • Icon LibreCAD
  • Icon OpenCPN
  • Icon OS4
  • Icon Pinguy OS
  • Icon PNotes
  • Icon Synfig

Top Downloaded

Powered by Dice Logo Latest Tech Jobs

  • Loading... The latest tech jobs.
See All Jobs ››
SourceForge
About Site Status @sfnet_ops
Find and Develop Software
Create a Project Software Directory Top Downloaded Projects
Community
Blog @sourceforge Job Board
Help
Site Documentation Support Request Real-Time Support
Copyright © 2013 Dice. All Rights Reserved.
SourceForge is a Dice Holdings, Inc. service.
Terms Privacy Cookies/Opt Out Advertise SourceForge.JP Big Data