SourceForge
Browse Enterprise Blog Help Jobs
Log In or Join

Solution Centers

Go Parallel HTML5 Windows 8 Smarter IT Big Data
Newsletters
  • Home
  • Browse
  • Development
  • Text Processing
Advanced
Refine your search
Translations
  • English (771)
  • German (98)
  • French (65)
  • Russian (62)
  • Chinese (43)
  • Spanish (40)
  • Italian (37)
  • Japanese (36)
  • Brazilian Portuguese (32)
  • Polish (23)
  • Czech (20)
  • Dutch (19)
  • Chinese (18)
  • Portuguese (18)
  • Turkish (18)
License
  • OSI-Approved Open Source (1,464)
    • GNU General Public License version 2.0 (914)
    • GNU Library or Lesser General Public License version 2.0 (168)
    • BSD License (119)
    • GNU General Public License version 3.0 (77)
    • MIT License (55)
    • Apache License V2.0 (45)
    • Academic Free License (26)
    • Mozilla Public License 1.1 (25)
    • Artistic License (17)
    • Affero GNU Public License (14)
    • Open Software License 3.0 (12)
    • Apache Software License (10)
    • Common Public License 1.0 (10)
    • Mozilla Public License 1.0 (10)
    • Eclipse Public License (9)
  • Public Domain (77)
  • Other License (28)
  • Creative Commons Attribution License (10)
    • Creative Commons Attribution ShareAlike License V3.0 (1)
Programming Language
  • Java (397)
  • C++ (269)
  • C (200)
  • Python (155)
  • C# (122)
  • PHP (106)
  • Perl (87)
  • JavaScript (82)
  • Delphi/Kylix (59)
  • Visual Basic .NET (54)
  • Visual Basic (44)
  • Unix Shell (30)
  • XSL (24)
  • Pascal (19)
  • Ruby (17)
Status
  • 5 - Production/Stable (485)
  • 4 - Beta (454)
  • 3 - Alpha (232)
  • 1 - Planning (199)
  • 2 - Pre-Alpha (174)
  • 6 - Mature (43)
  • 7 - Inactive (43)
OS
  • Windows (1,380)
  • Linux (1,239)
  • Grouping and Descriptive Categories (1,096)
    • OS Independent (473)
    • All 32-bit MS Windows (326)
    • All POSIX (322)
    • 32-bit MS Windows (185)
    • OS Portable (119)
    • 32-bit MS Windows (64)
    • 64-bit MS Windows (41)
    • All BSD Platforms (39)
    • Project is OS Distribution-Specific (2)
    • Project is an Operating System Distribution (1)
    • Project is an Operating System Kernel (1)
  • Mac (955)
  • Modern (438)
    • Linux (266)
    • WinXP (171)
    • OS X (92)
    • Win2K (84)
    • Vista (60)
    • Windows 7 (43)
    • Solaris (25)
    • FreeBSD (17)
    • OpenBSD (7)
    • NetBSD (6)
  • BSD (277)
  • Other Operating Systems (89)
    • MS-DOS (19)
    • Other (15)
    • WinNT (15)
    • Microsoft Windows Server 2003 (10)
    • Win98 (10)
    • Apple Mac OS Classic (5)
    • WinME (4)
    • Console-based Platforms (3)
    • HP-UX (3)
    • IBM OS/2 (3)
    • Win95 (3)
    • IBM AIX (2)
    • AmigaOS (1)
    • BSD/OS (1)
    • BeOS (1)
Freshness
  • Recently updated (262)

Text Processing

Sort By
Most Popular
  • Most Popular
  • Last Updated
  • Name
  • Rating

Showing page 6 of 64.

  • IniTranslator Icon
    IniTranslator

    IniTranslator is a Windows tool for developers and users to simplify the translation and localization of ini style language files in a manner similar to how poEdit works. IniTranslator can also load and save other formats through its plugin interface.

    27 weekly downloads
  • Charset detector Icon
    Charset detector

    Library for automatic charset detection of a given text or file. Input buffer will be analysed to guess used encoding. The result (charset name or code page id) can be used as control parameter for charset conversation. Make your programs Unicode aware!

    26 weekly downloads
  • Early Access iText Icon
    Early Access iText

    Early Access iText, a PDF generation library in Java

    26 weekly downloads
  • LaTeX for Economists Icon
    LaTeX for Economists

    LaTeX classes and BibTeX styles for Economists

    26 weekly downloads
  • PDFizer Icon
    PDFizer

    A XHTML to PDF converter: with this library, you can transform simple XHTML pages to nice and printable PDF files. This project is based on the excellent webzine article "Pdfizer, a dumb HTML to PDF converter, in C#" written by Jonathan de Halleux.

    26 weekly downloads
  • Piccolo XML Parser for Java Icon
    Piccolo XML Parser for Java

    Piccolo is the fastest SAX parser for Java, supporting SAX1, SAX2, and JAXP (SAX only). Piccolo is different from other parsers in that it was developed using parser generators. It weighs 160K including XML APIs. See http://piccolo.sf.net for more info.

    26 weekly downloads
  • ViMate - a vi plugin for TextMate Icon
    ViMate - a vi plugin for TextMate

    Vi plugin for TextMate.

    16 weekly downloads
  • The Songs Package (for LaTeX) Icon
    The Songs Package (for LaTeX)

    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.

    25 weekly downloads
  • ASCIIMathML Icon
    ASCIIMathML

    ASCIIMathML.js: a JavaScript to convert ASCII math notation (and some LaTeX) to Presentation MathML while your webpage loads. Now also simple graphs are translates to SVG. Works with Firefox 2.0+ or with Internet Explorer 6/7+MathPlayer+Adobe SVGview.

    11 weekly downloads
  • Libxml2 for pascal Icon
    Libxml2 for pascal

    Pascal units accessing the popular XML API from Daniel Veillard ( http://www.xmlsoft.org ). This should be usable at least from Kylix and Delphi, but hopefully also from other Pascal compilers (like freepascal).

    10 weekly downloads
  • XSLT syntax highlighting Icon
    XSLT syntax highlighting

    Java based XSLT Processor extension for syntax highlighting

    15 weekly downloads
  • ArabicDictionary Icon
    ArabicDictionary

    Platform independant Arabic - Enligsh Dictionary in Java, uses morphological arabic stemming techniques for searching.

    42 weekly downloads
  • Novelang Icon
    Novelang

    Novelang is a document generator based on a Wiki syntax.

    42 weekly downloads
  • STED - Transliterator Icon
    STED - Transliterator

    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.

    42 weekly downloads
  • FreeDOS Edlin Icon
    FreeDOS Edlin

    The FreeDOS Edlin project is the standard line editor in the FreeDOS operating system.

    9 weekly downloads
  • DocFrac Icon
    DocFrac

    DocFrac is a document converter that can convert between RTF, HTML and ASCII text. This includes RTF to HTML and HTML to RTF. Supports text formatting (e.g. bold); tables; and most European languages. Available for Windows; Linux; ActiveX and DLL.

    41 weekly downloads
  • SE|PY ASEditor Icon
    SE|PY ASEditor

    SE|PY is an ActionScript editor written in python, wxPython and using scintilla for text highlight, code collapsing. some features: snippets panel, functions panel and much more. Contain also Flush

    40 weekly downloads
  • latex-mk Icon
    latex-mk

    LaTeX-Mk is a collection of makefile fragments for managing small to large LaTeX based documentation projects. The idea is that especially large documents, there may be many many steps required to typeset the document (export modified figures to postscr

    22 weekly downloads
  • Rephrase Icon
    Rephrase

    Rephrase is a simple string replacement application. The default package comes with a wordiness rule file, a 1337 rule file, and an English to French rule file. It can be used on the command line as part of a set of other tools as well.

    39 weekly downloads
  • Snippet Manager Icon
    Snippet Manager

    This program is a snippet repository, in which the user can store snippets in various languages both locally and online. Key features include syntax highlighting, clipboard logging, tabbed editing, share & favourite snippets! http://snippets.gabehabe.com

    38 weekly downloads
  • converting XML to formatted text Icon
    converting XML to formatted text

    xml2txt is a text formatter for XMl in the same way the FO is a PDF formatter. It uses python to convert an XML document to well-formatted text, wtih borders, indents, and tables.

    38 weekly downloads
  • wordTabulator Icon
    wordTabulator

    Program wordTabulator is intended for text analysis. With help of wordTabulator you can generate index of word elements extracted from defined text set. Word elements may be words, N-grams (of defined size) or phrases (syntagmes). The program can process texts as in ordinary 1-byte encoding (ANSI), as in multibyte UTF-8 encoding. Source texts are defined as a set of flat text files or HTML/XML/SGML documents. In the last case the program can filter content from markup. Moreover, you can process only defined content within selected paired tags. Or you can skip that content from processing. As additional feature you can analyse a pair of text sets and compare them by common or different elements. Output word index may be generated in HTML format and contain frequences of each text element and links to original content. Also it may be generated as a flat text file. Words in the index are ordered by alphabet, value or frequency.

    38 weekly downloads
  • chinese Shupai Icon
    chinese Shupai

    ChineseShupai can help one typeset a Chinese paragraph in a vertical way, known as Chinese 'Shupai',which can be used in webpages, blogs and various of other environment without third party support from, say, Office Word.

    34 weekly downloads
  • Tomoe Icon
    Tomoe

    Tomoe is a handwriting character recognition engine.

    33 weekly downloads
  • WordHTML(Convert Word to HTML) Icon
    WordHTML(Convert Word to HTML)

    WordHTML CV is an open source software that mass converts MS word documents into HTML files. It makes it easy even for people with no knowledge of HTML to produce professional-quality website. Whether you need to publish a one-page press release or a long

    33 weekly downloads
  • Back
  • …
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • …
  • Next

Staff Picks

  • Icon America's Army 2.5 Assist
  • Icon BibDesk
  • Icon boot-repair-disk
  • Icon The FreeType Project
  • Icon KXStudio
  • Icon NAS4Free
  • Icon Password Safe
  • Icon RSS Owl | RSS / RDF / Atom Feed Reader
  • Icon Zentyal Linux small business server

Top Downloaded

Powered by Dice Logo Latest Tech Jobs

  • Loading... The latest tech jobs.
See All Jobs ››
SourceForge
About Site Status @sfnet_ops
Find and Develop Software
Create a Project Software Directory Top Downloaded Projects
Community
Blog @sourceforge Job Board
Help
Site Documentation Support Request Real-Time Support
Copyright © 2013 Dice. All Rights Reserved.
SourceForge is a Dice Holdings, Inc. service.
Terms Privacy Cookies/Opt Out Advertise SourceForge.JP Big Data