Showing 735 open source projects for "text processing"

  • 1
    A Perl script that splits a long HTML file into separate inter-linked pages, according to the headings in the original file. Useful for maintaining both a print version and a browsable version of a site.
    Downloads: 3 This Week
  • 2
    ooo2sdbk, an OpenOffice to DocBook converter, is a set of XSLT stylesheets for converting OpenOffice Writer documents to Simplified DocBook. The ooo2sdbk stylesheets run with any XSLT processor (Saxon, xsltproc, etc.) on any platform.
    Downloads: 0 This Week
  • 3
    Comprehensive DocBook XML processing solution for MacOS X
    Downloads: 0 This Week
  • 4
    Tigerleaf's simple XML documents build rich, manageable sites and PDF publications. Tigerleaf eases XML authoring and publishing with versioning, code generation, management and workflow features.
    Downloads: 0 This Week
  • 5
    Very simple C-like preprocessor. Syntax is extensible via an "in code" directive.
    Downloads: 0 This Week
  • 6
    XML DTD and related tools for documenting Tcl packages.
    Downloads: 0 This Week
  • 7
    CPIA is a macro-processing engine for XML (and HTML), written in C. The engine can either be used offline as a processor, or inside a web server. Both developers have lost interest. If you are interested in maintaining it, please contact either admin.
    Downloads: 0 This Week
  • 8
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
  • 9
    One-Liner creates one-line paragraphs from CR/LF- or LF-delimited text. Scanners normally return delimited text; substitution of complementary characters such as `` and '' for " in paragraphs becomes easier when the delimiters are removed.
    Downloads: 0 This Week
  • 10
    GNU FriBidi is the Free Implementation of the Unicode Bidirectional Algorithm. GNU FriBidi development has been moved to GitHub. See https://github.com/fribidi/fribidi/
    Downloads: 0 This Week
  • 11
    Provides a simple Java .jar file for converting DocBook files to HTML, FO, or XHTML, and includes all the XSL files needed. Great for cross-platform DocBook conversions and Ant build scripts.
    Downloads: 0 This Week
  • 12
    Visual XML editor. 100% pure TCL/TK application.
    Downloads: 0 This Week
  • 13
    EDGE (Electronic Document General Encoding) is an SGML-based markup language for general documents, including scientific papers, technical and computer documentation, prose, drama, etc. It aims to be less restrictive than comparable DTDs (e.g. DocBook or TEI).
    Downloads: 1 This Week
  • 14
    TXE is a GUI XML editor written in Java using the DOM (Document Object Model) parser provided by Oracle.
    Downloads: 0 This Week
  • 15
    reStructuredText defines and implements a markup syntax for use in Python docstrings and other documentation domains that is readable and simple, yet powerful. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
  • 16
    PDF library in .NET (deprecated). The developer is not continuing the project. Find the dev at http://raasiel.typepad.com
    Downloads: 0 This Week
  • 17
    dblup is a tool to output LaTeX from DocBook using Perl.
    Downloads: 0 This Week
  • 18
    Ents is a small Java package designed to simplify the process of converting XML entity references to character references and vice-versa. Ents uses XML to specify lists of equivalent names and character references. Ents supports single character entities
    Downloads: 0 This Week
  • 19
    A small Objective-C library which provides a SAX-like object-oriented interface to the Expat XML parser library.
    Downloads: 3 This Week
  • 20
    This project will compile a Hungarian wordlist for use with spell-checkers like aspell. Additionally, it will develop generic tools useful for compiling and maintaining wordlists for any language.
    Downloads: 8 This Week
  • 21
    A Java application for statistical analysis and systematic manipulation of natural language texts.
    Downloads: 0 This Week
  • 22
    Roap scans a text file, extracts the regions that match specified patterns, and processes them sequentially with specified executables. Each executable reads a region on its stdin, and the combined stdout of all executables is written to Roap's stdout
    Downloads: 0 This Week
  • 23
    MacBibTk is a Mac-compatible version of Peter Corke's tkbibtex (release 9), a BibTeX file editor and browser. BibTeX is a reference/citation system for use with LaTeX. MacBibTk runs on all platforms with Tcl/Tk ports.
    Downloads: 0 This Week
  • 24
    J2ME Memopad is a simple MIDP application designed to allow storage and retrieval of notes. It will have the ability to search and generate a list of results, as well as categorize your memos. The basic design of the memopad is similar to the Palm.
    Downloads: 3 This Week
  • 25
    FileExtender is a Perl script that evaluates embedded SQL statements in any kind of text file (including HTML files) and extends these files with the results of the database queries.
    Downloads: 0 This Week
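
The line-joining step described for One-Liner (entry 9) can be illustrated with a minimal Python sketch. This is not One-Liner's own code: the function name `join_paragraphs` is hypothetical, and the sketch assumes that paragraphs are separated by one or more blank lines, which the listing does not state.

```python
import re


def join_paragraphs(text: str) -> str:
    """Collapse the line breaks inside each paragraph so it fits on one line.

    Hypothetical helper, not taken from One-Liner itself.
    """
    # Normalize CR/LF (and bare CR) line endings to LF.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = re.split(r"\n\s*\n", text.strip())
    # Join the lines of each paragraph with single spaces,
    # then emit one paragraph per output line.
    return "\n".join(" ".join(p.split()) for p in paragraphs)


if __name__ == "__main__":
    scanned = "A scanned line\r\nthat continues here.\r\n\r\nSecond\nparagraph.\n"
    print(join_paragraphs(scanned))
```

Once each paragraph sits on a single line, a quote substitution of the kind the entry mentions only has to consider one line at a time.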