Showing 131 open source projects for "text processing"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    The DocBook Publishing Utilities tools, which make creation and publishing of DocBook easier. The tools are: Maven plug-in to Transform HTML into XML (use after docbkx); Eclipse DocBook table editor; Eclipse wizards for initial DocBook files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    EncTool

    Command line tool to detect and convert files encoding.

    Command line tool to detect and convert files encoding. Works with files or directories. Can be used to add or remove UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or highter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    FigTeX manages images and their easy inclusion in LaTeX documents. Similar to BibTex, the image information is stored in an external file and is imported into the document as needed. It comes with a comfortable GUI for managing the image library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    majix

    RTF to XML Converter

    With MajiX you can automatically transform RTF files (Microsoft Word files) in XML. MajiX is Java compliant. You can convert Headings, lists (numbered or not), tables, bold, italics, underline and some more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5

    Large Text File converter

    Java Based Heavy-duty utilitity to process large delimited text files

    TextZilla is a Multithreaded Java utility which can process huge size delimited text files to extract, convert, encode, decode, encrypt/decrypt text data from source and write it in desired output file or files. It provides fully extensible framework based on which Java classes can be created, for example it currently has MD5 conversion capability, based on same design classes for 3DES ,AES or any other Algo can be created. Another strength of this tool is in its configurability, it's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Text2XML

    Generic Method for Parsing Text into XML

    Text2XML provides a simple, declarative method of parsing and processing of raw text data into XML. Text2XML uses an XML configuration file which describes the desired XML output, as well as the rules for parsing the text file. Parsing is performed using XPath 2.0 and Regular Expressions. Text2XML can be used from the command line, or it can be embedded into another Java application.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7

    XML Text Declaration

    Support for XML text declaration handling

    The purpose of this library is to determine the text encoding of an input, and dynamically insert a filter, which allows the processing program to assume UTF-8 input, regardless of the actual input encoding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    A collection of DITA map and topic files used for checking the performance of tools such as the DITA-OT used to convert DITA to other formats, including recommended PIs for dealing with presentation needs not covered in the DITA specification. Primary host is now github, https://github.com/jeremygriffith/DITA-Test-Suite
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PROJECT HAS MOVED: https://github.com/wiki2beamer
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Kermit is a bundle for LaTeX for Russian tipography
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios. The source code itself can also be used as an example of how to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Java csveditor
    A CSV editor written in Java 1.6 Developed using NetBeans IDE 6.0.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16

    odtPHP

    Open Document Templating System for PHP

    OdtPHP is a PHP librairy designed to use OpenDocument file as a template for PHP. It's a kind of PHPLib for OpenOffice document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A C++ library to read and write PDF files, plus a GUI editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Versatile converter from Wiki-Markup-Language to LaTex.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Multiple format bibliography processor in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GOSH takes an Usenet-styled ASCII text as input and generates Latex-code, HTML, or Man-pages. Its transformation is easily customizable, which makes GOSH suited for creating web content, papers, letters, and technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The xslt2 script semAuth (semantic authoring) translates a freemind mindmap into an xhtml website and an RDF ontology
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    nanoWIME is a simple, flexible, easy-to-use javascript based WikiMarkup editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB