Showing 61 open source projects for "text processing"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1

    Text2XML

    Generic Method for Parsing Text into XML

    Text2XML provides a simple, declarative method of parsing and processing of raw text data into XML. Text2XML uses an XML configuration file which describes the desired XML output, as well as the rules for parsing the text file. Parsing is performed using XPath 2.0 and Regular Expressions. Text2XML can be used from the command line, or it can be embedded into another Java application.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios. The source code itself can also be used as an example of how to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Java csveditor
    A CSV editor written in Java 1.6 Developed using NetBeans IDE 6.0.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    HyFo provides hyphenation services in Java 5 for developers in multiple languages. It extends TeX-style hyphenation to directly support re-spelling hyphenations such as TeX's discretionary hyphens.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Shared Questionnaire System
    Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    DITA2wiki is a toolkit that enables you to publish DITA content (maps and topics) to a wiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    This tools compare two xml files and register the diferences in xpaht format.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A package that works with Apache Velocity and allows you to build web pages and other content from reusable Velocity template fragments, controlled by hierarchies of layout descriptions in XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The DITA Open Platform is a free, open-source project which goal is to provide an enterprise platform for the edition, management and processing of DITA documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    JFileReader allows users to view the contents of any file in a bytewise manner using a simple GUI interface. Functions include byte view changes to different primitive datatypes, endianess changes, and file content exports to new files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Convert plain text format RFC docs into open format, such as HTML, PDF. Features: Index page link, Document reference link, Figure/Table reference link; customizable CSS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    twexter formats twin twext translations to help us learn language .. demo: http://test.twext.com .. javascript code is open at http://github.com/tudisco/twexter
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    JLoom is a JSP like template language for text generation - e.g. source code, HTML, XML. JLoom templates are modular encapsulated. Parameters can be any Java type, even Generics or Varargs. There is a plugin for Eclipse and a command line tool.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Some text parser, that transforms a custom TeX-like syntax into XML. Un parseur, qui produit du XML à partir d'une syntaxe ressemblant à TeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Html Optimizer is an optimizer for optimize html files by shrink them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Use Xilize to create XHTML pages or entire websites with just a plain-text editor. The markup is similar to Textile and extensible via BeanShell. Run as a jEdit plugin, from the command line, or embed in a Java program. Small, fast, easy-to-use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Simple plain text layout library. Can be utilized for html-to-text (html2text) conversion with its HTML reading support.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    CPLed is an OpenSIPS tool for editing CPL scripts in a friendly and easy graphical way. It can be used as a standalone application or embedded in a web page as applet. It also provide CPL script transport functionalities via SIP and HTTP protocols.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    An extensible tool to validate Open Office Text documents for accessability standards.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB