Showing 156 open source projects for "text processing"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    A collection of DITA map and topic files used for checking the performance of tools such as the DITA-OT used to convert DITA to other formats, including recommended PIs for dealing with presentation needs not covered in the DITA specification. Primary host is now github, https://github.com/jeremygriffith/DITA-Test-Suite
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    PROJECT HAS MOVED: https://github.com/wiki2beamer
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Kermit is a bundle for LaTeX for Russian tipography
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    ExiProcessor

    Command-line program for processing Efficient XML Interchange (EXI)

    ExiProcessor is a command-line program that encodes text XML files into binary EXI and decodes EXI files into XML. It uses the open source Java-based library EXIficient (http://exificient.sourceforge.net) as the EXI parser. In essence, ExiProcessor is a command-line interface to EXIficient. ExiProcessor can help people learn about the various EXI encoding and decoding options and how those options affect compression ratios. The source code itself can also be used as an example of how to...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A set of macros for usage of Vim + Cream as a LaTeX editor
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    (Hyper)Markup can: 1) Edit HTML or text files, 2) tidy up your HTML source code and produce formatted text, 3) Flexisibly generate XML codes or XHTML files from given HTML files with XML template and XSL stylesheet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    pdfInspect
    pdfInspect offers a flexible GUI interface for viewing the internal structure and content of a PDF file. Wraps the Apache PDFBox library; example of application built with Superficial http://superficial.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Java csveditor
    A CSV editor written in Java 1.6 Developed using NetBeans IDE 6.0.1
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Java library for reading and writing of flat files. CSV, FLR (fixed length record) or mixed structures. Tree-style processing API. Adapters for SAX, Stax and XStream for transformation, data binding or serialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12

    odtPHP

    Open Document Templating System for PHP

    OdtPHP is a PHP librairy designed to use OpenDocument file as a template for PHP. It's a kind of PHPLib for OpenOffice document.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A C++ library to read and write PDF files, plus a GUI editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Versatile converter from Wiki-Markup-Language to LaTex.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Multiple format bibliography processor in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GOSH takes an Usenet-styled ASCII text as input and generates Latex-code, HTML, or Man-pages. Its transformation is easily customizable, which makes GOSH suited for creating web content, papers, letters, and technical documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The xslt2 script semAuth (semantic authoring) translates a freemind mindmap into an xhtml website and an RDF ontology
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    nanoWIME is a simple, flexible, easy-to-use javascript based WikiMarkup editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    HTMLtools includes several Java HTML tools for preparing Web pages. The HTMLtools program automates batch conversion of tab-delimited spreadsheet text files to HTML Web-page files, file & table editing, keyword mapping, templates, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PDML is an informal markup language written in PHP that is similar to HTML. It allows for the creation of complex PDF documents and can also be used in conjunction with PHP, to define templates which can generate dynamic PDF documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Heirarchy Hacker is a command-line tool for walking through any directory tree of hierarchical human-readable data files to make large, batch changes. The system is primarily designed for TASpring FBI/TDF files, with plans for future expansion.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Shared Questionnaire System
    Shared Questionnaire System(SQS) is a full-functional Optical Mark Reader(OMR) form processing system implemented in Java-Swing, XSL-FO and AJAX with straightforward GUIs. It is aimed at developing social platform to share knowledge about questionnaire.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DITA2wiki is a toolkit that enables you to publish DITA content (maps and topics) to a wiki.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB