Showing 161 open source projects for "text processing"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    The objective of the OpenBerg Project is to develop Open-Source, Open-Standards-based, Multi-Platform tools for eBook authors, editors and users. We are currently working on OpenBerg Lector, an e-Book reader, and OpenBerg Rector, an e-Book compiler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    A basic html editor that I have been developing. In first stages of development. I hope to make this a very easy-to-use text/html editor and I have begun adding insert functions for common tags. I plan to add insert function for common Html Tags.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Collection of tools for input, reading, processing, and typesetting Taiwanese language. Includes SCIM and quail input methods, Firefox dictionary plugin, plus scripts for LaTeX and HTML generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    LiMa means Lightweight Markup Language. It is a parser for an easy to use ASCII/Text-based markup - comparable to Markdown or the Wikipedia-Markup language with special configurable extensions in defining Links and image-resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    A lightweight toolkit for efficient processing of XML data. The tools are analagous to the UNIX command-line text processing tools sort, grep, etc. The infrastructure includes an efficient DFA-based engine for streaming evaluation of XPath expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The Newswriter is a simple script to publish and administrate your News. There is no need for a database. The News a stored in a simple file. I've coded this script because the others I found were to complicated and not adaptable enough.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Markout is a pure-Java lightweight wiki markup parser based on John Gruber's Markdown.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    CPLed is an OpenSIPS tool for editing CPL scripts in a friendly and easy graphical way. It can be used as a standalone application or embedded in a web page as applet. It also provide CPL script transport functionalities via SIP and HTTP protocols.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SilverCity is a lexing package, based on Scintilla, that can provide lexical analysis for over 20 programming and markup langauges. Scripting language bindings currently exist for Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    This tool converts html to mediawiki markup language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    PWEditor is a professional web development tool, enabling users to efficiently design, develop and maintain websites both online and offline. It includes WYSIWYG html editor, CSS editor, JS editor and text editor. It works with Firefox, Mozilla and IE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A Python-based template and view-controller framework derived from HTML::Mason. Supports the full featureset of Mason, allowing component-based web development with Python-embedded HTML, and includes many new concepts and features not found in Mason.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The Collective Idenitity Project's goal is to build a p2p infrastructure aimed at fostering true community thinking. Largely text based, and peer reviewed, this software is an experiment to find a social convergence on a wide range of topics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    (XSLT transformer/editor) A text editor that allows the loading and editing of an XML document and an XSLT document at the same time. It also can apply the XSLT to the XML and display the output for further editing/saving. Plugable XML and XSLT parsers
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Early Access iText, a PDF generation library in Java
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    This is simple and tiny template framework module. It processing is speedy. And provides extract variables, dictionary reference and sequencial variable loop. Import a tinpy module and call the build function, so it became generate document with templat
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    (Almost) all a scholar in the Humanities needs (polytonic Greek fonts, stylistic and metrical analysis tools, search engines on TLG and PHI) concentrated in only one Linux Live CD, ready to use everywhere at home or at University, without installation
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Prototype for a framework and user interface for combining various structured search and document clustering techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Estraier is a personal full-text search system for web sites, local file systems, mail boxes, and so on. Estraier has flexible interface and it can handle multilingual documents and various file formats with external plug-ins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    csvplus is a collection of Perl-based utilities for querying csv files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    DOMIT! is a Document Object Model (DOM) XML parser for PHP, written purely in PHP. It is mostly compliant with the DOM Level 2 specification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SAXY is a Simple API for XML (SAX) parser for PHP, written purely in PHP. It is lightweight, fast, and modelled on the Expat parser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A language-neutral syntax highlighting module in Perl. Can be based off syntax files for popular text/code editors or custom parsing modules for a specific language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Lightweight system for running a weblog. Features multiple authors, topics, Trackback, RSS (amongst others). TruBlog comes with easy installation and strong caching mechanisms, it's localisable and produces a valid XHTML. Theming is done through CSS.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB