Open Source Linux Text Processing Software - Page 14

Text Processing Software for Linux

View 9 business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • 1
    DOMIT! is a Document Object Model (DOM) XML parser for PHP, written purely in PHP. It is mostly compliant with the DOM Level 2 specification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    DOMIT! RSS is an RSS parser for PHP, written purely in PHP. Unlike most existing PHP RSS clients, it uses a DOM XML parser -- DOMIT! -- to convert an RSS feed into a DOM document that can be traversed using the standard DOM methods.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    A WYSIWYG GUI-based HTML editor with real-time tag parsing and properties editor, as well as site management with link validation and updating. Similar to a well known editor from Macromedia.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Detexter

    Detexter is an app designed to extract text from PDF files.

    Detexter lets you extract text from multiple PDF files. Detexter uses the PDFBox library for its text extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Comet Backup - Fast, Secure Backup Software for MSPs Icon
    Comet Backup - Fast, Secure Backup Software for MSPs

    Fast, Secure Backup Software for Businesses and IT Providers

    Comet is a flexible backup platform, giving you total control over your backup environment and storage destinations.
    Learn More
  • 5
    Devchekio is a GTK+ 2 FTP check in/out code editor. It was inspired by the use of DreamWeaver's system to access files like a library system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    An experimental set of tools for text analysis and dictionary construction. One goal is to improve text-input e.g. on devices with touchscreens using dictionary-based symbolic on-screen keyboards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Discriminative Language Editor

    Discriminative language editor based on ontologies

    Text editor in Java that is able to detect discriminative expressions while the user is typing. When the internal ontology-based analyzer detects a potential discriminative expression the user is advised by underscoring the related words in the text. A descriptive message about the issue is also shown to the user when the cursor is placed over the potential discriminative expression.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Distributed Proofreaders
    Project has moved to https://github.com/DistributedProofreaders/dproofreaders Distributed Proofreaders is a web application intended to ease the process of converting public domain books and other printed materials into e-texts. The main site is at http://www.pgdp.net By breaking the work into individual pages, many proofreaders can be working on the same book at the same time. This significantly speeds up the proofreading/E-Text creation process. When a proofer elects to proofread a page for a particular project, the text and image file are displayed on a single webpage. This allows the text file to be easily reviewed and compared to the image file, thus assisting the proofreading of the text file. The edited text is then submitted back to the site via the same webpage that it was edited on. Once all pages for a particular book have been processed, a concatenated text file is made available for final clean-up and submitted to a Project Gutenberg site.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Latex editor with document structure tree view and project handling. Latex output allows direct jump to warnings/errors. Projects folders gives support for figures/graphs. Editor component includes usual features like search/replace and syntax highlight.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AestheticsPro Medical Spa Software Icon
    AestheticsPro Medical Spa Software

    Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

    AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.
    Learn More
  • 10
    The Doc2Html command line operating program strippes the Word produced html files (by opening the documet, saving as html) leaving pure text + minimum html code. It also has a mode to convert data berween different charsets: DOS, Windows-1250 and ISO-8859
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The DocBook Publishing Utilities tools, which make creation and publishing of DocBook easier. The tools are: Maven plug-in to Transform HTML into XML (use after docbkx); Eclipse DocBook table editor; Eclipse wizards for initial DocBook files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DocbBook SGML Toolbox combines the source packages for DocBook SGML 3.1, 4.1, Jade, and other sgml/docbook utilities, and automatically compiles and installs the binary Slackware,RPM,or Debian packages. Install is EASY!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    This is an Emacs-Lisp package that enables easy editting and maintenance of DocBook XML files within GNU Emacs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    XSL Transformation stylesheets. These stylesheets output LaTeX code from a DocBook (c) XML document. A recent version of LaTeX2e is required.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DocBook XSLT stylesheets & utils to OpenDocument transformation (text/presentation/...) with support for Corporate Identity templates.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    JSESOFT-DB2PDF provides a transformator for a limited (but expanding) subset of DocBook to PDF. The transformation from DocBook is done via iText directly to PDF. Priority is given to predictability and stability rather than to completeness.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Comprehensive DocBook XML processing solution for MacOS X
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The DocConversion project provides a distributed document conversion solution with a well defined API which makes use of existing convstion tools and/or a centralized conversion server. This is part of the PRONIR research at http://www.pronir.nl
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Doco is a simple but feature rich and powerful markup language for converting text documents into highly-presentable and navigable web content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Docyrus is a web application allowing to create articles from imported chapters or articles. You can store local documents (doc, wiki, HTML, RSS, openOffice) online, manage them online and export them in many formats (pdf, openOffice, HTML,...).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    DPRK pull is a script that pulls the English language North Korean news articles from the KCNA website and puts them into one file for reading by a Text to Speech program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Dvipdfm tool for SCons

    SCons tool to cooperate with dvipdfm program

    SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built in compilation cache support. SCons rule sets are Python scripts so as well as the features it provides itself SCons allows you to use the full power of Python to control compilation. This is a SCons extension (tool) which enables usage of the dvipdfm program to convert dvi files to pdf.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ChapMaker is a new idea for working on texts like essays, articles, documentations or books. It is a simple texteditor with connection to a central database server. Different users from different locations are able to work on their projects together.
    Downloads: 0 This Week
    Last Update:
    See Project