Open Source Text Processing Software - Page 2

Text Processing Software

View 92 business solutions
  • Cloud data warehouse to power your data-driven innovation Icon
    Cloud data warehouse to power your data-driven innovation

    BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.

    BigQuery Studio provides a single, unified interface for all data practitioners of various coding skills to simplify analytics workflows from data ingestion and preparation to data exploration and visualization to ML model creation and use. It also allows you to use simple SQL to access Vertex AI foundational models directly inside BigQuery for text processing tasks, such as sentiment analysis, entity extraction, and many more without having to deal with specialized models.
  • Easy management of simple and complex projects Icon
    Easy management of simple and complex projects

    We help different businesses become digital, manage projects, teams, communicate effectively and control tasks online.

    Plan more projects with Worksection. Use Gantt chart and Kanban boards to organize your projects, get your team onboard and assign tasks and due dates.
  • 1
    GEDCOM Lexer Plugin for Notepad++

    GEDCOM Lexer Plugin for Notepad++

    Syntax highlighting and folding for GEDCOM

    Enhances Notepad++ text editor's handling of GEDCOM files. View GEDCOM files with syntax highlighting of: level, xref id, tag, pointer, value and escape tokens. Customize coloration and font styles. Grammar errors are also highlighted. View GEDCOM files in outline mode by folding sections based on line level.
    Leader badge
    Downloads: 167 This Week
    Last Update:
    See Project
  • 2
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for Hive and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 3
    XSLT syntax highlighting

    XSLT syntax highlighting

    Java based XSLT Processor extension for syntax highlighting

    Please note that project moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon), so if you have e.g. article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
    Leader badge
    Downloads: 131 This Week
    Last Update:
    See Project
  • 4
    meld-installer

    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy to use installer. This allows you to not have to worry about setting up Python or PyGTK and you can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later now have official installers, hence this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Leader badge
    Downloads: 103 This Week
    Last Update:
    See Project
  • Component Content Management System for Software Documentation Icon
    Component Content Management System for Software Documentation

    Great tool for serious technical writers

    Paligo is an end-to-end Component Content Management System (CCMS) solution for technical documentation, policies and procedures, knowledge management, and more.
  • 5

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Leader badge
    Downloads: 86 This Week
    Last Update:
    See Project
  • 6
    Find And Replace Text command line utility. New & improved version of the well-known grep command, with advanced features such as: case-adaption of the replace string; find (& replace) in filenames, auto CVS edit. Moved to https://github.com/lionello/fart-it
    Leader badge
    Downloads: 70 This Week
    Last Update:
    See Project
  • 7
    The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See https://www.dita-ot.org/ for documentation and links to downloads. The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
    Downloads: 50 This Week
    Last Update:
    See Project
  • 8
    CodeLite

    CodeLite

    CodeLite: an open source, cross platform C/C++/PHP and JavaScript IDE

    codelite is an open-source, cross platform IDE for the C/C++/PHP and JavaScript programming languages (build and tested on Windows 7, 8, ubuntu >=14.04 plus debian, fedora and openSUSE, and Mac OSX 10.8.4). codelite is distributed under the terms of the GPLv2 license If you need help with codelite, please join our forums at: http://forums.codelite.org Visit our download page at: http://codelite.org/support.php The CodeLite Team
    Downloads: 49 This Week
    Last Update:
    See Project
  • 9
    The Guide
    The Guide is a tree-based information management tool. It lets you to organize information as nodes in a tree. (A two-pane rich-text outliner for Windows.)
    Leader badge
    Downloads: 52 This Week
    Last Update:
    See Project
  • Control remote support software for remote workers and IT teams Icon
    Control remote support software for remote workers and IT teams

    Raise the bar for remote support and reduce customer downtime.

    ConnectWise ScreenConnect, formerly ConnectWise Control, is a remote support solution for Managed Service Providers (MSP), Value Added Resellers (VAR), internal IT teams, and managed security providers. Fast, reliable, secure, and simple to use, ConnectWise ScreenConnect helps businesses solve their customers' issues faster from any location. The platform features remote support, remote access, remote meeting, customization, and integrations with leading business tools.
  • 10
    FCKeditor

    FCKeditor

    FCKeditor (retired)

    FCKeditor is the previous version of CKEditor and has been discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security and better integration. Don’t force yourself with retro FCKeditor. Switch to the new, cool CKEditor at ckeditor.com
    Downloads: 38 This Week
    Last Update:
    See Project
  • 11
    Vrapper

    Vrapper

    Vim-like editing in Eclipse

    Vrapper is an eclipse plugin which acts as a wrapper for existing eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
    Leader badge
    Downloads: 38 This Week
    Last Update:
    See Project
  • 12
    Tinn-R

    Tinn-R

    Tinn-R Editor - GUI for R Language and Environment

    The Tinn-R is an open source (GNU General Public License) project. It is an editor/word processor ASCII/UNICODE generic for the Windows operating system, very well integrated into the R, with characteristics of Graphical User Interface (GUI) and Integrated Development Environment (IDE). Project leader and main developer: José Cláudio Faria/UESC/DCET. LANGUAGE: Object Pascal, IDE: DELPHI 2007.
    Downloads: 41 This Week
    Last Update:
    See Project
  • 13
    Pdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.
    Downloads: 44 This Week
    Last Update:
    See Project
  • 14
    pgfplots - A TeX package to draw normal and/or logarithmic plots directly in TeX in two and three dimensions with a user-friendly interface and pgfplotstable - a TeX package to round and format numerical tables. Examples in manuals and/or on web site. ATTENTION: As of February 2020, the development has been moved to https://github.com/pgf-tikz/pgfplots ! Please refer to the new page for downloads and change requests! This page is kept in read-only mode.
    Leader badge
    Downloads: 38 This Week
    Last Update:
    See Project
  • 15
    HarfBuzz

    HarfBuzz

    Open source text shaping engine

    HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document. This shaping depends on a number of factors: the input string, the active font, the script (or writing system) of the string, and the string's language. Various font formats have their own set of standard text-shaping rules. With Harfbuzz, you can properly shape all the major writing systems. HarfBuzz is cross-platform and supports all major software platforms and font formats.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Vim provides a rich set of tools which makes generating latex easy, pain-free and quite pleasurable. This web-site aims at bringing together the rich set of tools the vim community has produced over the years into a central repository
    Downloads: 38 This Week
    Last Update:
    See Project
  • 17
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified (zh_CN) Chinese Traditional(zh_TW) German(de_DE) (Not done) Greek(el) Italian(it_IT) (Not done) Japanese(ja_JP) Polish(pl_PL) (Not done) Russian(ru_RU) (Not done) Spanish(es) (Not done) You may help to add more translation for MadEdit-Mod by Poedit https://poedit.net
    Leader badge
    Downloads: 95 This Week
    Last Update:
    See Project
  • 18
    Typeit! supports five Malayalam Keyboards. They are Inscript (ISM), GIST, Malayalam Typewriter, Panchari and Varityper Phonetic keyboard Layout. Anyone who is familier with any of the above keyboard layouts can use Typeit! with ease.
    Leader badge
    Downloads: 185 This Week
    Last Update:
    See Project
  • 19
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 20
    OOoFBTools

    OOoFBTools

    Open/Libre Office extension for converting eBooks in fb2 format

    Open/Libre Office extension for converting and processing eBooks in FictionBook2 format with validator. Apache OpenOffice Extensions page: http://extensions.openoffice.org/en/project/ooo-fbtools Libre Office Extensions page: http://extensions.libreoffice.org/extension-center/fbtools Внимание! Приглашаются разработчики! dikbsd последние годы тяжело тянуть даже текущее сопровождение. И есть интерес передать проект в хорошие руки, не доводя до кризиса, а с передачей опыта.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 21
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 22
    LMG2Shruti is a free non unicode to unicode font converter. It converts the LMG Arun font to Gujarati unicode Shruti font.
    Leader badge
    Downloads: 108 This Week
    Last Update:
    See Project
  • 23
    TagsView user interface for ctags parsed result. Supported editors: Notepad++, AkelPad.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 24
    Unicode Converter

    Unicode Converter

    All in one Text, Decimal, and Unicode Converter as you type.

    Unicode Converter is an open source software for converting Text to Unicode and decimal and vice versa while you type. No clicks or convert button! Latest source code: https://github.com/ahmed-fathy/unicode-converter Conversion Options: Text -> Unicode and Decimal Unicode -> Text and Decimal Decimal -> Text and Unicode *Smart settings: The settings take effect directly after you change it and will be saved for you and applied in next runs. Formatting options: Unicode and decimal perfix and postfix to get your desired output, such as \u0001  0001 etc...
    Leader badge
    Downloads: 99 This Week
    Last Update:
    See Project
  • 25
    Calc2LaTeX is an OpenOffice.org Calc (Spreadsheet) macro for converting tables. It makes making tables on LaTeX very easy.
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project