Open Source Text Processing Software - Page 4

Text Processing Software

View 91 business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    DrPython is a highly customizable cross-platform ide to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython as the gui.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    chmProcessor: Word/HTML to CHM converter

    chmProcessor: Word/HTML to CHM converter

    MS Word / HTML to CHM / Web Help converter

    A tool to generate compiled help files (CHM) and Java Help files from MS Word or HTML files. It splits the document on different topics pages by the "titles" sections. It can too generate a web site, a PDF and a XPS with the help content.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    cEdit is an advanced and free alternative to both common text editors, and IDE's. It has many of the features found in shareware editors, including extensive language support, function listing, built in FTP, projects, and docking support.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    Perpetual Notes

    Perpetual Notes

    Write beautifully. Organize easily. Find everything.

    Take notes faster. Find information easily. Save notes in RTF with rich text formatting and images, meeting notes, web pages, projects, travel plan, research drafts - with Perpetual Notes as your note taking app, have fun with note taking again. Runs on Windows 7/8/10.
    Downloads: 8 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    ScreenTranslate

    ScreenTranslate

    Translate any text on your Mac screen — capture or select,instantly.

    ScreenTranslate lets you translate any text on your Mac screen without switching tabs or copy-pasting. Screen Capture Translation: Press Cmd+Shift+T, drag over any text on screen, and get an instant translation popup. Works with images, PDFs, and subtitles using OCR (Apple Vision). Text Selection Translation: Select text in any app and press Cmd+Option+Z to translate directly. No OCR needed. - Free and open-source (GPL-3.0) - On-device translation using Apple Translation - Works offline with downloaded language packs - 20 languages with auto-detect - Optional cloud engines (DeepL, Google, Azure) with your own API key - Auto-copy to clipboard - Translation history with search - Lightweight menu bar app - Apple Silicon and Intel Mac supported - macOS 15 (Sequoia) or later required
    Downloads: 22 This Week
    Last Update:
    See Project
  • 6
    Tarjamento de Dados Pessoais e Sigilosos

    Tarjamento de Dados Pessoais e Sigilosos

    Ferramenta de Tarjamento de Dados Pessoais e Sigilosos

    TarjaPDF v2.0 Beta — Ferramenta de Tarjamento de Dados Pessoais e Sigilosos Proteja dados sensíveis em PDFs com segurança irreversível. Interface moderna com dark mode, marcação manual (texto, linha e área livre), detecção automática de CPF, RG, e-mail, telefone, nomes próprios e endereços. Escaneamento inteligente com análise preditiva: destaca dados pessoais para revisão antes de tarjar. Detecção de nomes via heurística e base oficial, com dicionário customizável. Relatório de conformidade LGPD após cada operação. Security by Design: salva exclusivamente como PDF-imagem, impossibilitando recuperação dos dados tarjados. Novo na versão 2.0: marcação por área, buscar e tarjar, scan preditivo com clique para tarjar blocos, menu de contexto e gerenciador de nomes. ⚠️ Alguns antivírus podem gerar falso positivo devido ao empacotamento com PyInstaller. O software é seguro. Ao instalar, adicione exceção no antivírus ou desabilite temporariamente a proteção em tempo real.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 7
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 8
    GATE
    NOTE THAT THE SOURCE CODE AND ISSUE TRACKER HAVE NOW MOVED TO GITHUB. FIND US AT https://github.com/GateNLP/ GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    PDF Clown

    PDF Clown

    General-Purpose PDF Library for Java and .NET

    PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to PDF 1.7 specification (ISO 32000-1). This project aims to provide a universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API. * Features: http://pdfclown.org/overview/features/ * Overview: http://pdfclown.org/overview/architecture/ * Website: http://pdfclown.org/ * Blog: http://www.pdfclown.org/blog/ * Twitter: https://twitter.com/PDFClown
    Downloads: 4 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    The Punjabi Computing Resource Centre holds resources (specifically articles, programs and fonts) to support the use of the Punjabi language using Unicode Gurmukhi. It also hosts a forum for language debate and technical support.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 12
    The XSD editor is a cross-platform XML editor. Although it can be used to edit any type of XML file, the editor is specifically designed to allow easy creation, editing, and validation of XML Schema (XSD) files.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 13
    RText is a customizable programmer's text editor written in Java. Some of its features include: syntax highlighting, editing multiple documents at once, printing and print preview, find/replace/find in files dialogs, undo/redo, and online help.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Library for automatic charset detection of a given text or file. Input buffer will be analysed to guess used encoding. The result (charset name or code page id) can be used as control parameter for charset conversation. Make your programs Unicode aware!
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Diff-ext is an extension for filemanagers such as Windows Explorer and Nautilus that allows to launch diff/merge tools on selected files.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18

    Portable Lyx 2.0

    A portable packaging of LyX 2.0

    A portable packaging of LyX 2.0
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    Chuletas / Cribr

    Chuletas / Cribr

    A powerful text processor to make cheat sheets easily

    A powerful text processor to make cheat sheets in an easy and suitable way. Text compressor, interface based in Office 2010, equation editor, real time previw & spell checker.... are some of its features. In addition, it can also works as a normal text editor, putting at your fingertips all the power of a great text editor comparable to Microsoft Word or LibreOffice, but with a much lighter installation and free of charge. In spanish speaking countries is known as Chuletas, the spanish translation of cheat sheet. In the rest of the world it's known as Cribr (from crib-sheet).
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20
    Track changes in LaTeX documents. The goal is to provide editing facilities as known from word processors like Microsoft Word or OpenOffice Writer for LaTeX. The project comprises a LaTeX package and additional software to accept/reject changes etc.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    Lout is a batch document formatter. It reads a high-level description of a document similar in style to LaTeX and produces a PostScript file which can be printed on most laser printers. Plain text and PDF output are also available.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    gema is a general purpose text processing utility based on the concept of pattern matching. It reads an input file and copies it to an output file transforming the data as specified by the patterns defined by the user. See the "Wiki" tab for further information.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Crypto Notepad Plus

    Crypto Notepad Plus

    Advanced notepad with functions of protection

    Crypto Notepad Plus is a free text editor with advanced encryption features. Also you can generate code snippets in various programming languages. Its use is governed by the GPL (GNU General Public License). Tested on Windows operating systems, from version 7 to 11.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 25

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 6 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB