Showing 77 open source projects for "batch text processing"

View related business solutions
  • 99.99% Uptime for Your Most Critical Databases Icon
    99.99% Uptime for Your Most Critical Databases

    Sub-second maintenance. 2x read/write performance. Built-in vector search for AI apps.

    Cloud SQL Enterprise Plus delivers near-zero downtime with 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server.
    Try Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    natural

    natural

    General natural language facilities for node

    "Natural" is a general natural language facility for nodejs. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    PHPWord

    PHPWord

    PHP library for reading and writing word processing documents

    PHPWord is a library written in pure PHP that provides a set of classes to write to and read from different document file formats. The current version of PHPWord supports Microsoft Office Open XML (OOXML or OpenXML), OASIS Open Document Format for Office Applications (OpenDocument or ODF), and Rich Text Format (RTF). PHPWord is an open source project licensed under the terms of LGPL version 3. PHPWord is aimed to be a high quality software product by incorporating continuous integration and...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 3

    Bulgarian language support

    Spell check, grammar check and hyphenation for Bulgarian language

    The goal of this project is to provide spell check, grammar check and hyphenation for Bulgarian language for Open Source products such as OpenOffice.org, LibreOffice, TeX, aspell, ispell, hunspell etc.
    Leader badge
    Downloads: 42 This Week
    Last Update:
    See Project
  • 4
    Microsoft Works format import library
    libwps is a Microsoft Works file format import filter based on top of the librevenge (see https://sourceforge.net/p/libwpd/wiki/librevenge/ ). Currently, libwps can import all word processing Works formats since about 1995 with some success. It may also be able to import some basic database and spreadsheet files.
    Leader badge
    Downloads: 352 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    ADAPRO

    ADAPRO

    Word processor oriented for learning difficulties.

    ADAPRO is a free-to-use word processor geared towards individuals with a learning difficulty like dyslexia or a developmental disorder such as autism. Its adapted, seamless and configurable interface provides a simplified environment that can be relied, fostering the user's sustained attention. It can be downloaded and used completely free of charge for any purpose. Supports English, Spanish and Portuguese. If Java 6 or higher is already present on the computer, it does not even require...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 6
    SimplyHTML is an application and a java component for rich text processing. It stores documents as HTML files in combination with Cascading Style Sheets (CSS). SimplyHTML is not intended to be used as an editor for web pages.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    pgfplots - A TeX package to draw normal and/or logarithmic plots directly in TeX in two and three dimensions with a user-friendly interface and pgfplotstable - a TeX package to round and format numerical tables. Examples in manuals and/or on web site. ATTENTION: As of February 2020, the development has been moved to https://github.com/pgf-tikz/pgfplots ! Please refer to the new page for downloads and change requests! This page is kept in read-only mode.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8

    QVASIMODO's String Tools

    This project is a quick way of applying macros to a portion of text.

    This project is basically a quick way of applying macros, i.e., sequences of editing commands, to a portion of text. You create a "ruleset", which is basically a sequence of "rules" to be sequentially applied to the source text. The original text is entered in the top left box, while the result appears in the top right box. Program configuration is stored in a sqlite file in a '.panglossa' directory in the user's home folder. Remember that the "rules", or commands, will be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Downloads: 47 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    The Text Trix editor is an open-source, cross-platform text editor with a mind for user-friendly file and text navigation. See our new GitHub home for the latest and greatest Text Trix development: https://github.com/the4thchild/texttrix
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    RTE

    Rapid Text Edior

    RTE is a rapid text editor with column selection and a numeric sort ascending and descending (integer or double)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TreeWriter

    TreeWriter

    Unix console full screen word processor

    Tree Writer easy unix console word processor full screen editor that edits raw ascii text files. It uses a text based interface that works well over terminal lines, designed for a 80x25 mono display. Includes spelling checker (english dictionary), on screen calculator and a file system that stores the documents apart from regular unix files. In that treewriter looks only for it's own kind of file and doesn't expose the user to whatever else is in it's path.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Ansj Chinese word segmentation

    Ansj Chinese word segmentation

    Ansj word segmentation

    The real java implementation of ict. The word segmentation effect is faster than the open source version of ict. Chinese word segmentation, name recognition, part-of-speech tagging, user-defined dictionary. This is a java implementation of Chinese word segmentation based on n-Gram+CRF+HMM. The word segmentation speed reaches about 2 million words per second (tested under mac air), and the accuracy rate can reach more than 96%. At present, it has realized the functions of Chinese word...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Regular Expression Editor (RegExpEditor)

    Regular Expression Editor (RegExpEditor)

    regex as a tool, not as a problem

    Regular Expressions (aka regex, regexp) made easy. This simple tool manipulates text with regular expressions. Highlighting of regular expression results. See the real power of regex! Use Scala to do manipulate your search results even more.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering,...
    Leader badge
    Downloads: 402 This Week
    Last Update:
    See Project
  • 18
    docx2txt

    docx2txt

    Perl based utility to extract formatted text content from MS Docx file

    Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
    Leader badge
    Downloads: 75 This Week
    Last Update:
    See Project
  • 19
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    This project offers galician dictionaries for several spell checkers: "Ispell", "Myspell" , "Aspell", "Spell Checker for Edit Boxes" and "Excalibur".
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JSpell is a Java API for providing text-checking functionality (spell checkers, grammar checkers, style checkers etc.) to Java programs. It is designed to support all text-checking engines via a plugin architecture, and to be as easy to use as possib
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Vondel
    Vondel is a program that will assist you in writing novels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Track changes in LaTeX documents. The goal is to provide editing facilities as known from word processors like Microsoft Word or OpenOffice Writer for LaTeX. The project comprises a LaTeX package and additional software to accept/reject changes etc.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 25
    QCreativeWriter
    A software for creative writers with some new aspects. Scene/Strand based, interactive story-devolepment in an intuitive way. Database oriented on creative writing technics. Full editor integration. Statistics. -> And a beta-reader client!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB