Showing 1025 open source projects for "text processing"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1

    xmlj

    XMLJ is a Java XML Editor and validator project.

    XMLJ is a Java XML Editor and validator project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Unihanconver

    Unihanconver

    Traditional/Simplified Chinese conversion with CLI or GUI

    Tool to convert between Traditional/Simplified Chinese directly in Unicode (not GB/Big5 conversion). It is written in Perl and does not use any external libraries. It provides a command-line utility as well as a GTK+ interface for X Window.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Pen Possible

    Pen Possible

    scans a given textual string in 146 pen on paper possible combinations

    Application scans a given textual string in 146 pen on paper possible combinations- horizontal, vertical, diagonal, reverse, join top, join bottom, groups(2/3/4..), edges & in quadrant dimensions of your choice
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    FOray

    Modular XSL-FO Implementation for Java.

    FOray is an open-source XSL-FO publishing system that is suitable for converting XML content into PDF and other document formats. Although not yet fully conformant with the XSL-FO standard, it is very useful for many applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 5
    SubLin

    SubLin

    Software tool to subtract lines of any text file from another.

    SubLin is a simple easy-to-use cross-platform application to subtract lines of any text file from another. It is also possible to keep or ignore case sensitive. Works also from command-line interface, e.g. "java -jar sublin.jar -s file1.txt file2.txt >new_file1.txt", to create output file "new_file1.txt", or "java -jar sublin.jar -s file1.txt file2.txt >>new_file1.txt", to create or append to output file "new_file1.txt". SubLin is portable, does not need installation and is developed in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DupRem

    DupRem

    Simple application to remove duplicate and empty lines on text files.

    DupRem is a simple easy-to-use cross-platform application to remove duplicate and empty lines from any text file. It is also possible to keep or ignore case sensitive. Works also from command-line interface, e.g. "java -jar duprem.jar -r input_file.txt >output_file.txt", to create output file, or "java -jar duprem.jar -r input_file.txt >>output_file.txt", to create or append to output file. DupRem is portable, does not need installation and is developed in Java, so needs the Java Virtual...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Diff-ext is an extension for filemanagers such as Windows Explorer and Nautilus that allows to launch diff/merge tools on selected files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    ChordSmith

    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including Harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/ Here are just a few of ChordSmith's many features:
    Downloads: 44 This Week
    Last Update:
    See Project
  • 11
    PCSecrets

    PCSecrets

    Encrypt and manage secret text data

    PCSecrets is a PC application that holds secret text data - protected by a master password and strong encryption. Use it as a password manager or just somewhere to hold any text data securely in one place. It can hold a second, hidden set of secrets that is undetectable and plausibly deniable. The program is also a PC counterpart of the Secrets for Android app. It uses the same data structure and provides synchronization that allows easy transfer of secrets between the two. For those who...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    SciTECO

    SciTECO

    Advanced TECO dialect and interactive screen editor based on Scintilla

    SciTECO is an interactive TECO dialect, similar to Video TECO. It also adds features from classic TECO-11, as well as unique new ideas. Project development takes place here: https://git.fmsbw.de/sciteco The download archive is mirrored at Sourceforge, but for nightly builds check out: https://sciteco.fmsbw.de/downloads/nightly/
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    A simple, free, unlimited, secure web-based tool that converts Microsoft Word documents (.docx) into Markdown (.md/.txt) and HTML files. Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source. This is a better version of 'convert docx to txt' since .md files can be opened...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 14
    Chord5

    Chord5

    A version of CHORD4 updated to cooperate with ChordSmith.

    CHORD5 is a ChordPro editor and renderer, useful for formatting and printing song sheets ("lead sheets"). This version of the CHORD program (based on CHORD4) has been modified to add functionality that enables it to cooperate with the ChordSmith program (available at https://sourceforge.net/projects/chordsmith/). This modified version has been renamed to CHORD5. Although it is revised to work with ChordSmith, it also works well as a standalone program. NOTE: If you have...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    gema is a general purpose text processing utility based on the concept of pattern matching. It reads an input file and copies it to an output file transforming the data as specified by the patterns defined by the user. See the "Wiki" tab for further information.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    gleditor

    gleditor

    A small programmer's editor.

    A small programmer's editor whith syntax highlight, extended search features, code completion (ctrl+space). Supported languages: htlm, sql, pascal, c/c++, c#, java, basic, javascript, css, php, python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Change File Encoding

    Change encoding of text files.

    Change File Encoding is a utility that allows you to change the encoding of text files. For example, files saved in US-ASCII can be converted to UTF-8. Over 170 encodings are supported. Requires Java 1.8 or higher.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    The LaTeX Equation Compiler is a preprocessor for LaTeX files that enables numeric and symbolic calculations. It reads equations specified by special keywords in the LaTeX file, manipulates them and calculates numeric values of variables. If your OS can install .deb packages, have a look at my PPA http://launchpad.net/~jrheinlaender/+archive/ppa
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    oStorybook

    oStorybook

    oStorybook original, the writer's tool

    oStorybook un outil pour les écrivains. ATTENTION : voir sur http://ostorybook.eu --en_EN oStorybook a tool for writers. WARNING : see on http://ostorybook.eu.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Leader badge
    Downloads: 21 This Week
    Last Update:
    See Project
  • 22
    Ink

    Ink

    A fast and flexible Markdown parser written in Swift.

    Ink is a fast and flexible Markdown parsing library written in Swift that is designed to convert Markdown content into structured representations suitable for rendering or further processing. Originally developed as part of the Publish ecosystem, it focuses on performance and extensibility while maintaining a relatively simple and understandable internal architecture. The parser operates by reading Markdown input through a structured reader system that processes fragments and determines how...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified...
    Leader badge
    Downloads: 37 This Week
    Last Update:
    See Project
  • 24
    References bibliographic software
    Bibliographic software supporting LaTeX/BibTeX, LibreOffice, OpenOffice and other word processors
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB