Showing 366 open source projects for "text processing"

  • 1
    JPDF Tools
    JPDF Tools is a Java GUI program built on the JPDF Export library. Its main aim is to create PDF files by inserting text, images, or tables. Users can also merge PDF files, split PDF files, and merge images into PDF files; conversion to and from PDF is planned.
    Downloads: 0 This Week
  • 2
    XSL Transformation (XSLT) stylesheets that convert documents from Adobe's XML Data Package (XDP) XML vocabulary into documents in the W3C's XSL Formatting Objects (XSL-FO) vocabulary.
    Downloads: 0 This Week
  • 3

    EncTool

    Command-line tool to detect and convert file encodings.

    Command-line tool to detect and convert file encodings. Works with files or directories. Can be used to add or remove a UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or higher.
    Downloads: 0 This Week
  • 4

    Dvipdfm tool for SCons

    SCons tool for working with the dvipdfm program

    SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built-in compilation cache support. SCons rule sets are Python scripts, so in addition to its own features, SCons lets you use the full power of Python to control compilation. This is a SCons extension (tool) that enables use of the dvipdfm program to convert DVI files to PDF.
    Downloads: 0 This Week
  • 5
    DPRK pull is a script that pulls the English-language North Korean news articles from the KCNA website and puts them into one file for reading by a text-to-speech program.
    Downloads: 0 This Week
  • 6
    proJMS

    Messaging Library for the Processing environment

    proJMS is a library for Processing that makes JMS easy to use. It lets you publish and consume JMS messages and set up peer-to-peer communication between Processing applications on a network. Refer to the wiki for installation and usage instructions.
    Downloads: 0 This Week
  • 7
    Transliterator between files in any language: map fonts, create encoding schemes, and take input from a phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish, or any other world-language keyboard. Example: [Phonetic Input] to [Any World Language Output], or vice versa.
    Downloads: 0 This Week
  • 8
    Almost Redundant Code Regenerator

    Provides incremental changes to code that needs to be almost redundant

    This is a program designed for programmers who may have to write code that is almost identical except for a few values that change along the way. Enjoy :)
    Downloads: 0 This Week
  • 9
    OmegaT+ CAT Tools
    A translation tools suite for Computer-Aided Translation / Computer-Assisted Translation (CAT). A translation processor with translation memory, machine translation and project support, bitext aligner/converter, TMX validator, and others.
    Downloads: 4 This Week
  • 10

    Trim Lines

    Trim Lines removes trailing whitespace from source code files

    This is a simple command-line tool that batch-processes source code files to remove trailing whitespace and convert all line endings to your system's native style. Usage example: trimlines d:\Projects\SomeProject\src *.c;*.cpp;*.h;*.hpp;*.inc .svn;.git This command processes all files in "d:\Projects\SomeProject\src" and its sub-folders that match the search masks "*.c;*.cpp;*.h;*.hpp;*.inc", excluding the specified folders ".svn;.git". Can process Unicode and system native...
    Downloads: 0 This Week
  • 11

    strtools

    A command line utility for common string functions

    Want to quickly know the length of a string? Convert to upper or lower case? We introduce... strtools. The command-line utility that should be included in all operating systems. A useful wrapper for programming, debugging, and scripting.
    Downloads: 0 This Week
  • 12

    XML Text Declaration

    Support for XML text declaration handling

    The purpose of this library is to determine the text encoding of an input, and dynamically insert a filter, which allows the processing program to assume UTF-8 input, regardless of the actual input encoding.
    Downloads: 0 This Week
  • 13
    A collection of DITA map and topic files used for checking the performance of tools such as the DITA-OT, which converts DITA to other formats. It includes recommended PIs for dealing with presentation needs not covered in the DITA specification. The primary host is now GitHub: https://github.com/jeremygriffith/DITA-Test-Suite
    Downloads: 0 This Week
  • 14
    OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
    Downloads: 0 This Week
  • 15
    Ub3rMath

    Simple math parsing library for C++

    A math parsing library for C++ with a number of powerful features to allow flexible interpretation of mathematical formulas in text form.
    Downloads: 0 This Week
  • 16
    A general-purpose framework for quickly building a working website in a typical PHP/MySQL environment. It is targeted at developers, experienced web administrators, and anyone unafraid of hacking and using a text editor.
    Downloads: 0 This Week
  • 17
    Winefish LaTeX Editor for Experienced Users
    Downloads: 1 This Week
  • 18
    XML Editor for www.xical.org
    Downloads: 0 This Week
  • 19
    Tools to update all pages of a website with a single command. A header and menu can be copied onto each page. A header (a license, for example) can also be added to all source files of a project.
    Downloads: 0 This Week
  • 20
    Task for ANT to produce documentation with (PDF)LaTeX using BibTeX, Makeindex and GlossTeX.
    Downloads: 0 This Week
  • 21
    OmegaT+ Computer Assisted Translation (CAT) tools platform that includes OmegaT+ (translation processor), bitext2tmx (aligner/TMX editor), and Validator (TMX validation).
    Downloads: 0 This Week
  • 22
    Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
    Downloads: 5 This Week
  • 23
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted.
    Downloads: 0 This Week
  • 24
    regexxer
    regexxer is a nifty GUI search/replace tool featuring Perl-style regular expressions. If you need project-wide substitution and you're tired of hacking sed command lines together, then you should definitely give regexxer a try.
    Downloads: 0 This Week
  • 25
    TagParser is a Java parser based on CSS formulas (like jQuery) that can parse any tag-based document, such as XML or HTML. It doesn't require documents to be well formed and can parse complex documents with embedded scripts or CSS parts.
    Downloads: 0 This Week