Showing 1025 open source projects for "text processing"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats. The website will have complete HTML...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3

    LaTeX Helper

    GUI to help create a LaTeX document

    This GUI lets the user navigate a wide selection of LaTeX structure and formatting tags that are used to write a document in LaTeX. It also has a raw text input box with a LaTeX preview for testing out LaTeX markup. Also, each markup has associated help text and is automatically selected to be ready to paste into the user's text editor. Also, this GUI lets the user choose which text editor and supporting programs are used to create their LaTeX document instead of forcing them into a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 5
    The tlve program is a command-line tool for parsing different tlv (tag-length-value) structures and for printing them in various text-based formats. tlve is developed in GNU/Linux environment and it is distributed under GPL.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering,...
    Downloads: 56 This Week
    Last Update:
    See Project
  • 7
    IMPORTANT NOTE: This project has moved to Github: https://github.com/pkozelka/libxml2-pas Pascal units accessing the popular XML API from Daniel Veillard ( http://www.xmlsoft.org ). This should be usable at least from Kylix and Delphi, but hopefully also from other Pascal compilers (like freepascal).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    The `fmtt` formatter

    A plain text formatter for typescripts.

    Formats plain text and improves upon the classical fmt tool. Well suited as a reformatting filter for mails, reports, README files and all kinds of structured typescripts. Features include: - Knows about section headings, numbered item lists, unordered lists, paragraph numbers. - Can automatically renumber sections, items, paragraphs. - Handles no-break space (NBSP) and soft hyphen (SHY) sensible. - Uses Unicode internally, reads and writes 8-bit oder UTF-8 encoded...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    LaTeX Track Changes

    LaTeX Track Changes

    Collaborators on a version-controlled .tex file can track changes.

    LaTeX Track Changes shows changes over time for a .tex file that has its history stored in a git or svn repository. The user can customize how to view the changes: limited to certain authors or by revision or date among other filters. An Emacs mode provides the user interface. Plug-ins for other editors (such as TeXShop) are planned.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Velocity Editor Plugin

    Velocity Editor Plugin

    VTL (Velocity Template Language) edit support for the NetBeans IDE.

    Provides basic support to Velocity's *.vm and *.vsl files. Syntax coloring, basic error highlighting and braces matching is achieved with a lexer and parser based on Apache Velocity's 1.6.2 specification and compiled with JavaCC 5.0.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    nlconvert

    nlconvert

    Convert newlines in text files

    This package consists of six command-line utilities that let you convert the newlines of a text file from one style to another. It currently supports CR/LF (DOS/Windows), CR (Macintosh) and LF (Unix) newlines. It doesn’t try to be “intelligent”, since that will almost always fail at some point.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Redstring

    Redstring

    REDundant STRINg Generator

    Life is full of redundancy. Whether you need to write fifty lines of code that follows the same format, generate a thousand serial numbers from a pattern, or write eight hundred sentences for teaching the letter “D”, REDSTRING can help.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    MindRaider

    MindRaider is a personal notebook and outliner.

    MindRaider is a personal notebook and outliner. Where do you keep private remarks like ideas, plans, gift tips and howtos? Loads of documents and remarks spread around the file system? Can you find a remark when you need it? No? Try MindRaider!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    JGloss

    JGloss

    Add readings and translations to Japanese text

    JGloss lets you import Japanese text documents and add reading and translation annotations for words, both automatically during import, and manually. It is written in Java.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    wordaxe (formerly deco-cow): A hyphenation library for Python. Several hyphenation algorithms: - the pattern-based from TeX/OOO, - by decomposition of compound words for German language. Includes support for paragraph line-breaking with ReportLab.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    TemplateFx

    TemplateFx

    Dynamic Templating Tool

    MOVED TO GITHUB - http://templatefx.netnix.org TemplateFx is a Dynamic, Scriptable, Lightweight and Platform Independent Templating Tool which allows you to generate outputs based on a template and some source data. Its main use is in generating configurations for CLI based devices like Cisco routers and switches.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Character Encoding Translator

    Translate text from one encoding to another.

    Character Encoding Translator is an API and utility for translating text from one character encoding to another. A single JAR provides a GUI application, a console application, and an API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The best cyrillic text viewer and code page converter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Sinhala Unicode Writer

    Sinhala Unicode Writer

    Sinhala Unicode Writer For Linux

    Sinhala Unicode Writer is a sinhala unicode writing tool for linux.This application will help you to type whatever you want in Sinhala just like you are typing an SMS. It is very easy and simple to use.All you have to do is type in your text with English keyboard, meantime app transliterates this text in to Sinhala for you Offline.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21

    Text Analyzer

    Text analyzing software

    An application developed in C using the list and the AVL tree data structures, which analyzes a text (.txt file) giving the following information as an output: 1. the total occurrences of every word in the text 2. the exact line of every occurrence of every word 3. the exact position in the line of every occurrence of every word 4. the exact paragraph of every occurrence of every word 5. the exact sentence of every occurrence of every word The output is also written in a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ArabicDiacritizer

    ArabicDiacritizer

    An automatic restoration of Arabic diacritic marks

    This is a software of Arabic diacritical marks restoration. It is based mainly on deep architectures using deep neural network. The algorithm generates diacritized text with determined end case. The algorithm is described in detail in: Ilyes Rebai, and Yassine BenAyed 'Text-to-speech synthesis system with Arabic diacritic recognition system', Computer Speech & Language, 2015. We appreciate it very much if you can cite our related work. ************** Installation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    TEA is a text editor that provides a wide range of text-processing functions (over 100) and the syntax highlighting. There are two branches of TEA: Qt-based and GTK-based.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24

    pyWeb Literate Programming Tool

    Literate Programming in pure Python

    pyWeb is a Literate Programming tool that will work with any markup language and any programming langauge. The idea is to allow you to create great documentation with as constraints or limitations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This project is devoted to the development of natural language processing tools and resources for the Lingala language, which is spoken by tens of millions of people in central Africa.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB