Showing 120 open source projects for "language processing"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1

    EncTool

    Command line tool to detect and convert files encoding.

    Command line tool to detect and convert files encoding. Works with files or directories. Can be used to add or remove UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or highter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The Information Extraction Plugin allows the use of information extraction techniques within RapidMiner. It can be seen as an interface between natural language and IE- or datamining-methods, by extracting interesting information out of documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Trim Lines

    Trim Lines removes trailing whitespace from source code files

    This is simple command line tool to batch process source code files to remove trailing whitespaces and convert all line endings to your system native style. Usage example: trimlines d:\Projects\SomeProject\src *.c;*.cpp;*.h;*.hpp;*.inc .svn;.git With this command all files in "d:\Projects\SomeProject\src" including sub-folders that match search masks "*.c;*.cpp;*.h;*.hpp;*.inc" and excluding specified folders ".svn;.git" will be processed. Can process unicode and system native...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    A Java toolbox with commonly used Farsi Language functions. Includes functions for text manipulation, standardization, normalization, search, replace and changing words and ligatures. Fixing White space problems, Jalai date and Calendar, etc...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    CodeLift

    Editor with extensive code help for the C++ language

    Currently in very early stages, but eventually plans to bring a code editor with extensive C++ auto completion and assistance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ChaLo

    ChaLo

    Little helper application to correct mistypings

    ChaLo is trying to help with annoying mistypes due to wrongly selected keyboard locale. Currently supported laguages (keyboard layouts): Greek-Latin Will be supported soon: Russian-Latin Romanian-Latin A typical usage scenario involves a user writing text of mixed language (eg Greek and English) thus having to switch keyboard layout from time to time. User often forgets to switch from English to Greek but continues writing (often without looking at the screen) only to find out a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This plugin works Notepad++ max version 7.5.3 and PHP v 5.26 (sorry). Plugin for text editor Notepad + +(both. UNICODE & ANSI). This plugin allows you to control and word processing using a scripting language PHP(as Visual Basic for Excel). No need to install PHP and Apache. Full Server HTTP is buildIn.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    The babel addon package to support serbian language with cyrillic script in the TeX document processing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Your monitoring isn't a stack. It's a pile. Fix that. Icon
    Your monitoring isn't a stack. It's a pile. Fix that.

    Errors, performance, logs, uptime. One install, one invoice, one UI.

    Replace Datadog, New Relic, and Sentry without adding three more dashboards.
    Free 30 days.
  • 10
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A simple way to create a syntax highlighting editor for a custom language/grammar and/or create custom grammar parsers. This is a .NET project written in C#. See details here: http://acct001.com/wordpress/?p=190
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    l33t Translator
    Translates to and from the advanced language for g33ks! It also contains an editor so you can specify your own leet language!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Versatile converter from Wiki-Markup-Language to LaTex.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ApexText

    ApexText

    Efficient and lightweight text editor with rich functionalities.

    ApexText is a general purpose text editor for developers and non-developers. It supports synatx highlighting for Java, C, C++, Perl, SQL, JSP, HTML etc., tooling for Java. Many UI features are configurable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PDML is an informal markup language written in PHP that is similar to HTML. It allows for the creation of complex PDF documents and can also be used in conjunction with PHP, to define templates which can generate dynamic PDF documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Provide a robust and efficient implementation of n-gram based classifiers to Java. N-Gram algorithms have shown to be surprisingly good at tasks like guessing the language/encoding from an arbitrary text file. And there are many more applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Japanese Language Learning Tools for Hiragana, Katakana and Kanji. Kana Text Editor. Vocabulary Games and Sentence builder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Simple text editor for editing multiple language files in parallel. Corresponding translations are automatically highlighted in order to aid reviewing against multiple sources. Supports gettext PO and key = value format. Highly recommended for polyglots.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    This project extends and enhances the Notepad++ plugin interface to make it easy for novice and experienced programmers to create plugins and language stylers using Scintilla and Notepad++.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    JCopist is a template-based document generation server based on OpenOffice.org. Its templates are regular OpenDocuments enhanced with the FreeMarker scripting language. A wide range of formats are available, eg. : ODT, PDF, RTF, HTML, MS Word, MS Excel
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    It's a free and handy text editor for both plain text files and formatted text files and printing. Since release 1.2.0 there is a tool for the conversion in PDF format. It's written in Java language so it's available for a lot of Operative Systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Tubaina is a book generator. Given a text written in afc syntax, a markup language, an html or pdf output is generated. This project has been moved to Github: http://github.com/caelum/tubaina
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    XVCL is a general-purpose language for configuring variants in all sorts of textual documents (including programs). It is based on frame technology. XVCL processor automates the customization process to produce system from specification of variants.
    Downloads: 1 This Week
    Last Update:
    See Project