Showing 139 open source projects for "language processing"

  • 1
    Provides a robust and efficient implementation of n-gram based classifiers for Java. N-gram algorithms have proven surprisingly good at tasks like guessing the language or encoding of an arbitrary text file, and there are many more applications.
    Downloads: 0 This Week
  • 2
    Japanese Language Learning Tools for Hiragana, Katakana and Kanji. Kana Text Editor. Vocabulary Games and Sentence builder.
    Downloads: 0 This Week
  • 3
    HTML template library written in C, inspired by Perl's HTML::Template. The template language has HTML-like tags (tmpl_var, tmpl_if, tmpl_loop, etc.). Use the library to build a variable list and pass it to a template.
    Downloads: 0 This Week
  • 4
    Simple text editor for editing multiple language files in parallel. Corresponding translations are automatically highlighted in order to aid reviewing against multiple sources. Supports gettext PO and key = value format. Highly recommended for polyglots.
    Downloads: 0 This Week
  • 5
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 1 This Week
  • 6
    This project extends and enhances the Notepad++ plugin interface to make it easy for novice and experienced programmers to create plugins and language stylers using Scintilla and Notepad++.
    Downloads: 0 This Week
  • 7
    JCopist is a template-based document generation server based on OpenOffice.org. Its templates are regular OpenDocuments enhanced with the FreeMarker scripting language. A wide range of formats is available, e.g. ODT, PDF, RTF, HTML, MS Word, and MS Excel.
    Downloads: 0 This Week
  • 8
    A free and handy text editor for plain and formatted text files, with printing support. Since release 1.2.0 it includes a tool for conversion to PDF. It is written in Java, so it runs on many operating systems.
    Downloads: 0 This Week
  • 9
    Tubaina is a book generator. Given text written in afc syntax, a markup language, it generates HTML or PDF output. This project has moved to GitHub: http://github.com/caelum/tubaina
    Downloads: 1 This Week
  • 10
    XVCL is a general-purpose language for configuring variants in all sorts of textual documents (including programs). It is based on frame technology. The XVCL processor automates the customization process to produce a system from a specification of variants.
    Downloads: 0 This Week
  • 11
    This project provides a Low Saxon language module for the GNU aspell program. Low Saxon (ISO language code: nds) is a regional language spoken in northern Germany and the eastern part of the Netherlands.
    Downloads: 4 This Week
  • 12
    A web-based tool for checking XML documents against user-defined house style rules. Presets include Commonwealth/American spelling and inclusive language.
    Downloads: 0 This Week
  • 13
    ZML, the Zeitung Markup Language, is a simple CMS for small newspapers. It was specifically designed to publish a student newspaper in print and on the Web. It uses LaTeX and XHTML. So far, it is documented in German only.
    Downloads: 0 This Week
  • 14
    twexter formats twin twext translations to help us learn languages. Demo: http://test.twext.com ; the JavaScript code is open at http://github.com/tudisco/twexter
    Downloads: 0 This Week
  • 15
    "Java Artificial Intelligence Markup Language PAD" is a tool that manages ProgramD AI (on local or remote machines) and AIML files with real-time previews, and provides network support for testing AI capabilities over many network protocols.
    Downloads: 0 This Week
  • 16
    JLoom is a JSP-like template language for text generation (e.g. source code, HTML, XML). JLoom templates are modular and encapsulated. Parameters can be any Java type, even Generics or Varargs. There is a plugin for Eclipse and a command line tool.
    Downloads: 0 This Week
  • 17
    This is an office suite based on the OpenDocument standards that, when completed, will include a word processor and a spreadsheet editor. All of it will be programmed in Visual Basic .NET.
    Downloads: 0 This Week
  • 18
    Automatically translates English/French/German text to German/French/English text and outputs speech in the appropriate language. All automagically, with the power of the inter-webs.
    Downloads: 0 This Week
  • 19
    IniTranslator is a Windows tool for developers and users that simplifies the translation and localization of INI-style language files, in a manner similar to how poEdit works. IniTranslator can also load and save other formats through its plugin interface.
    Downloads: 17 This Week
  • 20
    The Java Text Categorizing Library (JTCL) is a pure Java implementation of libTextCat, which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."
    Downloads: 0 This Week
  • 21
    LeetGen is a small multi-language application that converts text to leet and vice versa.
    Downloads: 0 This Week
  • 22
    Collection of tools for inputting, reading, processing, and typesetting the Taiwanese language. Includes SCIM and quail input methods, a Firefox dictionary plugin, and scripts for LaTeX and HTML generation.
    Downloads: 0 This Week
  • 23
    LiMa stands for Lightweight Markup Language. It is a parser for an easy-to-use ASCII/text-based markup, comparable to Markdown or the Wikipedia markup language, with special configurable extensions for defining links and image resources.
    Downloads: 0 This Week
  • 24
    Bi-gram applications based on language models produced by SRILM from a Chinese Wikipedia corpus, including a Chinese word segmenter, a word-based (not character-based) Traditional-Simplified Chinese converter, and a Chinese syllable-to-word converter.
    Downloads: 0 This Week
  • 25
    SilverCity is a lexing package, based on Scintilla, that can provide lexical analysis for over 20 programming and markup languages. Scripting language bindings currently exist for Python.
    Downloads: 0 This Week