Showing 84 open source projects for "language processing"

  • 1
    compromise

    Modest natural-language processing

    Language is complicated and there's a gazillion words. Compromise is a JavaScript library that interprets and pre-parses text and makes some reasonable decisions so things are way easier. Compromise tries its best to parse text. It is small, quick, and often good enough. It is not as smart as you'd think. Conjugate and negate verbs in any tense. Play between plural, singular and possessive forms. Interpret plain-text numbers. Handle implicit terms. Use it on the client-side or as an...
    Downloads: 2 This Week
  • 2
    Stanford CoreNLP

    Stanford CoreNLP, a Java suite of core NLP tools

    CoreNLP is your one-stop shop for natural language processing in Java! CoreNLP enables users to derive linguistic annotations for text, including token and sentence boundaries, parts of speech, named entities, numeric and time values, dependency and constituency parses, coreference, sentiment, quote attributions, and relations. CoreNLP currently supports six languages: Arabic, Chinese, English, French, German, and Spanish.
    Downloads: 3 This Week
  • 3
    natural

    General natural language facilities for node

    "Natural" is a general natural language facility for nodejs. It offers a broad range of functionalities for natural language processing. Tokenizing, stemming, classification, phonetics, tf-idf, WordNet, string similarity, and some inflections are currently supported. It’s still in the early stages, so we’re very interested in bug reports, contributions and the like. Note that many algorithms from Rob Ellis’s node-nltools are being merged into this project and will be maintained from here onward. ...
    Downloads: 0 This Week
  • 4
    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. From a grammar, ANTLR generates a parser that can build and walk parse trees. It is widely used in academia and industry to build all sorts of languages, tools, and frameworks.
    Downloads: 3 This Week
  • 5
    HarfBuzz

    Open source text shaping engine

    HarfBuzz is an open source text-shaping engine with a C API that turns fonts and strings of character codes into a form that is correctly arranged for the corresponding language and writing system. This is essentially the process of text shaping: translating a string of character codes into a properly arranged sequence of glyphs that can be rendered onto a screen or into final output form for inclusion in a document. This shaping depends on a number of factors: the input string, the active...
    Downloads: 14 This Week
  • 6
    Bulgarian language support

    Spell check, grammar check and hyphenation for the Bulgarian language

    The goal of this project is to provide spell check, grammar check and hyphenation for the Bulgarian language in open source products such as OpenOffice.org, LibreOffice, TeX, aspell, ispell, hunspell, etc.
    Downloads: 20 This Week
  • 7
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Downloads: 200 This Week
  • 8
    OmegaT - multiplatform CAT tool

    The free computer-aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Downloads: 1,764 This Week
  • 9
    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 0 This Week
  • 10
    GATE
    Note: the source code and issue tracker have moved to GitHub (https://github.com/GateNLP/). GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 6 This Week
  • 11
    ScreenTranslate

    Translate any text on your Mac screen: capture or select, instantly.

    ScreenTranslate lets you translate any text on your Mac screen without switching tabs or copy-pasting. Screen Capture Translation: Press Cmd+Shift+T, drag over any text on screen, and get an instant translation popup. Works with images, PDFs, and subtitles using OCR (Apple Vision). Text Selection Translation: Select text in any app and press Cmd+Option+Z to translate directly. No OCR needed. - Free and open-source (GPL-3.0) - On-device translation using Apple Translation - Works...
    Downloads: 31 This Week
  • 12
    Ol.Text

    An implementation of the Rx text transformation scripting language.

    Rx is a simple scripting language, based on regular expressions, designed to transform text. The Ol.Text project is an Rx implementation for the .NET Framework (>= 4.5), .NET Standard (>= 2.0) and .NET (>= 6.0) platforms.
    Downloads: 0 This Week
  • 13
    ChordSmith

    ChordPro editor that can display, transpose and print song sheets.

    ChordSmith is a ChordPro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including harmonica tabs) between ChordPro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/
    Downloads: 58 This Week
  • 14
    SciTECO

    Advanced TECO dialect and interactive screen editor based on Scintilla

    SciTECO is an interactive TECO dialect, similar to Video TECO. It also adds features from classic TECO-11, as well as unique new ideas. Project development takes place here: https://git.fmsbw.de/sciteco The download archive is mirrored at SourceForge, but for nightly builds check out: https://sciteco.fmsbw.de/downloads/nightly/
    Downloads: 2 This Week
  • 15
    TextPaint

    Text editing and painting software

    TextPaint is text editing and painting software. You can edit text files in the application.
    Downloads: 0 This Week
  • 16
    Leseratte is a Java parser for written German. It currently contains a German lexicon (based on Wiktionary), inflection rules, a grammar and a parser; a semantics component is planned. It is usable as a Java library and also provides a graphical UI.
    Downloads: 0 This Week
  • 17
    fastText

    Library for fast text classification and representation

    FastText is an open-source, free, lightweight library that allows users to learn text representations and text classifiers. It works on standard, generic hardware. Models can later be reduced in size to even fit on mobile devices. Text classification is a core problem in many applications, like spam detection, sentiment analysis or smart replies. In this tutorial, we describe how to build a text classifier with the fastText tool. The goal of text classification is to assign documents (such as...
    Downloads: 0 This Week
  • 18
    Web Book Downloader

    Download websites as e-books: PDF, TXT, EPUB.

    This application lets the user download chapters from a website in three ways: from the table of contents; from a range (first chapter address to last chapter address); or by crawling from the first chapter to chapter n. In the settings you can customize the language and the input (website) encoding; for simplicity, the output uses the same encoding. To add your own language, add a new class to the strings package, and new fields to the Settings class and the GUI menu (initialize method).
    Downloads: 3 This Week
  • 19
    Common Litt

    Simple JavaScript library for automatic transliteration; an input tool.

    This project focuses on automatic conversion between language alphabets. With the 'lit.js' library you can currently convert between English, Tamil and Sinhala scripts, in either direction. This is useful when you need to know how to write something in another language. It is still at the development stage but works well and is easy to customize. Live demo available at: http://commonlitt.42web.io/ For UI creation I used Bootstrap and jQuery. For easy array...
    Downloads: 0 This Week
  • 20
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 114 This Week
  • 21
    cebe/markdown

    A super fast, highly extensible markdown parser for PHP

    ...It is a set of PHP classes, each representing a Markdown flavor, and a command line tool for converting Markdown files to HTML files. The implementation focus is to be fast (see benchmark) and extensible. You can add language elements by hooking directly into the parser; no (possibly error-prone) post- or pre-processing is needed to extend the language. It is also well tested, to provide the best rendering results even in edge cases where other parsers fail.
    Downloads: 0 This Week
  • 22
    Arabic Keyboard

    Arabic Keyboard Phonetic

    Can't remember the position of Arabic letters on your keyboard? Try this phonetic Arabic keyboard layout. Letters correspond to Latin / English / German letters of similar pronunciation, which makes typing much faster and easier. Example: the Arabic letter Meem "م" can be typed by pressing "M". The layout can be used online at arab-key.com or installed as a system language for macOS and Windows. Includes Arabic, Persian and Kurdish letters. All features are free.
    Downloads: 159 This Week
  • 23
    Discriminative Language Editor

    Discriminative language editor based on ontologies

    Text editor in Java that is able to detect discriminative expressions while the user is typing. When the internal ontology-based analyzer detects a potential discriminative expression the user is advised by underscoring the related words in the text. A descriptive message about the issue is also shown to the user when the cursor is placed over the potential discriminative expression.
    Downloads: 0 This Week
  • 24
    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies for processing information with AI techniques in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, JCLTP can be extended, adapted and given new functionality.
    Downloads: 2 This Week
  • 25
    Notepad3

    Light-weight Scintilla-based text editor with syntax highlighting

    Notepad3 is a fast and light-weight Scintilla-based text editor with syntax highlighting. Notepad3 is an excellent replacement for the default Windows text editor. Notepad3 offers many extra features over Notepad. It has a small memory footprint, but is powerful enough to handle most programming jobs.
    Downloads: 8 This Week