C++ Linguistics Software

View 202 business solutions

Browse free open source C++ Linguistics Software and projects below. Use the toggles on the left to filter open source C++ Linguistics Software by OS, license, language, programming language, and project status.

  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 1
    OPolyglot

    OPolyglot

    Your Private Offline Translator

    OPolyglot is a fast and fully offline translator that respects your privacy. The application is powered by the robust "Bergamot" engine, allowing you to perform all translations locally on your device without the need for an internet connection. At the current stage of development, OPolyglot offers two key features: Simple Translator: A convenient tool for classic translation of typed or copied text. Screen Translator: A smart feature that allows you to take a screenshot of any screen area, automatically recognize the text (OCR), and instantly translate it. OPolyglot is an excellent assistant for reading foreign documents, playing games, or working with unlocalized applications, all while keeping your data completely secure. source code: https://github.com/fenenko/OPolyglot
    Leader badge
    Downloads: 1,318 This Week
    Last Update:
    See Project
  • 2
    Crow Translate

    Crow Translate

    Lightweight translator that allows you to translate and speak text

    Crow Translate is a simple and lightweight translator written in C++ / Qt that allows you to translate and speak text using Google, Yandex, Bing, LibreTranslate and Lingva translate API. You may also be interested in my library QOnlineTranslator used in this project. Wayland does not support global shortcuts registration, but you can use D-Bus to bind actions in the system settings. For desktop environments that support additional applications actions (KDE, for example) you will see them predefined in the system shortcut settings. You can also use them for X11 sessions, but you need to disable global shortcuts registration in the application settings to avoid conflicts. On Linux to make the application look native on a non-KDE desktop environment, you need to configure Qt applications styling.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 196 This Week
    Last Update:
    See Project
  • 4
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex Tchacos (Gospel of Judas); Askew Codex (Pistis Sophia); Bruce Codex (Books of Jeu) Overview of sources of early christianity in Coptic, Greek and Latin languages: Septuagint (LXX); Greek New Testament; Coptic New Testament (Sahidic, Bohairic); Latin Vulgate
    Leader badge
    Downloads: 56 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    Apertium: Machine Translation Toolbox

    Apertium: Machine Translation Toolbox

    The free and open-source rule-based machine translation platform

    Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 6

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 7

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary. More useful references to check: ------------------------------------------- https://sites.google.com/site/mouradabbas9/corpora
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    AzConvert is an open source program to convert different scripts of Azerbaijani language (Latin, Arabic and Cyrillic) to each other. It's written in Qt.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    XBNF Neurotranslator compiler

    XBNF Neurotranslator compiler

    (X)BNF simple and clever translation grammar compiler

    XBNF Neurotranslator is a powerfull extended BNF grammar language to handle translations easily and many features to handle different kind of situations. This project is for common arch binaries, C++ sources, tests & support tickets. No installation, juste get binary for your architecture : > See [Files] > binary.{version} Library of smart samples of grammars> https://sourceforge.net/projects/xbnf/ Docker image which embeds the Linux/64bits binary and the library. https://hub.docker.com/r/damolab/neurotranslator/ Docker image with GNU C++ toolchain to build the xbnf command: https://hub.docker.com/r/damolab/neurotranslator-compil French blog dedicated to XBNF : https://damolab.zapto.org/xbnf/
    Downloads: 3 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    DisMo

    DisMo

    A POS, disfluency and multi-word unit annotator for spoken language

    DisMo is a part-of-speech, disfluency and multi-word unit automatic annotator. It is designed to manage the complexities and phenomena specific to spoken language. It currently supports English and French, with support for more languages coming soon. It is developed and maintained by George Christodoulides (Centre Valibel, IL&C, University of Louvain, Louvain-la-Neuve, Belgium). Visit www.corpusannotation.org to find out more about DisMo and other annotation tools for language corpora. If you are using DisMo to annotate your corpus, please cite the following paper: Christodoulides, George; Avanzi, Mathieu; Goldman, Jean-Philippe. DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC) 2014, Reykjavik, Iceland, 26-31 May 2014, pp. 3902-3907.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    GrammarChecker

    GrammarChecker

    Grammar checker for any text field via keyboard shortcut using LLM

    GrammarChecker is a utility for instant grammar correction in any text field via keyboard shortcut, powered by Large Language Models (LLM). Grammar Checker utilizes the OpenAI API, supporting both OpenAI and Google Gemini models (It is possible to add other custom models via config), and requires an API key to function. Simply select the text you want to check and trigger the assigned shortcut. Grammar Checker can be automatically launched on startup and run in the system tray. The main window provides a user-friendly interface where you can assign any specific keyboard shortcut for triggering grammar checks or adjust the prompt requirements to align with your specific writing needs, such as translating text into different language or something else.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    OpenNLC

    Natural Language Compiler

    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    A project that aims to create reusable components (C++ libraries, COM components, and Edit controls) for Phonetic Transliteration of Indian languages, such as Telugu, Tamil, Kannada etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Affisix
    Affisix is a program for automatic recognition of prefixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    C4 is a C++ class library for analyzing sound files, particularly spoken and sung phonations. C4 provides features such as frequency analysis, pitch extraction, or calculation of voice quality parameters (e.g. alpha ratio, HNR, jitter, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Core program and associated utilities for building a machine translation system using the Example-Based paradigm, where previously-translated text is used to infer new translations of previously-unseen text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Dualword-PMC

    PMC browser

    PubMed Central browser. Source code: http://github.com/dualword/dualword-pmc/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Fast Fuzzy Inference System
    FFIS or Fast Fuzzy Inference System is a portable and optimized implementation of Fuzzy Inference Systems. It supports both Mamdani and Takagi-Sugeno methods. The main idea behind this tool, is to provide case-special techniques rather than general solutions to resolve complicated mathematical calculations. This will lead to have more efficient defuzzification algorithms for Mamdani's model. Most systems in Mamdani's model can be defuzzified in O(n²) or even O(n) time which n is number of membership functions in output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    FreeLM

    Tool for creating, pruning and interpolating N-gram models

    This project, currently in its early stages, aims to implement N-gram counting, smoothing, and interpolation. The project may eventually incorporate a number of smoothing methods, but initially it will incorporate just one, which is not a standard method. It's similar in spirit but modified, interpolated Kneser-Ney, but uses the principle that the amount you discount should always be the amount that gets given to the backoff state (which makes the traditional count-of-counts stuff and the associated formulas for obtaining smoothing constants unusuable). This gives slightly better results than KN for standard LM building, but has larger advantages for interpolation and pruning. See <a href=freelm.sf.net> here </a> for documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Fresh Memory

    Fresh Memory

    Flashcards application with Spaced Repetition method

    Fresh Memory is an application that helps to learn large amounts of any material with Spaced Repetition method. The most important subject is learning foreign words, but Fresh Memory can be also used to learn anything else. The learning data is stored as flash cards and dictionaries. The flash cards may have several fields, and the user controls what combination of fields to learn. The flashcards can have formatted text and images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Fast Unsupervised Sentence Aligner described in "Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora", COLING 2010. NEWS : release 1.0b : bug fixed (release1.0a deprecated).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Genie
    Genie is a highly sophisticated cognitive child-machine. Genie at its core is an artificial intelligence project, focusing on creating a new form of life.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Grammar-multi is most useful for languages which words have many forms («more» inflected languages), and for which grammatical agreement (and other syntactic connections) in a sentence is «more» important and «obvious». Need a help of linguists. Program is not for every-day use, but to show Grammar is working. If you want your language Grammar version - tell me.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB