C++ Linguistics Software

View 202 business solutions

Browse free open source C++ Linguistics Software and projects below. Use the toggles on the left to filter open source C++ Linguistics Software by OS, license, language, programming language, and project status.

  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Crow Translate

    Crow Translate

    Lightweight translator that allows you to translate and speak text

    Crow Translate is a simple and lightweight translator written in C++ / Qt that allows you to translate and speak text using Google, Yandex, Bing, LibreTranslate and Lingva translate API. You may also be interested in my library QOnlineTranslator used in this project. Wayland does not support global shortcuts registration, but you can use D-Bus to bind actions in the system settings. For desktop environments that support additional applications actions (KDE, for example) you will see them predefined in the system shortcut settings. You can also use them for X11 sessions, but you need to disable global shortcuts registration in the application settings to avoid conflicts. On Linux to make the application look native on a non-KDE desktop environment, you need to configure Qt applications styling.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 2
    OPolyglot

    OPolyglot

    Your Private Offline Translator

    OPolyglot is a fast and fully offline translator that respects your privacy. The application is powered by the robust "Bergamot" engine, allowing you to perform all translations locally on your device without the need for an internet connection. At the current stage of development, OPolyglot offers two key features: Simple Translator: A convenient tool for classic translation of typed or copied text. Screen Translator: A smart feature that allows you to take a screenshot of any screen area, automatically recognize the text (OCR), and instantly translate it. AppImage: https://sourceforge.net/projects/opolyglot/files/v0.2.2/OPolyglot-x86_64.AppImage/download snap: https://snapcraft.io/opolyglot flatpak: https://sourceforge.net/projects/opolyglot/files/v0.2.2/opolyglot-x86_64.flatpak/download windows: https://sourceforge.net/projects/opolyglot/files/v0.2.2/opsetup.exe/download source code: https://github.com/fenenko/OPolyglot
    Leader badge
    Downloads: 341 This Week
    Last Update:
    See Project
  • 3

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 146 This Week
    Last Update:
    See Project
  • 4
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex Tchacos (Gospel of Judas); Askew Codex (Pistis Sophia); Bruce Codex (Books of Jeu) Overview of sources of early christianity in Coptic, Greek and Latin languages: Septuagint (LXX); Greek New Testament; Coptic New Testament (Sahidic, Bohairic); Latin Vulgate
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Apertium: Machine Translation Toolbox

    Apertium: Machine Translation Toolbox

    The free and open-source rule-based machine translation platform

    Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    GrammarChecker

    GrammarChecker

    Grammar checker for any text field via keyboard shortcut using LLM

    GrammarChecker is a utility for instant grammar correction in any text field via keyboard shortcut, powered by Large Language Models (LLM). Grammar Checker utilizes the OpenAI API, supporting both OpenAI and Google Gemini models (It is possible to add other custom models via config), and requires an API key to function. Simply select the text you want to check and trigger the assigned shortcut. Grammar Checker can be automatically launched on startup and run in the system tray. The main window provides a user-friendly interface where you can assign any specific keyboard shortcut for triggering grammar checks or adjust the prompt requirements to align with your specific writing needs, such as translating text into different language or something else.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 7
    Thinknowlogy

    Thinknowlogy

    The world's only naturally intelligent knowledge technology

    Natural intelligence is the utilization of naturally occurring logic. This naturally occurring logic provides concrete clues for organizing natural objects, like: - Grouping objects that belong together, - Separating objects that don't belong together, - Archiving objects that have become less important. Natural language and spatial information are sources of natural intelligence: - Natural language is providing concrete logic for organizing knowledge objects, - Spatial information provides concrete logic for organizing spatial objects (utilized in, e.g., self-driving cars). In this way, our brains know how to organize their knowledge and spatial information. I focus on natural language because this source of natural intelligence is hardly understood by scientists. Hence, the inability of Large Language Models to organize changes in their knowledge independently.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary. More useful references to check: ------------------------------------------- https://sites.google.com/site/mouradabbas9/corpora
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    AzConvert is an open source program to convert different scripts of Azerbaijani language (Latin, Arabic and Cyrillic) to each other. It's written in Qt.
    Downloads: 4 This Week
    Last Update:
    See Project
  • AI Agents That Actually Do the Work Icon
    AI Agents That Actually Do the Work

    Assign real work to AI teammates that know your projects, priorities, and deadlines.

    ClickUp's Super Agents run 24/7 inside your workspace: triaging bugs, drafting content, updating statuses, and routing tasks without being told twice. Connect them to 500+ tools and let them execute, not just suggest. Build custom agents in minutes that understand your workflows and act on them autonomously.
    Try ClickUp Free
  • 10
    Grammar-multi is most useful for languages which words have many forms («more» inflected languages), and for which grammatical agreement (and other syntactic connections) in a sentence is «more» important and «obvious». Need a help of linguists. Program is not for every-day use, but to show Grammar is working. If you want your language Grammar version - tell me.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    DisMo

    DisMo

    A POS, disfluency and multi-word unit annotator for spoken language

    DisMo is a part-of-speech, disfluency and multi-word unit automatic annotator. It is designed to manage the complexities and phenomena specific to spoken language. It currently supports English and French, with support for more languages coming soon. It is developed and maintained by George Christodoulides (Centre Valibel, IL&C, University of Louvain, Louvain-la-Neuve, Belgium). Visit www.corpusannotation.org to find out more about DisMo and other annotation tools for language corpora. If you are using DisMo to annotate your corpus, please cite the following paper: Christodoulides, George; Avanzi, Mathieu; Goldman, Jean-Philippe. DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC) 2014, Reykjavik, Iceland, 26-31 May 2014, pp. 3902-3907.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    A project that aims to create reusable components (C++ libraries, COM components, and Edit controls) for Phonetic Transliteration of Indian languages, such as Telugu, Tamil, Kannada etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Affisix
    Affisix is a program for automatic recognition of prefixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    C4 is a C++ class library for analyzing sound files, particularly spoken and sung phonations. C4 provides features such as frequency analysis, pitch extraction, or calculation of voice quality parameters (e.g. alpha ratio, HNR, jitter, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Core program and associated utilities for building a machine translation system using the Example-Based paradigm, where previously-translated text is used to infer new translations of previously-unseen text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Dualword-PMC

    PMC browser

    PubMed Central browser. Source code: http://github.com/dualword/dualword-pmc/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Fast Fuzzy Inference System
    FFIS or Fast Fuzzy Inference System is a portable and optimized implementation of Fuzzy Inference Systems. It supports both Mamdani and Takagi-Sugeno methods. The main idea behind this tool, is to provide case-special techniques rather than general solutions to resolve complicated mathematical calculations. This will lead to have more efficient defuzzification algorithms for Mamdani's model. Most systems in Mamdani's model can be defuzzified in O(n²) or even O(n) time which n is number of membership functions in output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    FreeLM

    Tool for creating, pruning and interpolating N-gram models

    This project, currently in its early stages, aims to implement N-gram counting, smoothing, and interpolation. The project may eventually incorporate a number of smoothing methods, but initially it will incorporate just one, which is not a standard method. It's similar in spirit but modified, interpolated Kneser-Ney, but uses the principle that the amount you discount should always be the amount that gets given to the backoff state (which makes the traditional count-of-counts stuff and the associated formulas for obtaining smoothing constants unusuable). This gives slightly better results than KN for standard LM building, but has larger advantages for interpolation and pruning. See <a href=freelm.sf.net> here </a> for documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Fresh Memory

    Fresh Memory

    Flashcards application with Spaced Repetition method

    Fresh Memory is an application that helps to learn large amounts of any material with Spaced Repetition method. The most important subject is learning foreign words, but Fresh Memory can be also used to learn anything else. The learning data is stored as flash cards and dictionaries. The flash cards may have several fields, and the user controls what combination of fields to learn. The flashcards can have formatted text and images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Fast Unsupervised Sentence Aligner described in "Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora", COLING 2010. NEWS : release 1.0b : bug fixed (release1.0a deprecated).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Genie
    Genie is a highly sophisticated cognitive child-machine. Genie at its core is an artificial intelligence project, focusing on creating a new form of life.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Hebrew Deflector

    A proram to de-inflect modern Hebrew words

    Hebrew Deflector tries to guess the root, the pattern and the form of a modern Hebrew word provided by the user. It uses the existing rules of the language to do that, and displays the list of possible answers. It is not a dictionary and it doesn't know whether the word (and the listed forms of it) exists or not. It also doesn't know anything about exception to the rules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo