C++ Linguistics Software

View 201 business solutions

Browse free open source C++ Linguistics Software and projects below. Use the toggles on the left to filter open source C++ Linguistics Software by OS, license, language, programming language, and project status.

  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    OPolyglot

    OPolyglot

    Your Private Offline Translator

    OPolyglot is a fast and fully offline translator that respects your privacy. The application is powered by the robust "Bergamot" engine, allowing you to perform all translations locally on your device without the need for an internet connection. At the current stage of development, OPolyglot offers two key features: Simple Translator: A convenient tool for classic translation of typed or copied text. Screen Translator: A smart feature that allows you to take a screenshot of any screen area, automatically recognize the text (OCR), and instantly translate it. OPolyglot is an excellent assistant for reading foreign documents, playing games, or working with unlocalized applications, all while keeping your data completely secure. source code: https://github.com/fenenko/OPolyglot
    Leader badge
    Downloads: 1,871 This Week
    Last Update:
    See Project
  • 2

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 223 This Week
    Last Update:
    See Project
  • 3
    Crow Translate

    Crow Translate

    Lightweight translator that allows you to translate and speak text

    Crow Translate is a simple and lightweight translator written in C++ / Qt that allows you to translate and speak text using Google, Yandex, Bing, LibreTranslate and Lingva translate API. You may also be interested in my library QOnlineTranslator used in this project. Wayland does not support global shortcuts registration, but you can use D-Bus to bind actions in the system settings. For desktop environments that support additional applications actions (KDE, for example) you will see them predefined in the system shortcut settings. You can also use them for X11 sessions, but you need to disable global shortcuts registration in the application settings to avoid conflicts. On Linux to make the application look native on a non-KDE desktop environment, you need to configure Qt applications styling.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    XBNF Neurotranslator compiler

    XBNF Neurotranslator compiler

    (X)BNF simple and clever translation grammar compiler

    XBNF Neurotranslator is a powerfull extended BNF grammar language to handle translations easily and many features to handle different kind of situations. This project is for common arch binaries, C++ sources, tests & support tickets. No installation, juste get binary for your architecture : > See [Files] > binary.{version} Library of smart samples of grammars> https://sourceforge.net/projects/xbnf/ Docker image which embeds the Linux/64bits binary and the library. https://hub.docker.com/r/damolab/neurotranslator/ Docker image with GNU C++ toolchain to build the xbnf command: https://hub.docker.com/r/damolab/neurotranslator-compil French blog dedicated to XBNF : https://damolab.zapto.org/xbnf/
    Downloads: 46 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex Tchacos (Gospel of Judas); Askew Codex (Pistis Sophia); Bruce Codex (Books of Jeu) Overview of sources of early christianity in Coptic, Greek and Latin languages: Septuagint (LXX); Greek New Testament; Coptic New Testament (Sahidic, Bohairic); Latin Vulgate
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 6
    Apertium: Machine Translation Toolbox

    Apertium: Machine Translation Toolbox

    The free and open-source rule-based machine translation platform

    Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. Abbas, K. Smaili (2005) Comparison of Topic Identification Methods for Arabic Language, RANLP05 : Recent Advances in Natural Language Processing ,pp. 14-17, 21-23 september 2005, Borovets, Bulgary. More useful references to check: ------------------------------------------- https://sites.google.com/site/mouradabbas9/corpora
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • 8
    AzConvert is an open source program to convert different scripts of Azerbaijani language (Latin, Arabic and Cyrillic) to each other. It's written in Qt.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Helsinki Finite-State Technology
    The Helsinki Finite-State Transducer toolkit is intended for processing natural language morphologies. The toolkit is demonstrated by wide-coverage implementations of a number of languages of varying morphological complexity.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    Fresh Memory

    Fresh Memory

    Flashcards application with Spaced Repetition method

    Fresh Memory is an application that helps to learn large amounts of any material with Spaced Repetition method. The most important subject is learning foreign words, but Fresh Memory can be also used to learn anything else. The learning data is stored as flash cards and dictionaries. The flash cards may have several fields, and the user controls what combination of fields to learn. The flashcards can have formatted text and images.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Lyreword
    Lyreword is a flexible word generator for writers, role players, conlangers and everybody who seeks some inspiration for inventing words and names.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    mgiza has now moved to github https://github.com/moses-smt/mgiza
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Affisix
    Affisix is a program for automatic recognition of prefixes. It takes large amount of words and according to the user setting it tries to determine which segments of these words are prefixes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Based on the Buckwalter Morphological Analyzer (Version 1.0) for doing Arabic stemming and POS tagging. Includes a rewrite of the original Perl script, with better documentation and more flexible options, and a C++ interface (usable as a library or app).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    C4 is a C++ class library for analyzing sound files, particularly spoken and sung phonations. C4 provides features such as frequency analysis, pitch extraction, or calculation of voice quality parameters (e.g. alpha ratio, HNR, jitter, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Core program and associated utilities for building a machine translation system using the Example-Based paradigm, where previously-translated text is used to infer new translations of previously-unseen text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DisMo

    DisMo

    A POS, disfluency and multi-word unit annotator for spoken language

    DisMo is a part-of-speech, disfluency and multi-word unit automatic annotator. It is designed to manage the complexities and phenomena specific to spoken language. It currently supports English and French, with support for more languages coming soon. It is developed and maintained by George Christodoulides (Centre Valibel, IL&C, University of Louvain, Louvain-la-Neuve, Belgium). Visit www.corpusannotation.org to find out more about DisMo and other annotation tools for language corpora. If you are using DisMo to annotate your corpus, please cite the following paper: Christodoulides, George; Avanzi, Mathieu; Goldman, Jean-Philippe. DisMo: A Morphosyntactic, Disfluency and Multi-Word Unit Annotator. An Evaluation on a Corpus of French Spontaneous and Read Speech. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC) 2014, Reykjavik, Iceland, 26-31 May 2014, pp. 3902-3907.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Dualword-PMC

    PMC browser

    PubMed Central browser. Source code: http://github.com/dualword/dualword-pmc/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    EzerKb is a virtual keyboard for Windows. It emulates a keyboard with, for example, Russian, Greek, or Hebrew characters without actually installing a keyboard driver for that language. EzerKb works with most (but not all) Windows programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Fast Fuzzy Inference System
    FFIS or Fast Fuzzy Inference System is a portable and optimized implementation of Fuzzy Inference Systems. It supports both Mamdani and Takagi-Sugeno methods. The main idea behind this tool, is to provide case-special techniques rather than general solutions to resolve complicated mathematical calculations. This will lead to have more efficient defuzzification algorithms for Mamdani's model. Most systems in Mamdani's model can be defuzzified in O(n²) or even O(n) time which n is number of membership functions in output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    FreeLM

    Tool for creating, pruning and interpolating N-gram models

    This project, currently in its early stages, aims to implement N-gram counting, smoothing, and interpolation. The project may eventually incorporate a number of smoothing methods, but initially it will incorporate just one, which is not a standard method. It's similar in spirit but modified, interpolated Kneser-Ney, but uses the principle that the amount you discount should always be the amount that gets given to the backoff state (which makes the traditional count-of-counts stuff and the associated formulas for obtaining smoothing constants unusuable). This gives slightly better results than KN for standard LM building, but has larger advantages for interpolation and pruning. See <a href=freelm.sf.net> here </a> for documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Fast Unsupervised Sentence Aligner described in "Improved unsupervised sentence alignment for symmetrical and asymmetrical parallel corpora", COLING 2010. NEWS : release 1.0b : bug fixed (release1.0a deprecated).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Genie
    Genie is a highly sophisticated cognitive child-machine. Genie at its core is an artificial intelligence project, focusing on creating a new form of life.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Grammar-multi is most useful for languages which words have many forms («more» inflected languages), and for which grammatical agreement (and other syntactic connections) in a sentence is «more» important and «obvious». Need a help of linguists. Program is not for every-day use, but to show Grammar is working. If you want your language Grammar version - tell me.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB