Linguistics Software

View 2323 business solutions

Browse free open source Linguistics software and projects below. Use the toggles on the left to filter open source Linguistics software by OS, license, language, programming language, and project status.

  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • Secure remote access solution to your private network, in the cloud or on-prem. Icon
    Secure remote access solution to your private network, in the cloud or on-prem.

    Deliver secure remote access with OpenVPN.

    OpenVPN is here to bring simple, flexible, and cost-effective secure remote access to companies of all sizes, regardless of where their resources are located.
    Get started — no credit card required.
  • 1
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering, 19(2), 259-284. Rasooli, M. S., Kahefi, O., & Minaei-Bidgoli, B. (2011). Effect of adaptive spell checking in Persian. In NLP-KE Contributors: Omid Kashefi Azadeh Zamanifar Masoumeh Mashaiekhi Meisam Pourafzal Reza Refaei Mohammad Hedayati Kamiar Kanani Mehrdad Senobari Sina Iravanin Mohammad Sadegh Rasooli Mohsen Hoseinalizadeh Mitra Nasri Alireza Dehlaghi Fatemeh Ahmadi Neda PourMorteza
    Leader badge
    Downloads: 598 This Week
    Last Update:
    See Project
  • 2
    iramuteq
    IRAMUTEQ : Interface de R pour les Analyses Multidimensionnelles de Textes et de Questionnaires. Logiciel de traitement de données pour des corpus texte ou de type individus/caractères. Permet notamment de réaliser des analyses de type "ALCESTE"
    Leader badge
    Downloads: 898 This Week
    Last Update:
    See Project
  • 3
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Leader badge
    Downloads: 175 This Week
    Last Update:
    See Project
  • 4
    OpenAI Translator

    OpenAI Translator

    Browser extension and cross-platform desktop app based on ChatGPT API

    Browser extension and cross-platform desktop application for translation based on ChatGPT API. I have developed a Bob plugin that utilizes ChatGPT API to provide global word translation on macOS. However, since not all users have access to macOS to benefit from the plugin, I have created this project! What began as a Chrome extension has now evolved into a multi-platform desktop app that I am currently developing. The desktop application does not support the pop-up icon after word selection. You must press the shortcut key to trigger the translation after selecting a word. It offers three modes: translation, polishing and summarization. Our tool allows for mutual translation, polishing and summarization across 55 different languages. Streaming mode is supported! It allows users to customize their translation text. One-click copying, Text-to-Speech (TTS). Available on all platforms (Windows, macOS, and Linux) for both browsers and Desktop.
    Downloads: 21 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • 5
    Artha ~ The Open Thesaurus
    Artha is a handy thesaurus based on WordNet with distinct features like global hotkey look-up, passive desktop notifications, regular expression based search, etc.. Artha may be used as a free open-source replacement to the proprietary WordWeb Pro.
    Leader badge
    Downloads: 95 This Week
    Last Update:
    See Project
  • 6
    Crow Translate

    Crow Translate

    Lightweight translator that allows you to translate and speak text

    Crow Translate is a simple and lightweight translator written in C++ / Qt that allows you to translate and speak text using Google, Yandex, Bing, LibreTranslate and Lingva translate API. You may also be interested in my library QOnlineTranslator used in this project. Wayland does not support global shortcuts registration, but you can use D-Bus to bind actions in the system settings. For desktop environments that support additional applications actions (KDE, for example) you will see them predefined in the system shortcut settings. You can also use them for X11 sessions, but you need to disable global shortcuts registration in the application settings to avoid conflicts. On Linux to make the application look native on a non-KDE desktop environment, you need to configure Qt applications styling.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 7
    Open data for a Khmer language corpus and lexicographic data that can be used for the development of free language tools for Khmer language, such as automatic translators, dictionaries, linguistic analysis tools, etc.
    Downloads: 72 This Week
    Last Update:
    See Project
  • 8
    Marcion

    Marcion

    The study environment of ancient languages (Coptic, Greek, Latin)

    Marcion is a software forming a study environment of ancient languages (esp. Coptic, Greek, Latin) and providing many tools and resources (dictionaties, grammars, texts). Although Marcion is focused on to study the gnosticism and early christianity, it is an universal library working with various file formats and allowing to collect, organize and backup texts of any kind. Overview of gnostic sources in Coptic language delivered with Marcion: Nag Hammadi Library; Berlin Codex; Codex Tchacos (Gospel of Judas); Askew Codex (Pistis Sophia); Bruce Codex (Books of Jeu) Overview of sources of early christianity in Coptic, Greek and Latin languages: Septuagint (LXX); Greek New Testament; Coptic New Testament (Sahidic, Bohairic); Latin Vulgate
    Leader badge
    Downloads: 65 This Week
    Last Update:
    See Project
  • 9
    Argos Translate

    Argos Translate

    Open-source offline translation library written in Python

    Argos Translate uses OpenNMT for translations and can be used as either a Python library, command-line, or GUI application. Argos Translate supports installing language model packages which are zip archives with a ".argosmodel" extension containing the data needed for translation. LibreTranslate is an API and web-app built on top of Argos Translate. Argos Translate also manages automatically pivoting through intermediate languages to translate between languages that don't have a direct translation between them installed. For example, if you have a es → en and en → fr translation installed you are able to translate from es → fr as if you had that translation installed. This allows for translating between a wide variety of languages at the cost of some loss of translation quality.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Red Hat Ansible Automation Platform on Microsoft Azure Icon
    Red Hat Ansible Automation Platform on Microsoft Azure

    Red Hat Ansible Automation Platform on Azure allows you to quickly deploy, automate, and manage resources securely and at scale.

    Deploy Red Hat Ansible Automation Platform on Microsoft Azure for a strategic automation solution that allows you to orchestrate, govern and operationalize your Azure environment.
    Learn More
  • 10

    Wordcorr

    Data management for comparative linguistics

    Wordcorr automates the tedious and risky process of tabulating and managing the sound correspondences used in working out the historical development of natural languages. Initial support was from NSF.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 11
    WordNet Database in various SQL format
    Leader badge
    Downloads: 48 This Week
    Last Update:
    See Project
  • 12
    google-translate-api

    google-translate-api

    A free and unlimited API for Google Translate

    A free and unlimited API for Google Translate. The language in which the text should be translated. Must be one of the codes/names (not case sensitive) contained in languages.js. Auto language detection, spelling correction, language correction. Fast and reliable – it uses the same servers that translate.google.com uses.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 110 This Week
    Last Update:
    See Project
  • 14
    Free Dictionaries
    Free translating dictionaries. Source format: TEI-P5 XML. Delivery formats: DICT, Stardict, etc. The dictionaries may include information on the pronunciation, etymology and such, in a platform-independent format. Access: web/plugins/standalone.
    Leader badge
    Downloads: 108 This Week
    Last Update:
    See Project
  • 15
    GoogleTranslate

    GoogleTranslate

    GoogleTranslate

    Google Translate Mac Client. All known issues have been fixed and the user experience has been optimized, but there may still be a few bugs. In the new version, no matter which translation engine you use, it will first call the detection language interface of domestic Google Translate. In this case, the traffic of your proxy node is abnormal, which causes the request to be intercepted by Google, and you need to enter the verification code (you can also use + + to open the debugging optiontool commandto Icheck whether the request result is correct), so if this problem occurs, please replace your Proxy node or directly use domestic Google engine without proxy. Enter in the terminal sudo spctl –master-disableand press Enter to confirm. The password is the system power-on password.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Translate Shell

    Translate Shell

    Command-line translator using Google Translate, Bing Translator, etc.

    Translate Shell (formerly Google Translate CLI) is a command-line translator powered by Google Translate (default), Bing Translator, Yandex.Translate, and Apertium. It gives you easy access to one of these translation engines in your terminal. By default, translations with detailed explanations are shown. You can also translate the text briefly (only the most relevant translation will be shown). Translate Shell can also be used like an interactive shell; input the text to be translated line by line. Translate Shell is known to work on many POSIX-compliant systems. You may use Translate Shell from any Unix shell of your choice (bash, zsh, ksh, tcsh, fish, etc.); however, the wrapper script requires either bash or zsh installed. It is a must to have corresponding fonts for the language(s) / script(s) you wish to display in your terminal.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    Apertium: Machine Translation Toolbox

    Apertium: Machine Translation Toolbox

    The free and open-source rule-based machine translation platform

    Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18
    Saladict

    Saladict

    Pop-up dictionary and page translator with multiple search modes

    All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching. Saladict supports mixed use of multiple search modes. Whether it be single-click, double-click, pop-up icon, hovering or keyboard shortcuts, you name it. The combination of dictionaries is dynamically adjusted according to search text and Saladict profile. Although Saladict is built for looking up English words, it also supports many other languages. Chinese (simplified and traditional), Japanese, Korean are supported with dictionaries. Other languages are supported with several machine translation services. Each dictionary is well polished to a unify style for comfortable reading experience. Built-in Google, Lingocloud and Youdao Page Translation.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 43 This Week
    Last Update:
    See Project
  • 20

    sgmweka

    Weka wrapper for the SGM toolkit for text classification and modeling.

    Weka wrapper for the SGM toolkit for text classification and modeling. Provides Sparse Generative Models for scalable and accurate text classification and modeling for use in high-speed and large-scale text mining. Has lower time complexity of classification than comparable software due to inference based on sparse model representation and use of an inverted index. The provided .zip file is in the Weka package format, giving access to text classification. Other functions are usable through either Java command-line commands or class inclusion into Java projects.
    Leader badge
    Downloads: 43 This Week
    Last Update:
    See Project
  • 21
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... <https://sppas.org>
    Downloads: 37 This Week
    Last Update:
    See Project
  • 22
    Alfanous

    Alfanous

    Quran Search Engine

    Alfanous (The Lantern - الفانوس ) is an Arabic search engine API provide the simple and advanced search in the Holy Quran , more features and many interfaces...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 23
    MisakaTranslator

    MisakaTranslator

    Galgame's Multilingual Real-time Machine Translation Tool

    MisakaProject is a collection of all projects developed based on MisakaTranslator or its auxiliary functions. If you are interested in adding your own projects of related types to MisakaProject, please contact the author. MisakaPatcher added support for plug-in Chinese patch, so this tool is more suitable for players who like manual translation, and also provides another way for Chinese personnel who have difficulty unpacking packets to release Chinese patch. MisakaHookFinder is suitable for some games where the translator can not be used to directly get the text hook method. Users can search for the hook special code by themselves or directly use it to get the source text. At the same time, it also supports the clipboard to output the original text.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    biyi

    biyi

    Biyi (比译) is a convenient translation and dictionary app

    Biyi is a convenient translation and dictionary app written in Flutter.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    oopinyinguide
    OO Pinyin Guide is a Java extension for OpenOffice 3 or higher. It enables the user to add pinyin transliteration over Chinese characters inside a text document. This tool can be useful for people learning or teaching Chinese.
    Leader badge
    Downloads: 31 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next

Guide to Open Source Linguistics Software

Open source linguistics software is software that has been made available to the public by its creators, allowing users to freely make modifications or additions to the code and redistribute it. This type of software can be used for a wide range of tasks in the field of linguistics, such as text analysis, speech recognition, natural language processing (NLP), machine translation, and data mining.

The benefits of open source software are numerous. It is typically cheaper than proprietary alternatives since no license fees are required and there are no restrictions on how users may modify it or reuse it. Also, being open source allows users to collaborate in small teams or entire networks to develop a project faster than would otherwise be possible with any one individual working alone. Furthermore, open-source projects tend to benefit from more constant updates and bug fixes due to there being an ongoing community around them that contributes towards maintaining them. Finally, its availability makes it easier for those who wish to learn about various aspects of programming.

Among some of the most popular open source linguistics software are GATE (General Architecture for Text Engineering), which provides comprehensive tools for performing semantic analysis; NLTK (Natural Language Toolkit), a Python library used for managing large volumes of text; Apache OpenNLP -- an open-source suite used in natural language document processing; SpaCy -- a cutting edge library with modern statistical models for NLP; Stanford CoreNLP -- a collection of highly efficient NLP tools developed at Stanford University; and TensorFlow -- an open machine learning framework specifically designed for developing deep neural network applications related to natural language processing tasks.

These are just some examples among many others currently available that can help researchers create new applications or improve existing ones that involve linguistic processes such as syntactic parsing, part-of-speech tagging, entity recognition, sentiment analysis or text summarization. Ultimately, these solutions have become indispensable resources within the field of linguistics over time due their accessibility and potential customizability by developers worldwide.

Features Offered by Open Source Linguistics Software

  1. Text Analysis: Open source linguistics software is capable of doing various text analysis, such as morphological analysis, syntactic analysis, and semantic analysis. Morphological analysis involves splitting a word into its component morphemes. This helps to identify the stem of a word which is useful for identifying related words or synonyms. Syntactic analysis is used to understand structural relationships among words in order to determine the meaning of a sentence or phrase. Finally, semantic analysis looks at how words are connected and gives context to their usage in order to uncover deeper meanings and provide an understanding of complex matters.
  2. Natural Language Processing (NLP): Natural language processing (NLP) is another feature provided by open source linguistics software. NLP enables computers to interpret human language in order to better comprehend what humans are saying. It is primarily used to build Assistive AI technologies, such as virtual assistants like Alexa or Siri that can help us with tasks based on our verbal commands.
  3. Machine Learning: Open-source linguistics software also offers machine learning capabilities that enable it to learn from its environment and improve over time without any explicit programming by developers. This means that these systems can be continually improved upon based on data they have already collected, leading eventually lead them towards better accuracy and performance over time when utilized within natural language processing applications in particular.
  4. Visualization Tools: Another common feature found in open-source linguistic tools is visualization tools that allow for users to view data graphs so they can observe trends more easily or create custom visualizations depending on the type of task being undertaken. This includes features such as 3D visualization, cluster visualization, heat maps, bar charts etc., which give greater insights into how the system works and how it may be further optimized for better results if needed.

What Types of Open Source Linguistics Software Are There?

  • Grammar Checkers: These programs allow users to check the grammar of a text and ensure it is free of any errors. They also offer suggestions for improvements and can help identify parts of speech, word usage, and other nuances in writing.
  • Text or Speech-to-Text Software: This type of software converts speech into text quickly and accurately. It’s useful for transcribing interviews or speeches without having to manually type them out.
  • Natural Language Processing (NLP) Software: NLP software uses artificial intelligence algorithms to understand language and automatically generate responses based on what it has learned from analyzing large amounts of data. This can be used to help bots better understand user input for customer service applications, as well as automated tools for language translation and summarization.
  • Machine Learning Apps: These apps use machine learning algorithms to analyze text documents and extract key words, topics, sentiments, emotions, etc., allowing users to gain deeper insights from their data more quickly than they would have before.
  • Sentiment Analysis Tools: Sentiment analysis allows users to assess the attitude of a writer towards a particular subject by analyzing the text they have written about it. This type of tool is often used in marketing research or customer feedback analysis.
  • Visualization Tools: Visualization tools take data that has been collected through linguistics research (such as words used most often in a spoken conversation) and display them graphically so that researchers can better understand patterns in language or discourse over time or within different contexts.

Benefits Provided by Open Source Linguistics Software

  1. Cost Savings: Open source linguistics software is free to use, eliminating the need for costly licenses and associated fees. This makes it an attractive option for those who cannot afford more expensive tools or don’t need a full suite of proprietary tools.
  2. Flexibility: Open source linguistics software offers greater flexibility than proprietary software, allowing users to customize the program as needed according to their needs. In addition, users may be able to modify code in order to develop new functionality or correct errors that the original developers may have overlooked.
  3. Community Support: Oftentimes, open source linguistics software has rich support communities that can provide helpful advice or answer technical questions you may have about using the program. Additionally, community members may release updated versions of the software that resolve problems they’ve found with previous versions since they are not beholden by a corporate entity.
  4. Security: By its nature, open source projects make all of their code available publicly which means that any security holes can quickly be identified and patched quickly due to increased visibility and community involvement. On the other hand, proprietary software often takes longer to fix security vulnerabilities because changes must go through a larger organization before being released publicly.
  5. Scalability: Due to its open nature, open source linguistics software allows for scalability with no restrictions on usage or data sizes meaning you can grow your project at your own pace without having any extra costs related to growing your system beyond what was originally planned for.

What Types of Users Use Open Source Linguistics Software?

  • University Student: University students use open source linguistics software for research and language learning in their classes.
  • Linguist: Linguists use open source linguistics software to study, analyze, and experiment with language structures and theories.
  • Software Developer: Software developers utilize open source linguistics software to develop applications that can process and manipulate linguistic data.
  • Language Teacher: Language teachers use open source linguistics software to teach students new languages or gain a deeper understanding of the languages they already know.
  • Open Source Contributor: Open source contributors contribute to the development of existing open source projects by submitting bug fixes, providing support through forums, writing documentation, and more.
  • Data Analyst: Data analysts leverage the power of open source linguistics software to analyze large datasets related to language usage trends, sentiment analysis, text mining, etc.
  • Speech Processing Expert: Speech processing experts rely on open source technology from speech recognition algorithms to natural language processing tools in order to build innovative applications for voice interaction or automatic transcription of audio/video content.

How Much Does Open Source Linguistics Software Cost?

The cost of open source linguistics software varies depending on the type of software and what features or capabilities it offers. Generally, most open source linguistic software is free to download and use, although some may require a small fee to access certain advanced features. For example, the popular Natural Language Toolkit (NLTK) is free to download and use but requires a paid subscription for access to its full suite of services. OpenText's GATE framework is also free, while other leading linguistics platforms like Apache OpenNLP and CoreNLP require payment for their more robust versions. Additionally, many individual linguistics libraries are available for free as open source projects that can be used in conjunction with existing tools or integrated into larger custom-built applications. Ultimately, if you do not need any additional features beyond basic language processing capabilities then you should be able to find an adequate solution without having to spend anything at all.

What Software Can Integrate With Open Source Linguistics Software?

Open source linguistics software can integrate with various other types of software to produce a robust suite of data analysis, modelling, and visualization capabilities. These include natural language processing (NLP) packages such as spaCy, text editors like Notepad++, Python programming libraries like NLTK and gensim, web frameworks such as Flask and Django, databases including MySQL or MongoDB, and machine learning frameworks like scikit-learn. Open source linguistics software can also be integrated with open source intelligence solutions such as I2 Analysts Notebook and Thomson Reuters Data Fusion. By combining open source linguistics software with other types of software, complex systems for text analysis can be created to allow for more accurate linguistic research results.

Open Source Linguistics Software Trends

  1. Increased Accessibility: Open source linguistics software has become increasingly accessible for users, allowing them to download and use the software on their own devices without having to purchase costly licenses. This has increased its popularity among language learners and linguists alike.
  2. More Options: Open source software offers more options than traditional commercial software. It often includes a wide range of features and tools that can be customized by users, allowing them to tailor it to their specific needs.
  3. Better Quality: Open source software is typically of higher quality than commercial software, as it is built by a community of users who are constantly testing and refining the code. This ensures that it is reliable and up-to-date with the latest linguistic research.
  4. Flexibility: Open source software is highly flexible, allowing users to modify the code to meet their needs. This makes it ideal for research projects, as well as for teaching applications in classrooms and online learning environments.
  5. Cost Savings: Since open source software is free, there are significant cost savings associated with using it over commercial software. This makes it attractive for budget-conscious language learners and linguists who may not be able to afford expensive licenses or upgrades.

How To Get Started With Open Source Linguistics Software

  1. Getting started with open source linguistics software is relatively easy and can be done in just a few steps. First, users will need to find the right software for their tasks. There are many different kinds of open source linguistics software available, so users should take some time to review the various options and determine which one best fits their needs.
  2. Once the right program has been selected, users will need to download it onto their computer. This process is usually very straightforward and doesn’t require any technical expertise. After that’s complete, users can begin familiarizing themselves with how the software works by reading through its documentation or user guides. Most programs have online support forums or tutorials that provide a thorough overview of how to use the software as well.
  3. Next, users can start experimenting with the program themselves by trying out its various features and testing out linguistic concepts within it. Doing this allows them to get a better feel for how everything works together and makes them more comfortable when using it in actual research projects or experiments later on down the road. As they become more familiar with their chosen open source linguistics software package, they may even discover new ways of utilizing it that they weren't aware were possible initially.
  4. Last but not least, linguists should always make sure they keep up-to-date on any new updates released for their chosen program since this ensures they're getting access to all of its most recent features and bug fixes that allow it work correctly at all times. Keeping up with these updates also helps maintain data safety since old versions of certain programs may no longer be secure due to changes in internet technology over time.