Showing 70 open source projects for "two"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 1
    CycleGAN and pix2pix in PyTorch

    CycleGAN and pix2pix in PyTorch

    Image-to-Image Translation in PyTorch

    CycleGAN and pix2pix in PyTorch repository is a PyTorch implementation of two influential image-to-image translation frameworks: CycleGAN (for unpaired translation) and pix2pix (for paired translation). This repo gives developers and researchers a convenient, modern (PyTorch-based) platform to train and test these methods — supporting both paired datasets (input to output) and unpaired datasets (domain-to-domain) with minimal changes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OPolyglot

    OPolyglot

    Your Private Offline Translator

    ...The application is powered by the robust "Bergamot" engine, allowing you to perform all translations locally on your device without the need for an internet connection. At the current stage of development, OPolyglot offers two key features: Simple Translator: A convenient tool for classic translation of typed or copied text. Screen Translator: A smart feature that allows you to take a screenshot of any screen area, automatically recognize the text (OCR), and instantly translate it. AppImage: https://sourceforge.net/projects/opolyglot/files/v0.2.2/OPolyglot-x86_64.AppImage/download snap: https://snapcraft.io/opolyglot flatpak: https://sourceforge.net/projects/opolyglot/files/v0.2.2/opolyglot-x86_64.flatpak/download windows: https://sourceforge.net/projects/opolyglot/files/v0.2.2/opsetup.exe/download source code: https://github.com/fenenko/OPolyglot
    Leader badge
    Downloads: 343 This Week
    Last Update:
    See Project
  • 3

    Comparing String

    Practicing C's built-in string.h functions

    This programme simpl takes two string (Like two tweets) and compare them and adds a hashtag to end of them if user desired to. Very simple used built-in C string functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    WordCount

    WordCount

    Count frequency of single, 2-word and 3-word clusters in a text

    The program can read a text file and count the occurrences of single words and clusters of 2 and 3 words. The resulting list will be sorted in descending order (highest frequency on top).
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5

    Tokenized Text Aligner

    Aligns tokens in two versions of a text with differing tokenization.

    This tool performs token-by-token alignment of two versions of a text with differing tokenization by interpreting the results of a file diff (https://docs.python.org/3/library/difflib.html). It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DSL-KeyPad

    DSL-KeyPad

    Multilingual input tool. Latin, Cyrillic, IPA, Math, historic, etc.

    “DSL KeyPad” is a utility written on AutoHotkey 2.0, designed for inputting a wide range of characters using hotkeys and auxiliary functions. Its primary focus is on enhancing input capabilities for Latin and Cyrillic scripts, allowing typing in multiple languages without the need for separate keyboard layouts for each language. Requires common QWERTY (English US)/ЙЦУКЕН (Russian) keyboard layouts. More than 6,300 Unicode characters are available. Additionaly, it supports typing on the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    PronunDict

    PronunDict

    a pronunciation dictionary of American English

    ...It runs on Windows, and should also work with Wine on Linux and macOS. NEW PronunDict for French project page! https://sourceforge.net/projects/pronundict-french/ Acknowledgement: This app uses two external dictionaries (bundled with it): 1. AmEPD -- the American English Pronunciation Dictionary by Reece H. Dunn, (https://github.com/rhdunn/amepd) 2. CMUdict -- the Carnegie Mellon University Pronouncing Dictionary, (http://www.speech.cs.cmu.edu/cgi-bin/cmudict) Warning: Unfortunately, this dictionary is not flawless. There are some errors in the dictionary entries. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Unitag is a language-independent Unicode-based part-of-speech tagging system. Written entirely in ANSI-compatible C, it should (in theory) compile on any OS, but has been tested on 32-bit Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    JoBimText GPL

    Projects with GPL licensed resources

    This project contains project which are dependent on other projects/libraries which are under the GPL licence
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    The goal of this project is to investigate optimal ways to do genre classification for the ten indigenous South African languages. Funded by Dept of Arts and Culture of the SA Government. http://www.trifonius.co.za/projects/genre-classification
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Linguistic Analyzer

    The Linguistic Analyzer is a tool for corpus analysis and comparison

    The Linguistic Analyzer (Almuhalil Alloghawy) is a free tool designed by a team from Al-Imam Muhammad bin Saud islamic university that can be used for corpus analysis and comparison in terms of the several linguistic characteristics, such as frequency lists generation, concordances, collocation extraction, the difference between two words, and keyword identification.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12
    yabasta

    yabasta

    Yet Another BAsic Scraper and Text Analysis

    YA BASTA! is a Python/R application for Lyrics Web Scraper and Text Analysis. Web scraping is developed in Python, text analysis in R as Python subprocesses. YA BASTA! is only tested on windows OS. To run YA BASTA! just type on window command prompt: python.exe yabasta.py
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    PC-PATR in XML

    Converting PC-PATR grammars to XML

    PC-PATR grammar files are plain text files. This project seeks to define an XML form for these grammar files and convert legacy grammar files to the XML form.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    ...Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. ...
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    concordia

    concordia

    Powerful search library, best suited for computer-aided translation

    ...Concordance searcher - tool for translators who need their translations to "agree" with one standard. Concordia is a C++ library for fast text lookup in large corpora. It uses a RAM stored index, which takes up approximately 600MB of memory for a corpus of 2 million sentences. It is based on the idea of a suffix array, enhanced by the presence of other auxiliary data structures. The effects are stunning - Concordia is able to do simple substring lookup at the pace of 5000 queries per second (on personal PC) - a speed which can not be achieved by any other search library. Moreover, Concordia can perform its own "concordia search". ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Ghawwas_V4

    An open source system for Arabic corpora processing

    ...Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character encoding g. Accept TXT, DOC, DOCX, RTF and HTML formats h. Export the processing results in CSV file format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    JavaScriptSesh

    Javascript library for egyptian hieroglyphs (alpha code)

    JavaScriptSesh is a library allowing web site creators to easily display hieroglyphic texts, independently of the server-side language they use. JavaScriptSesh is very alpha code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PangInput

    PangInput

    A simple tool for typing characters in different writing systems.

    PangInput is a simple application to help you in typing characters from different languages in unicode. Three methods are available: 1) a virtual keyboard, mapping specific characters to each key on your keyboard; 2) custom character sets, which you can select by clicking on them; 3) macro sets, allowing input of complex scripts - basically mapping a latin transcription to the actual writing of characters or words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Corpus Toolkit

    Corpus Toolkit

    A text management tool for linguistic purposes...

    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Welsh Natural Language Toolkit
    ...The project also includes CYMRIE an adapted version for Welsh of the GATE - ANNIE Named Entity Recognition (NER) application for a range of entities such as Persons, Organisations, Locations, and date and time expressions. Version 2.x The CYMRIE pipeline is accessible via a API, standalone GUI and CLI. The CymrIE pipeline has also been adapted for Twitter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    PADIC

    A multilingual Parallel Arabic DIalectal Corpus

    PADIC (Parallel Arabic DIalectal Corpus) is a multi-dialectal corpus built in the framework of the National Research Project "TORJMAN", led by Scientific and Technical Research Center for the Development of Arabic Language and funded by the Algerian Ministry of Higher Education and Scientific Research. PADIC is composed of 6 dialects: two Algerian dialects (Algiers and Annaba cities), Palestinian, Syrian, Tunisian, Moroccan) and MSA. Mourad Abbas Computational Linguistics Department, crstdla https://sites.google.com/site/mouradabbas9 Publications ----------------- K. Meftouh, S. Harrat, S. Jamoussi, M. Abbas, K. Smaïli, Machine Translation Experiments on PADIC: A Parallel Arabic DIalect Corpus, The 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015, Shanghai, 2015. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    ...SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering, 19(2), 259-284. Rasooli, M. S., Kahefi, O., & Minaei-Bidgoli, B. (2011). Effect of adaptive spell checking in Persian. In NLP-KE Contributors: Omid Kashefi Azadeh Zamanifar Masoumeh Mashaiekhi Meisam Pourafzal Reza Refaei Mohammad Hedayati Kamiar Kanani Mehrdad Senobari Sina Iravanin Mohammad Sadegh Rasooli Mohsen Hoseinalizadeh Mitra Nasri Alireza Dehlaghi Fatemeh Ahmadi Neda PourMorteza
    Leader badge
    Downloads: 368 This Week
    Last Update:
    See Project
  • 24

    Drug Extraction

    Drug name extraction

    Drug name recognition and normalisation/grounding to DrugBank ids and standard names. Package provides 2 taggers: 1. DrugTagger - CRF-based with DrugBank presence feature (see feature set for details). 2. DrugnameGazetteer - gazetteer/dictionary-based. Dictionary created from DrugBank.ca database. Both taggers include grounding/normalisation to DrugBank ids and standard names. Feature set: Word, Word-1, Word+1, Word-1_Word, Word_Word+1, DrugBankPresence, POS DrugBankPresence feature indicates the presence of the drug name in the DrugBank. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Auth0 Logo