Showing 19 open source projects for "corpus"

View related business solutions
  • Red Hat Ansible Automation Platform on Microsoft Azure Icon
    Red Hat Ansible Automation Platform on Microsoft Azure

    Red Hat Ansible Automation Platform on Azure allows you to quickly deploy, automate, and manage resources securely and at scale.

    Deploy Red Hat Ansible Automation Platform on Microsoft Azure for a strategic automation solution that allows you to orchestrate, govern and operationalize your Azure environment.
  • SKUDONET Open Source Load Balancer Icon
    SKUDONET Open Source Load Balancer

    Take advantage of Open Source Load Balancer to elevate your business security and IT infrastructure with a custom ADC Solution.

    SKUDONET ADC, operates at the application layer, efficiently distributing network load and application load across multiple servers. This not only enhances the performance of your application but also ensures that your web servers can handle more traffic seamlessly.
  • 1
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 78 This Week
    Last Update:
    See Project
  • 2

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    QJDicExample

    QJDicExample

    QJDicExample is an English <-> Japanese dictionary.

    QJDicExample is an Japanese to English and English to Japanese dictionary featuring words/names/kanji/sentences search. QJDicExample uses JMdict, JMnedict, Kanjidic2, Radkfilex, KanjiVG, Tanaka Corpus / Tatoeba databases for translations and zinnia recognition library for handwritten kanji recognition. Latest source code: git clone git://git.code.sf.net/p/qjdicexample/code qjdicexample-code
    Downloads: 0 This Week
    Last Update:
    See Project
  • Discover Multiview ERP: The Financial Management Revolution Icon
    Discover Multiview ERP: The Financial Management Revolution

    Reclaim precious moments with loved ones while our robust cloud accounting software streamlines your financial processes.

    Built for growing businesses and well-established enterprises alike, Multiview is a highly scalable and robust ERP.
  • 5
    kcws

    kcws

    Deep Learning Chinese Word Segment

    Deep learning chinese word segment. Install the bazel code construction tool and install tensorflow (currently this project requires tf 1.0.0alpha version or above) Switch to the code directory of this project and run ./configure. Compile background service. Pay attention to the public account of waiting for words and reply to kcws to get the corpus download address. Extract the corpus to a directory. Change to the code directory.After installing tensorflow, switch to the kcws code directory...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The Alpheios project is developing tools to facilitate self-directed, corpus-based language learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Epwing2Anki

    Epwing2Anki

    Used to automate creation of Japanese Anki vocabulary cards.

    Epwing2Anki may be used to automatically or semi-automatically create Japanese Anki vocabulary cards based on a provided list of words and one or more of your favorite EPWING dictionaries and/or the included EDICT J-E dictionary and Tatoeba example sentence corpus.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 8

    optimize_topics.sh

    Run multiple MALLET runs and report on search term prevalence.

    Run multiple MALLET runs over a pre-existing corpus and report on search term prevalence in each run.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Downloads: 5 This Week
    Last Update:
    See Project
  • Easy management of simple and complex projects Icon
    Easy management of simple and complex projects

    We help different businesses become digital, manage projects, teams, communicate effectively and control tasks online.

    Plan more projects with Worksection. Use Gantt chart and Kanban boards to organize your projects, get your team onboard and assign tasks and due dates.
  • 10
    TextTools
    TextTools is a freeware corpus linguistics tool developed in Python to aid in research. This program analyzes user-created corpora and displays information about word (token) frequency, n-grams, clusters, collocations, keyword in context (KWIC), and keyness. TextTools is designed to be user-friendly and intuitive and will run natively on Mac OS X.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    nardino1513

    nardino1513

    Latin texts from 1513: XML edition and XQueries

    A TEI XML edition of a small corpus of Latin texts written in Dalmatian city of Zadar (today in Croatia, then under Venice rule) by two humanist teachers. The edition is accompanied by XQueries designed to discover interesting features of the corpus. Queries can be reproduced with an XML database (BaseX, eXist).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Various tools for creating annotated parallel corpora including pre-trained tagging and parsing models for various languages, sentence alignment tools and word alignment tools. Uplug also includes a web-based interface for interactive sentence and word alignment and scripts for indexing and querying parallel corpora using the Corpus Work Bench CWB. Download 'uplug-main' first and then add other packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    zkanji - Japanese Language Study Suite

    zkanji - Japanese Language Study Suite

    Japanese vocabulary and kanji study tool with built in dictionary

    zkanji is a feature rich Japanese language study suite and dictionary for Windows. It has several kanji look-up methods, optional example sentences for many Japanese words, vocabulary printing, JLPT levels indicated for words and kanji for all N levels, spaced-repetition system for studying and more. Visit http://zkanji.sourceforge.net for details
    Leader badge
    Downloads: 75 This Week
    Last Update:
    See Project
  • 14
    An open-source corpus analysis class library written in C#. GUI of Tenka Text 0.1.3 comes with Wordlister - an advanced, extremely fast graphical wordlist tool and a simple regex concordance tool. Tenka Text - the open-source answer to WordSmith Tool
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    This proyect presents a system, which, from a corpus of documents, extracts information about a theme area, and a pedagogical components collection. This information is packed into fine granularity learning objects (metadata included).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    TaCo is a tasty Palm application that enables you to use the Tanaka Corpus on your handheld. The Tanaka Corpus is a collection of Japanese/English sentence pairs that a student of Japanese language can use as a source of example sentences.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    BabyTALK is to add another brick in the wall of natural languages learning. The baby needs to structure a corpus of texts when his tutor points and talks about a particular part of the corpus. The baby is also to describe any selected part of the corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A fast way to rate the reading challenging level of book or text. Unlike well known reading metrics such as Fog, Kincaid, SMOG, ARI, Flesch, and Coleman-Liau readability this metric takes into account far more factors and is standarized against a corpus
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CoPT, Corpus Processing Tools, is a set of java classes intended to assist field linguists, NLP researchers and developers, students and software developers in all corpus-related processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next