Showing 7 open source projects for "corpus"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 335 This Week
    Last Update:
    See Project
  • 2
    jieba

    jieba

    Stuttering Chinese word segmentation

    "Jaba" Chinese word segmentation, do the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode, which tries to cut the sentence most precisely, suitable for text analysis. Full mode, scans all the words that can be formed into words in the sentence, the speed is very fast, but the ambiguity cannot be resolved. The search engine mode, on the basis of the precise mode, divides the long words again to improve the recall rate, which is suitable...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    QJDicExample

    QJDicExample

    QJDicExample is an English <-> Japanese dictionary.

    QJDicExample is an Japanese to English and English to Japanese dictionary featuring words/names/kanji/sentences search. QJDicExample uses JMdict, JMnedict, Kanjidic2, Radkfilex, KanjiVG, Tanaka Corpus / Tatoeba databases for translations and zinnia recognition library for handwritten kanji recognition. Latest source code: git clone git://git.code.sf.net/p/qjdicexample/code qjdicexample-code
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Epwing2Anki

    Epwing2Anki

    Used to automate creation of Japanese Anki vocabulary cards.

    Epwing2Anki may be used to automatically or semi-automatically create Japanese Anki vocabulary cards based on a provided list of words and one or more of your favorite EPWING dictionaries and/or the included EDICT J-E dictionary and Tatoeba example sentence corpus.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    nardino1513

    nardino1513

    Latin texts from 1513: XML edition and XQueries

    A TEI XML edition of a small corpus of Latin texts written in Dalmatian city of Zadar (today in Croatia, then under Venice rule) by two humanist teachers. The edition is accompanied by XQueries designed to discover interesting features of the corpus. Queries can be reproduced with an XML database (BaseX, eXist).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    zkanji - Japanese Language Study Suite

    zkanji - Japanese Language Study Suite

    Japanese vocabulary and kanji study tool with built in dictionary

    zkanji is a feature rich Japanese language study suite and dictionary for Windows. It has several kanji look-up methods, optional example sentences for many Japanese words, vocabulary printing, JLPT levels indicated for words and kanji for all N levels, spaced-repetition system for studying and more. Visit http://zkanji.sourceforge.net for details
    Leader badge
    Downloads: 62 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next