Showing 8 open source projects for "mecab"

  • 1
    SentencePiece

    Unsupervised text tokenizer for Neural Network-based text generation

    ...SentencePiece allows us to make a purely end-to-end system that does not depend on language-specific pre/postprocessing. Purely data-driven, SentencePiece trains tokenization and detokenization models from sentences. Pre-tokenization (Moses tokenizer/MeCab/KyTea) is not always required. SentencePiece treats sentences simply as sequences of Unicode characters. There is no language-dependent logic.
    Downloads: 3 This Week
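
The idea of treating a sentence purely as a sequence of Unicode characters, with no language-dependent logic, can be sketched in a few lines. This is an illustrative toy and not the SentencePiece library: the helper names `encode_chars` and `decode` are hypothetical, and real SentencePiece models learn multi-character pieces from data rather than splitting into single characters. The only detail borrowed from SentencePiece is the "▁" (U+2581) meta symbol it uses to mark whitespace, which is what makes detokenization lossless.

```python
from __future__ import annotations

# Illustrative sketch only: NOT the SentencePiece library or a trained model.
WS = "\u2581"  # "▁", the meta symbol SentencePiece uses to mark whitespace


def encode_chars(text: str) -> list[str]:
    """Split text into single-character pieces, escaping spaces as WS."""
    return list(text.replace(" ", WS))


def decode(pieces: list[str]) -> str:
    """Invert encode_chars: join the pieces and restore the spaces."""
    return "".join(pieces).replace(WS, " ")


# The same code path handles English and Japanese input alike.
for sample in ["Hello world", "すもも も もも"]:
    pieces = encode_chars(sample)
    assert decode(pieces) == sample  # round trip is lossless
    print(pieces)
```

The round trip only holds when the input does not already contain the "▁" character; that limitation belongs to this sketch and says nothing about the real library.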
  • 2
    KoNLPy

    Python package for Korean natural language processing

    KoNLPy is a natural language processing (NLP) library for the Korean language, offering tokenization, morphological analysis, and named entity recognition.
    Downloads: 0 This Week
  • 3

    Alissa.UniDic-CWJ.binary

    An unofficial NuGet binary package of UniDic-CWJ

    Alissa.UniDic-CWJ.binary is a NuGet package that provides a binary dictionary of "UniDic-CWJ, the analysis dictionary for the morphological analyzer MeCab" (解析用現代書き言葉 UniDic). UniDic-CWJ is an analysis dictionary for the Japanese language, for use with the morphological analyzer MeCab, published by the UniDic Consortium. This is Alissa Sabre's unofficial packaging to help you set up the dictionary for use in your app. Read more details about this package on GitHub: https://github.com/AlissaSabre/Alissa.UniDic-CWJ.binary
    Downloads: 0 This Week
  • 4
    Budou

    Budou is an automatic organizer tool for beautiful line breaking in CJK text

    ...These spans can be styled with CSS to ensure smooth, visually coherent line breaks without splitting words or phrases. The tool supports multiple segmentation backends, including Google Cloud Natural Language API, MeCab, and TinySegmenter, enabling flexibility for both cloud-based and offline processing. Budou can be used via command line, in Python scripts, or integrated into web applications, and it provides advanced options such as caching and entity recognition for improved segmentation accuracy.
    Downloads: 0 This Week
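
The span-wrapping step described above can be sketched as follows. This is a minimal illustration and not Budou's actual API or output format: the function `wrap_chunks`, the class name `chunk`, and the pre-segmented input list are all assumptions made for the example. In practice the chunks would come from one of the segmentation backends.

```python
from html import escape


def wrap_chunks(chunks, css_class="chunk"):
    """Wrap each pre-segmented chunk in its own <span>.

    A stylesheet rule such as `.chunk { display: inline-block; }` then
    keeps every chunk on one line, so line breaks can only fall between
    chunks. The class name here is hypothetical.
    """
    return "".join(
        f'<span class="{escape(css_class)}">{escape(chunk)}</span>'
        for chunk in chunks
    )


# Hypothetical segmentation result for a short Japanese sentence.
print(wrap_chunks(["今日は", "天気です。"]))
```

Escaping each chunk with `html.escape` keeps the generated markup valid even when the source text contains characters such as `<` or `&`.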
  • 5
    IDEA (Text Data Visualizer)

    Text Data Visualizer with Django

    Visualizing data is hard for non-developers, but IDEA makes it easy. To test Project: IDEA locally in your own environment, you need mecab-ko and mecab-ko-dic. If you have data you want to visualize, just put it into IDEA, then click the Visualization button!
    Downloads: 0 This Week
  • 6
    AJ-JpnRa Tool

    AJ-JpnRa Tool is a Japanese text readability analysis program.

    Release of the program is temporarily suspended due to a patent application (2020.09). AJ-JpnRa Tool is a Japanese text readability analysis program organized mainly according to the JLPT guidelines. With AJ-JpnRa Tool, you can analyze the readability of a Japanese text from its length and Chinese character level. The Chinese character level is analyzed using the AJ-JpnRa Tool database, which was built according to the essential Chinese character education guideline of Japan elementary...
    Downloads: 0 This Week
  • 7
    A collection of full-text parser plugins for MySQL (5.1 and later). The collection provides bigram, mecab, space, snowball, and suffix parsers. If you want to index Chinese or Japanese text, the bigram plugin might be useful.
    Downloads: 0 This Week
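
Why a bigram parser suits Chinese or Japanese text can be shown with a generic sketch of overlapping character bigrams. The listing does not describe how the plugin itself tokenizes, so this is an assumption about the general technique, and the function name `char_bigrams` is hypothetical. The point is that text without spaces between words can still be indexed, because every two-character window becomes a searchable token.

```python
from __future__ import annotations


def char_bigrams(text: str) -> list[str]:
    """Return the overlapping two-character windows of `text`.

    Strings shorter than two characters are returned as a single token
    (or no token at all when the string is empty).
    """
    if len(text) < 2:
        return [text] if text else []
    return [text[i:i + 2] for i in range(len(text) - 1)]


# No dictionary or word boundaries are needed to produce index terms.
print(char_bigrams("東京都庁"))
```

The trade-off is precision: the windows for "東京都庁" include "京都", so a search for that unrelated word would also match, which is one reason a dictionary-based parser such as the mecab one can be preferable when a dictionary is available.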
  • 8
    MeCab is a fast and customizable Japanese morphological analyzer. MeCab is designed for general-purpose use and has been applied to a variety of NLP tasks, such as Kana-Kanji conversion. MeCab provides parameter estimation functionality based on CRFs and HMMs.
    Downloads: 2,311 This Week
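
To make "morphological analysis" concrete, the sketch below parses text in the layout MeCab prints by default: one morpheme per line, with the surface form, a tab, and comma-separated features, ending with an `EOS` line. The `SAMPLE` string is a hand-written illustration in the IPADIC feature layout and not the captured output of a real run. The number and meaning of the feature fields depend on the dictionary in use, and the helper `parse_mecab_output` is hypothetical.

```python
# Hand-written sample in MeCab's default "surface<TAB>features" layout
# (IPADIC-style features); not the captured output of a real run.
SAMPLE = (
    "すもも\t名詞,一般,*,*,*,*,すもも,スモモ,スモモ\n"
    "も\t助詞,係助詞,*,*,*,*,も,モ,モ\n"
    "もも\t名詞,一般,*,*,*,*,もも,モモ,モモ\n"
    "EOS\n"
)


def parse_mecab_output(output):
    """Return (surface, features) pairs, skipping EOS and blank lines."""
    tokens = []
    for line in output.splitlines():
        if not line or line == "EOS":
            continue
        surface, _, features = line.partition("\t")
        tokens.append((surface, features.split(",")))
    return tokens


for surface, features in parse_mecab_output(SAMPLE):
    # With IPADIC-style features, the first field is the part of speech.
    print(surface, features[0])
```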