Showing 8 open source projects for "mecab-ipadic"

  • 1
    SentencePiece

    Unsupervised text tokenizer for Neural Network-based text generation

    ...SentencePiece allows us to make a purely end-to-end system that does not depend on language-specific pre/postprocessing. Purely data-driven, SentencePiece trains tokenization and detokenization models from sentences. Pre-tokenization (Moses tokenizer/MeCab/KyTea) is not always required. SentencePiece treats sentences simply as sequences of Unicode characters, so there is no language-dependent logic.
    Downloads: 7 This Week
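
    The "sequences of Unicode characters" idea above can be illustrated with a minimal sketch. This is not SentencePiece's API and involves no trained model; the helper names `to_symbols` and `from_symbols` are hypothetical. It only shows that once whitespace is marked with an ordinary symbol ("▁", U+2581, the meta symbol SentencePiece uses), the same code handles English and Japanese and the original text can be restored without language rules. It assumes the input does not already contain "▁".

```python
# Illustrative sketch only: NOT the SentencePiece library or its models.
WS = "\u2581"  # "▁", used here to mark a space as a normal symbol


def to_symbols(text: str) -> list[str]:
    """Split raw text into Unicode characters, replacing each space with WS."""
    return [WS if ch == " " else ch for ch in text]


def from_symbols(symbols: list[str]) -> str:
    """Invert to_symbols: join the characters and turn WS back into spaces."""
    return "".join(symbols).replace(WS, " ")


# The same two functions work for both languages; no pre-tokenizer is involved.
for sample in ("Hello world.", "こんにちは 世界。"):
    assert from_symbols(to_symbols(sample)) == sample
```

    A real model would then merge these characters into learned subword units; that training step is outside the scope of this sketch.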
  • 2
    KoNLPy

    Python package for Korean natural language processing

    KoNLPy is a natural language processing (NLP) library for the Korean language, offering tokenization, morphological analysis, and named entity recognition.
    Downloads: 0 This Week
  • 3

    Alissa.UniDic-CWJ.binary

    An unofficial NuGet binary package of UniDic-CWJ

    Alissa.UniDic-CWJ.binary is a NuGet package that provides a binary dictionary of "UniDic-CWJ, the analysis dictionary for the morphological analyzer MeCab" (解析用現代書き言葉 UniDic). UniDic-CWJ is an analysis dictionary for Japanese, for use with the morphological analyzer MeCab, published by the UniDic Consortium. This is Alissa Sabre's unofficial packaging, intended to help you set up the dictionary for use in your app. More details on this package are available on GitHub: https://github.com/AlissaSabre/Alissa.UniDic-CWJ.binary
    Downloads: 0 This Week
  • 4
    Budou

    Budou is an auto organizer tool for beautiful line breaking in CJK

    ...These spans can be styled with CSS to ensure smooth, visually coherent line breaks without splitting words or phrases. The tool supports multiple segmentation backends, including Google Cloud Natural Language API, MeCab, and TinySegmenter, enabling flexibility for both cloud-based and offline processing. Budou can be used via command line, in Python scripts, or integrated into web applications, and it provides advanced options such as caching and entity recognition for improved segmentation accuracy.
    Downloads: 0 This Week
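
    The span-wrapping step described above can be sketched as follows. This is not Budou's API: the function name `wrap_chunks`, the class name `chunk`, and the hard-coded chunk list (standing in for the output of a segmentation backend) are all assumptions made for illustration.

```python
# Illustrative sketch only: NOT Budou's API. The chunk list below is a
# hypothetical stand-in for the output of a segmentation backend.
from html import escape


def wrap_chunks(chunks: list[str], css_class: str = "chunk") -> str:
    """Wrap each chunk in a <span> so CSS can keep it on a single line."""
    return "".join(
        f'<span class="{css_class}">{escape(chunk)}</span>' for chunk in chunks
    )


markup = wrap_chunks(["今日は", "天気です。"])
print(markup)
# <span class="chunk">今日は</span><span class="chunk">天気です。</span>
```

    Under these assumptions, a stylesheet rule such as `.chunk { display: inline-block; }` would let the browser break lines between spans but not inside one.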
  • 5
    IDEA (Text Data Visualizer)

    Text Data Visualizer with Django

    It is hard for non-developers to visualize data, but IDEA makes it easy. To test Project: IDEA locally in your environment, you need mecab-ko and mecab-ko-dic. If you have data you want to visualize, just put it into IDEA and click the Visualization button!
    Downloads: 0 This Week
  • 6
    AJ-JpnRa Tool

    「AJ-JpnRa Tool」 is a Japanese text readability analysis program.

    We have temporarily suspended the release of the program due to a patent application (2020.09). AJ-JpnRa Tool is a Japanese text readability analysis program organized mainly around the JLPT guidelines. It analyzes the readability of Japanese text from the length of the text and the level of its Chinese characters. The Chinese character level is determined from a database (AJ-JpnRa Tool) built according to the essential Chinese character education guideline of Japan elementary...
    Downloads: 0 This Week
  • 7
    A collection of full-text parser plugins for MySQL (5.1 and later). The collection provides bigram, mecab, space, snowball, and suffix parsers. For Chinese or Japanese text, the bigram plugin might be useful.
    Downloads: 0 This Week
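
    To show why a bigram parser suits Chinese or Japanese, here is a minimal sketch of character-bigram tokenization. It is not the plugin's implementation; the function name `bigrams` is hypothetical. Because the text is cut into overlapping two-character terms, no dictionary or word segmenter is needed.

```python
# Illustrative sketch only: NOT the MySQL plugin's code.
def bigrams(text: str) -> list[str]:
    """Return overlapping two-character terms for each whitespace-separated run."""
    terms: list[str] = []
    for run in text.split():
        if len(run) == 1:
            terms.append(run)  # a lone character is kept as its own term
        else:
            terms.extend(run[i:i + 2] for i in range(len(run) - 1))
    return terms


print(bigrams("東京都庁"))
# ['東京', '京都', '都庁']
```

    The output also shows the usual trade-off: "京都" (Kyoto) becomes an index term even though the text is about Tokyo, so bigram indexing favors recall over precision.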
  • 8
    MeCab is a fast and customizable Japanese morphological analyzer. It is designed for general-purpose use and is applied to a variety of NLP tasks, such as Kana-Kanji conversion. MeCab provides parameter estimation functionality based on CRFs and HMMs.
    Downloads: 2,827 This Week