Search Results for "chinese word segmentation"

Showing 107 open source projects for "chinese word segmentation"

View related business solutions
  • Gain insights and build data-powered applications Icon
    Gain insights and build data-powered applications

    Your unified business intelligence platform. Self-service. Governed. Embedded.

    Chat with your business data with Looker. More than just a modern business intelligence platform, you can turn to Looker for self-service or governed BI, build your own custom applications with trusted metrics, or even bring Looker modeling to your existing BI environment.
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
  • 1
    IK Analysis for Elasticsearch

    IK Analysis for Elasticsearch

    A plugin that integrates Lucene IK analyzer into elasticsearch

    IK Analyzer is an open source, lightweight Chinese word segmentation toolkit developed based on java language. Since the release of version 1.0 in December 2006, IKAnalyzer has launched 4 major versions. Initially, it was a Chinese word segmentation component based on the open source project Luence as the main application, combined with dictionary word segmentation and grammar analysis algorithms. Starting from version 3.0, IK has developed into a common word segmentation component for Java...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Word segmentation utility for Thai language written in C
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    gse

    gse

    Go efficient multilingual NLP and text segmentation

    Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    HanLP

    HanLP

    Han Language Processing

    HanLP is a multilingual Natural Language Processing (NLP) library composed of a series of models and algorithms. Built on TensorFlow 2.0, it was designed to advance state-of-the-art deep learning techniques and popularize the application of natural language processing in both academia and industry. HanLP is capable of lexical analysis (Chinese word segmentation, part-of-speech tagging, named entity recognition), syntax analysis, text classification, and sentiment analysis. It comes...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Case Management Software for Social Services Icon
    Case Management Software for Social Services

    For human services organizations looking for case management software

    Collaborate is customizable case management software for non-profits and social services agencies with teams of 5+ staff.
  • 5
    Saladict

    Saladict

    Pop-up dictionary and page translator with multiple search modes

    All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching. Saladict supports mixed use of multiple search modes. Whether it be single-click, double-click, pop-up icon, hovering or keyboard shortcuts, you name it. The combination of dictionaries is dynamically adjusted according to search text and Saladict profile. Although Saladict is built for looking up English words, it also supports...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    TextGen

    TextGen

    textgen, Text Generation models

    ... with synonyms, random word insertion, deletion, replacement, etc. method, generating new text and implementing text augmentation This project realizes the back translation function based on Baidu translation API, first translate Chinese sentences into English, and then translate English into new Chinese. This project implements the training and prediction of Seq2Seq, ConvSeq2Seq, and BART models based on PyTorch, which can be used for text generation tasks such as text translation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ToolGood.Words

    ToolGood.Words

    A high-performance sensitive word

    A high-performance sensitive word (illegal word/dirty word) detection and filtering component, with a traditional and simplified exchange, supports full-width half-width exchange, Chinese characters to pinyin, fuzzy search, and other functions. C#Language, using StringSearchEx2.Replacefiltering, on a 48k sensitive thesaurus at over 300 million characters per second. (cpu i7 8750h). C#'s own regular is very slow, which StringSearchEx2.ContainsAnyis Regex.IsMatchmore than 88,000 times...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Fanyi

    Fanyi

    A 🇨🇳 and 🇺🇸 translate tool in your command line

    Fanyi is a tool for translating words between the Chinese and English languages, right in your command line. It’s a good supportive tool for learning and reading the Chinese language from English, or the other way around. All translation data is fetched from iciba.com and fanyi.youdao.com, and with each translation comprehensive and related samples are given for better understanding and proper usage. There are translations for words as well as sentences, and in Mac/Linux bash, words can even...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cyber Risk Assessment and Management Platform Icon
    Cyber Risk Assessment and Management Platform

    ConnectWise Identify is a powerful cybersecurity risk assessment platform offering strategic cybersecurity assessments and recommendations.

    When it comes to cybersecurity, what your clients don’t know can really hurt them. And believe it or not, keep them safe starts with asking questions. With ConnectWise Identify Assessment, get access to risk assessment backed by the NIST Cybersecurity Framework to uncover risks across your client’s entire business, not just their networks. With a clearly defined, easy-to-read risk report in hand, you can start having meaningful security conversations that can get you on the path of keeping your clients protected from every angle. Choose from two assessment levels to cover every client’s need, from the Essentials to cover the basics to our Comprehensive Assessment to dive deeper to uncover additional risks. Our intuitive heat map shows you your client’s overall risk level and priority to address risks based on probability and financial impact. Each report includes remediation recommendations to help you create a revenue-generating action plan.
  • 10
    ChatGPT Academic

    ChatGPT Academic

    ChatGPT extension for scientific research work

    ChatGPT extension for scientific research work, specially optimized academic paper polishing experience, supports custom shortcut buttons, supports custom function plug-ins, supports markdown table display, double display of Tex formulas, complete code display function, new local Python/C++/Go project tree Analysis function/Project source code self-translation ability, newly added PDF and Word document batch summary function/PDF paper full-text translation function. All buttons are dynamically...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    spaCy models

    spaCy models

    Models for the spaCy Natural Language Processing (NLP) library

    spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    BaikalDB

    BaikalDB

    BaikalDB, A Distributed HTAP Database

    ...) and sort operators (corresponding with SQL ORDER BY), with which users can fulfill their complex and time-critical analytical and transactional requirement by writing SQL statements. In a typical scenario, hundreds of millions of rows can be scanned and aggregated in few seconds. BaikalDB also supports full-text search by building inverted indices after word segmentation. Users can harness the fuzzy search features simply by adding a FULLTEXT KEY type index when creating tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    The Ethers Project

    The Ethers Project

    Complete Ethereum library and wallet implementation in JavaScript

    A complete Ethereum wallet implementation and utilities in JavaScript (and TypeScript). Keep your private keys in your client, safe and sound. Import and export JSON wallets (Geth, Parity and crowdsale) Import and export BIP 39 mnemonic phrases (12 word backup phrases) and HD Wallets (English as well as Czech, French, Italian, Japanese, Korean, Simplified Chinese, Spanish, Traditional Chinese. Meta-classes create JavaScript objects from any contract ABI, including ABIv2 and Human-Readable ABI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PaddleNLP

    PaddleNLP

    Easy-to-use and powerful NLP library with Awesome model zoo

    ... And process-wide text area API: Support for the loading of rich Chinese data sets Dataset API, can flexibly and efficiently complete data pretreatment Data API, Preset 60 + pre-training word vector Embedding API, Providing 100 + pre-training model Transformer API Wait, the efficiency of NLP task modeling can be greatly improved.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    mPDF

    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts e.g. HTML2FPDF and produces larger files when using Unicode fonts, but support for CSS styles etc. and has been much enhanced. Supports almost all languages including RTL (Arabic and Hebrew), and CJK (Chinese-Japanese-Korean). Nested block-level elements (e.g. P...
    Downloads: 105 This Week
    Last Update:
    See Project
  • 16
    Apache OpenOffice

    Apache OpenOffice

    The free and Open Source productivity suite

    Free alternative for Office productivity tools: Apache OpenOffice - formerly known as OpenOffice.org - is an open-source office productivity software suite containing word processor, spreadsheet, presentation, graphics, formula editor, and database management applications. OpenOffice is available in many languages, works on all common computers, stores data in ODF - the international open standard format - and is able to read and write files in other formats, included the format used...
    Leader badge
    Downloads: 219,048 This Week
    Last Update:
    See Project
  • 17
    WinMerge

    WinMerge

    Windows visual diff and merge for files and directories

    WinMerge is a Windows tool for visual difference display and merging, for both files and directories. It is highly useful for determining what has changed between file versions, and then merging those changes. WinMerge has Unicode support, Flexible syntax coloring editor, Visual SourceSafe integration, and Windows Shell integration. Regexp filtering for filenames and lines. Side-by-side line difference and highlights differences inside lines. A file map shows the overall file differences in...
    Leader badge
    Downloads: 31,946 This Week
    Last Update:
    See Project
  • 18
    AnyTXT Searcher

    AnyTXT Searcher

    A Powerful Desktop Full-Text Search Engine, Just Like Local Google.

    .... You can quickly find any text in any file on your disk by Anytxt almost in 0.1 second. It works on Windows 11,10, 8, 7, Vista, XP, 2008, 2012, 2016,2022... AnyTXT Searcher supports the following file formats: Plain text (txt, cpp, py, html, etc.) Microsoft OneNote (one) Microsoft Word (doc, docx) Microsoft Excel (xls, xlsx) Microsoft PowerPoint (ppt, pptx) PDF WPS Office (wps, et, dps) EBook (epub, mobi, azw3, fb2 etc.) Mind Map Format (lighten, mmap, mm, xmind etc.) OFD .....
    Leader badge
    Downloads: 2,244 This Week
    Last Update:
    See Project
  • 19
    jEdit

    jEdit

    jEdit is a programmer's text editor written in Java.

    jEdit is a programmer's text editor written in Java. It uses the Swing toolkit for the GUI and can be configured as a rather powerful IDE through the use of its plugin architecture.
    Leader badge
    Downloads: 1,029 This Week
    Last Update:
    See Project
  • 20
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,514 This Week
    Last Update:
    See Project
  • 21
    LibreOffice

    LibreOffice

    A free and powerful office suite

    ... reports, marketing presentations, technical drawings and diagrams. LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. But LibreOffice goes much further with its native support for a modern and open standard (OpenDocument Format).
    Leader badge
    Downloads: 668 This Week
    Last Update:
    See Project
  • 22
    phpList

    phpList

    Powerful Open Source Email Marketing app with analytics & segmentation

    phpList delivers Open Source email marketing, including analytics, list segmentation, content personalisaton and bounce processing. Extensive technical features and a secure and stable codebase are the result of over 17 years of continuous development. Used in 95 countries, available in 20+ languages, and used to send 25 billion email campaigns last year. Deploy it with your own SMTP server, or get a free hosted account at http://phplist.com.
    Leader badge
    Downloads: 315 This Week
    Last Update:
    See Project
  • 23
    Ant Renamer

    Ant Renamer

    Free program that eases the renaming of lots of files and folders

    This program can rename large amounts of files and folders in few clicks. It only modifies files/folders names. Changing extension, Replacing character strings by others, Inserting a character string, Moving characters, Deleting characters, Enumeration, Name creation with mp3's Tag (ID v1.1), Name creation with file's last modified date and time, Random names creation, Case change (uppercase, lowercase, first letter of each word in uppercase), Take names from a list/file, Use of EXIF info...
    Leader badge
    Downloads: 255 This Week
    Last Update:
    See Project
  • 24
    ClipAngel

    ClipAngel

    Clipboard history capture and paste tool

    This program belongs to class "Clipboard Manager". It captures many clipboard objects and allows user to select and paste one in any program. Inspired by analogs: ClipDiary, Ditto, AceText. Main priorities: usability, compatibility, reliability. Requires .Net Framework 4.52+ (Windows Vista+/Server 2008+)
    Leader badge
    Downloads: 148 This Week
    Last Update:
    See Project
  • 25
    Madedit-Mod

    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross platform text/hex editor base on MadEdit with a log of critical bug fix from me or other developers. A lot of new features were added, such as Drag-Drop Edit(cross platform), Highlight word, etc. The reason that I maintained this project is that the author of MadEdit had not worked on it for for a long time and I really like it and need more features. Find more information on Wiki pages. Currently supported Languages: English Chinese Simplified (zh_CN...
    Leader badge
    Downloads: 95 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next