Search Results for "chinese word segmentation"

Showing 120 open source projects for "chinese word segmentation"

  • 1
    IK Analysis for Elasticsearch

    A plugin that integrates Lucene IK analyzer into elasticsearch

    IK Analyzer is an open source, lightweight Chinese word segmentation toolkit written in Java. Since the release of version 1.0 in December 2006, IKAnalyzer has gone through 4 major versions. Initially, it was a Chinese word segmentation component built mainly for the open source project Lucene, combining dictionary-based word segmentation with grammar analysis algorithms. Starting from version 3.0, IK has developed into a common word segmentation component for Java...
    Downloads: 0 This Week
  • 2
    Word segmentation utility for Thai language written in C
    Downloads: 0 This Week
  • 3
    gse

    Go efficient multilingual NLP and text segmentation

    Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese and others. Gse is a Go implementation of jieba that aims to add NLP support and more features. Supports multiple word segmentation modes: common, search engine, full, precise, and HMM. Supports user and embedded dictionaries, part-of-speech (POS) tagging, segment info analysis, and stop and trim words. Supports Traditional Chinese. Support HMM cut text use...
    Downloads: 0 This Week
  • 4
    HanLP

    Han Language Processing

    HanLP is a multilingual Natural Language Processing (NLP) library composed of a series of models and algorithms. Built on TensorFlow 2.0, it was designed to advance state-of-the-art deep learning techniques and popularize the application of natural language processing in both academia and industry. HanLP is capable of lexical analysis (Chinese word segmentation, part-of-speech tagging, named entity recognition), syntax analysis, text classification, and sentiment analysis. It comes...
    Downloads: 0 This Week
  • 5
    Saladict

    Pop-up dictionary and page translator with multiple search modes

    All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching. Saladict supports mixed use of multiple search modes. Whether it be single-click, double-click, pop-up icon, hovering or keyboard shortcuts, you name it. The combination of dictionaries is dynamically adjusted according to search text and Saladict profile. Although Saladict is built for looking up English words, it also supports...
    Downloads: 5 This Week
  • 6
    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it has been designed for real-world applications: building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks...
    Downloads: 1 This Week
  • 7
    TextGen

    textgen, Text Generation models

    ... with synonyms, random word insertion, deletion, replacement, and similar methods, generating new text and implementing text augmentation. This project implements back translation based on the Baidu translation API: it first translates Chinese sentences into English, then translates the English into new Chinese. It also implements training and prediction for Seq2Seq, ConvSeq2Seq, and BART models based on PyTorch, which can be used for text generation tasks such as text translation.
    Downloads: 0 This Week
  • 8
    ToolGood.Words

    A high-performance sensitive word detection and filtering component

    A high-performance sensitive word (illegal word/dirty word) detection and filtering component that also supports Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese character to pinyin conversion, fuzzy search, and other functions. Written in C#. Filtering with StringSearchEx2.Replace on a 48k sensitive-word list runs at over 300 million characters per second (CPU: i7 8750H). C#'s built-in regular expressions are very slow: compared with Regex.IsMatch, StringSearchEx2.ContainsAny is more than 88,000 times...
    Downloads: 0 This Week
  • 9
    Fanyi

    A 🇨🇳 and 🇺🇸 translate tool in your command line

    Fanyi is a tool for translating words between the Chinese and English languages, right in your command line. It’s a good supportive tool for learning and reading the Chinese language from English, or the other way around. All translation data is fetched from iciba.com and fanyi.youdao.com, and with each translation comprehensive and related samples are given for better understanding and proper usage. There are translations for words as well as sentences, and in Mac/Linux bash, words can even...
    Downloads: 0 This Week
  • 10
    ChatGPT Academic

    ChatGPT extension for scientific research work

    ChatGPT extension for scientific research work, with a specially optimized academic paper polishing experience. Supports custom shortcut buttons, custom function plug-ins, markdown table display, dual display of TeX formulas, and complete code display. New features include local Python/C++/Go project tree analysis, project source code self-translation, batch summarization of PDF and Word documents, and full-text translation of PDF papers. All buttons are dynamically...
    Downloads: 1 This Week
  • 11
    BaikalDB

    BaikalDB, A Distributed HTAP Database

    ...) and sort operators (corresponding to SQL ORDER BY), with which users can fulfill their complex and time-critical analytical and transactional requirements by writing SQL statements. In a typical scenario, hundreds of millions of rows can be scanned and aggregated in a few seconds. BaikalDB also supports full-text search by building inverted indices after word segmentation. Users can harness the fuzzy search features simply by adding a FULLTEXT KEY type index when creating tables.
    Downloads: 0 This Week
  • 12
    The Ethers Project

    Complete Ethereum library and wallet implementation in JavaScript

    A complete Ethereum wallet implementation and utilities in JavaScript (and TypeScript). Keep your private keys in your client, safe and sound. Import and export JSON wallets (Geth, Parity and crowdsale). Import and export BIP 39 mnemonic phrases (12-word backup phrases) and HD Wallets (English as well as Czech, French, Italian, Japanese, Korean, Simplified Chinese, Spanish, Traditional Chinese). Meta-classes create JavaScript objects from any contract ABI, including ABIv2 and Human-Readable ABI...
    Downloads: 0 This Week
  • 13
    PaddleNLP

    Easy-to-use and powerful NLP library with Awesome model zoo

    ... and full-pipeline text-domain APIs: the Dataset API for loading rich Chinese datasets, the Data API for flexible and efficient data preprocessing, the Embedding API with 60+ preset pretrained word vectors, the Transformer API providing 100+ pretrained models, and more, which can greatly improve the efficiency of NLP task modeling.
    Downloads: 0 This Week
  • 14
    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts (e.g. HTML2FPDF) and produces larger files when using Unicode fonts, but support for CSS styles etc. has been much enhanced. Supports almost all languages including RTL (Arabic and Hebrew), and CJK (Chinese-Japanese-Korean). Nested block-level elements (e.g. P...
    Downloads: 105 This Week
  • 15
    Apache OpenOffice

    The free and Open Source productivity suite

    Free alternative to Office productivity tools: Apache OpenOffice, formerly known as OpenOffice.org, is an open-source office productivity software suite containing word processor, spreadsheet, presentation, graphics, formula editor, and database management applications. OpenOffice is available in many languages, works on all common computers, stores data in ODF (the international open standard format), and is able to read and write files in other formats, including the format used...
    Downloads: 219,048 This Week
  • 16
    WinMerge

    Windows visual diff and merge for files and directories

    WinMerge is a Windows tool for visual difference display and merging, for both files and directories. It is highly useful for determining what has changed between file versions, and then merging those changes. WinMerge has Unicode support, Flexible syntax coloring editor, Visual SourceSafe integration, and Windows Shell integration. Regexp filtering for filenames and lines. Side-by-side line difference and highlights differences inside lines. A file map shows the overall file differences in...
    Downloads: 31,946 This Week
  • 17
    AnyTXT Searcher

    A Powerful Desktop Full-Text Search Engine, Just Like Local Google.

    .... You can quickly find any text in any file on your disk with AnyTXT in about 0.1 second. It works on Windows 11, 10, 8, 7, Vista, XP, 2008, 2012, 2016, 2022... AnyTXT Searcher supports the following file formats: plain text (txt, cpp, py, html, etc.); Microsoft OneNote (one); Microsoft Word (doc, docx); Microsoft Excel (xls, xlsx); Microsoft PowerPoint (ppt, pptx); PDF; WPS Office (wps, et, dps); EBook (epub, mobi, azw3, fb2, etc.); Mind Map Format (lighten, mmap, mm, xmind, etc.); OFD .....
    Downloads: 2,244 This Week
  • 18
    jEdit

    jEdit is a programmer's text editor written in Java.

    jEdit is a programmer's text editor written in Java. It uses the Swing toolkit for the GUI and can be configured as a rather powerful IDE through the use of its plugin architecture.
    Downloads: 1,029 This Week
  • 19
    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Downloads: 1,514 This Week
  • 20
    LibreOffice

    A free and powerful office suite

    ... reports, marketing presentations, technical drawings and diagrams. LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. But LibreOffice goes much further with its native support for a modern and open standard (OpenDocument Format).
    Downloads: 668 This Week
  • 21
    phpList

    Powerful Open Source Email Marketing app with analytics & segmentation

    phpList delivers Open Source email marketing, including analytics, list segmentation, content personalisation and bounce processing. Extensive technical features and a secure and stable codebase are the result of over 17 years of continuous development. Used in 95 countries, available in 20+ languages, and used to send 25 billion email campaigns last year. Deploy it with your own SMTP server, or get a free hosted account at http://phplist.com.
    Downloads: 315 This Week
  • 22
    Ant Renamer

    Free program that eases the renaming of lots of files and folders

    This program can rename large numbers of files and folders in a few clicks. It only modifies file and folder names. Supported actions: changing extension, replacing character strings with others, inserting a character string, moving characters, deleting characters, enumeration, name creation from an mp3's tag (ID v1.1), name creation from a file's last modified date and time, random name creation, case change (uppercase, lowercase, first letter of each word in uppercase), taking names from a list/file, use of EXIF info...
    Downloads: 255 This Week
  • 23
    ClipAngel

    Clipboard history capture and paste tool

    This program is a clipboard manager. It captures many clipboard objects and allows the user to select and paste one into any program. Inspired by similar tools: ClipDiary, Ditto, AceText. Main priorities: usability, compatibility, reliability. Requires .NET Framework 4.52+ (Windows Vista+/Server 2008+).
    Downloads: 148 This Week
  • 24
    Madedit-Mod

    MadEdit-Mod is a cross platform Text/Hex editor based on MadEdit

    Madedit-Mod is a cross-platform text/hex editor based on MadEdit, with a lot of critical bug fixes from me and other developers. A lot of new features were added, such as drag-and-drop editing (cross-platform), word highlighting, etc. I maintain this project because the author of MadEdit had not worked on it for a long time, and I really like it and need more features. Find more information on the Wiki pages. Currently supported languages: English, Chinese Simplified (zh_CN...
    Downloads: 95 This Week
  • 25
    WIKINDX

    Virtual Research Environment / On-line Bibliography Manager

    Reference management, bibliography management, citations and a whole lot more. Designed by academics for academics, under continuous development since 2003, and used by both individuals and major research institutions worldwide, WIKINDX is a Virtual Research Environment (an enhanced on-line bibliography manager) storing searchable references, notes, files, citations, ideas, and more. An integrated WYSIWYG word processor exports formatted articles to RTF and HTML. Plugins include a citation...
    Downloads: 34 This Week