Showing 19 open source projects for "chinese word segmentation"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    gse

    gse

    Go efficient multilingual NLP and text segmentation

    Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text use...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    ToolGood.Words

    ToolGood.Words

    A high-performance sensitive word

    A high-performance sensitive word (illegal word/dirty word) detection and filtering component, with a traditional and simplified exchange, supports full-width half-width exchange, Chinese characters to pinyin, fuzzy search, and other functions. C#Language, using StringSearchEx2.Replacefiltering, on a 48k sensitive thesaurus at over 300 million characters per second. (cpu i7 8750h). C#'s own regular is very slow, which StringSearchEx2.ContainsAnyis Regex.IsMatchmore than 88,000 times...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    WinMerge

    WinMerge

    Windows visual diff and merge for files and directories

    WinMerge is a Windows tool for visual difference display and merging, for both files and directories. It is highly useful for determining what has changed between file versions, and then merging those changes. WinMerge has Unicode support, Flexible syntax coloring editor, Visual SourceSafe integration, and Windows Shell integration. Regexp filtering for filenames and lines. Side-by-side line difference and highlights differences inside lines. A file map shows the overall file differences in...
    Leader badge
    Downloads: 35,155 This Week
    Last Update:
    See Project
  • Multi-Site Network and Cloud Connectivity for Businesses Icon
    Multi-Site Network and Cloud Connectivity for Businesses

    Internet connectivity without complexity

    As your users rely more and more on Cloud and Internet-based technologies, reliable internet connectivity becomes more and more important to your business. With Bigleaf’s proven SD-WAN architecture, groundbreaking AI, and DDoS attack mitigation, you can finally deliver the reliable internet connectivity your business needs without the limitations of traditional networking platforms. Bigleaf’s Cloud Access Network and plug-and-play router allow for limitless control to and from anywhere your traffic needs to go. Bigleaf’s self-driving AI automatically identifies and adapts to any changing circuit conditions and traffic needs—addressing issues before they impact your users. Bigleaf puts you in the driver’s seat of every complaint and support call with full-path traffic and network performance data, delivered as actionable insights, reports, and alerts.
    Learn More
  • 5
    mPDF

    mPDF

    PHP library generating PDF files from UTF-8 encoded HTML

    mPDF is a PHP library that generates PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files ‘on-the-fly’ from his website, handling different languages. It is slower than the original scripts e.g. HTML2FPDF and produces larger files when using Unicode fonts, but support for CSS styles etc. and has been much enhanced. Supports almost all languages including RTL (Arabic and Hebrew), and CJK (Chinese-Japanese-Korean). Nested block-level elements (e.g. P...
    Downloads: 93 This Week
    Last Update:
    See Project
  • 6
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 2,096 This Week
    Last Update:
    See Project
  • 7
    jEdit

    jEdit

    jEdit is a programmer's text editor written in Java.

    jEdit is a programmer's text editor written in Java. It uses the Swing toolkit for the GUI and can be configured as a rather powerful IDE through the use of its plugin architecture.
    Leader badge
    Downloads: 1,094 This Week
    Last Update:
    See Project
  • 8
    ClipAngel

    ClipAngel

    Clipboard history capture and paste tool

    This program belongs to class "Clipboard Manager". It captures many clipboard objects and allows user to select and paste one in any program. Inspired by analogs: ClipDiary, Ditto, AceText. Main priorities: usability, compatibility, reliability. Requires .Net Framework 4.52+ (Windows Vista+/Server 2008+)
    Leader badge
    Downloads: 127 This Week
    Last Update:
    See Project
  • 9
    TNN

    TNN

    Uniform deep learning inference framework for mobile

    TNN, a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform, high performance, model compression, and code tailoring. The TNN framework further strengthens the support and performance optimization of mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it refers to the high performance and good scalability characteristics of the industry's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Free CRM Software With Something for Everyone Icon
    Free CRM Software With Something for Everyone

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
    Get free CRM
  • 10
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    ... (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. Automatic download of some datasets and pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    NüHelp: Word/HTML to CHM converter

    NüHelp: Word/HTML to CHM converter

    Convert Microsoft Word and HTML files to a CHM help file.

    *New version 2018.04.23* - use the contact page on opulos.com or sourceforge to report any issues. NüHelp converts Microsoft Word documents and HTML files to a CHM file. It's easy to use for beginners and has lots of options for advanced users. Requirements: 1) Word 2003, 2007, 2010 or 2013. 2) HTML Help Workshop (included in zip file) 3) .Net 3.5 Client Profile (already on most computers) *** Win7/8 users: After downloading the file, you will have to right click on the zip file...
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    ApiManager

    ApiManager

    Free API collaboration management system

    Completely open source, free API collaboration management system. Collaborative development, online testing, document management, export interface, personalized function customization. Simple and efficient BUG management system to record every change. Alibaba Cloud's secure cloud storage, backup data regularly, and support local deployment. Team collaboration, permission control, modification log. Database table, markdown, restful, mock, pdf, word. HTTPS, private projects, encrypted projects...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    KTDictSeg is a Chinese Segment Open source Project.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    uListen is a TTS(Text To Speech) application. It can TALK you the web pages, chm files, pdf files and word files and plain text files.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 17
    Resx Editor is a small translation-oriented file editor for Microsoft RESX 2.0 files. Resx Editor is designed for professional translators (include a word count feature).
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Bi-gram applications based on language models produced by SRILM from Chinese Wikipedia corpus, include Chinese word segmenter, word-based (not character-based) Traditional-Simplified Chinese converter and Chinese syllable-to-word converter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    libtabe is a library which provides useful Chinese functions/routines that can deal with fundamental elements such as pronunciation(BoPoMoFo), character frequency, word identification, word frequency. It also comes with a large free word database.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next