Search Results for "chinese word segmentation" - Page 2

Showing 68 open source projects for "chinese word segmentation"

View related business solutions
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
  • RMM Software | Remote Monitoring Platform and Tools Icon
    RMM Software | Remote Monitoring Platform and Tools

    Best-in-class automation, scalability, and single-pane IT management.

    Don’t settle when it comes to managing your clients’ IT infrastructure. Exceed their expectations with ConnectWise RMM, our MSP RMM software that provides proactive tools and NOC services—regardless of device environment. With the number of new vulnerabilities rising each year, smart patching procedures have never been more important. We automatically test and deploy patches when they are viable and restrict patches that are harmful. Get better protection for clients while you spend less time managing endpoints and more time growing your business. It’s tough to locate, afford, and retain quality talent. In fact, 81% of IT leaders say it’s hard to find the recruits they need. Add ConnectWise RMM, NOC services and get the expertise and problem resolution you need to become the advisor your clients demand—without adding headcount.
  • 1
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 2
    JXplorer - A Java Ldap Browser

    JXplorer - A Java Ldap Browser

    Mature LDAP, LDIF and DSML client with i18n support.

    A java LDAP client with LDIF support, security (inc SSL, SASL & GSSAPI), translated into many languages (inc. Chinese), online help, user forms and many other features. The commercial version is available at https://jxworkbench.com for $9.95. It extends JXplorer to include: - custom LDAP reporting - to pdf, word etc. - Find and Replace with regexp and attribute substitution - A secure password vault to store directory connections - etc. Support for JXplorer and JXWorkbench is available...
    Leader badge
    Downloads: 852 This Week
    Last Update:
    See Project
  • 3
    Riot search

    Riot search

    Go Open Source, Distributed, Simple and efficient Search Engine

    Go Open Source, Distributed, Simple and efficient full text search engine. Efficient indexing and search (1M blog 500M data 28 seconds index finished, 1.65 ms search response time, 19K search QPS). Support for logical search. Support Chinese word segmentation (use gse word segmentation package concurrent word, speed 27MB / s). Support the calculation of the keyword in the text close to the distance(token proximity). Support calculation BM25 correlation. Support custom scoring field and scoring...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    kcws

    kcws

    Deep Learning Chinese Word Segment

    Deep learning chinese word segment. Install the bazel code construction tool and install tensorflow (currently this project requires tf 1.0.0alpha version or above) Switch to the code directory of this project and run ./configure. Compile background service. Pay attention to the public account of waiting for words and reply to kcws to get the corpus download address. Extract the corpus to a directory. Change to the code directory.After installing tensorflow, switch to the kcws code directory...
    Downloads: 0 This Week
    Last Update:
    See Project
  • All-in-One Payroll and HR Platform Icon
    All-in-One Payroll and HR Platform

    For small and mid-sized businesses that need a comprehensive payroll and HR solution with personalized support

    We design our technology to make workforce management easier. APS offers core HR, payroll, benefits administration, attendance, recruiting, employee onboarding, and more.
  • 5
    Ansj Chinese word segmentation

    Ansj Chinese word segmentation

    Ansj word segmentation

    The real java implementation of ict. The word segmentation effect is faster than the open source version of ict. Chinese word segmentation, name recognition, part-of-speech tagging, user-defined dictionary. This is a java implementation of Chinese word segmentation based on n-Gram+CRF+HMM. The word segmentation speed reaches about 2 million words per second (tested under mac air), and the accuracy rate can reach more than 96%. At present, it has realized the functions of Chinese word...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6

    Devanagari OCR

    Devanagari Optical Character Recognition, Annotation tool

    ... enables users to view the word and character level ground truth of images. To cite this work, please use: "Devanagari OCR using a recognition driven segmentation framework and stochastic language models", Suryaprakash Kompalli, Srirangaraj Setlur, Venu Govindaraju, IJDAR, 2009, Volume: 12, Pg.: 123–138
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    OpenTeacher
    OpenTeacher is an opensource application that helps you learn a foreign language vocabulary. Just enter some words in your native and foreign language, and OpenTeacher tests you.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 8
    Welsh Natural Language Toolkit
    The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently support word segmentation, POS tagging, dependency and phrase-structure parsing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gain insights and build data-powered applications Icon
    Gain insights and build data-powered applications

    Your unified business intelligence platform. Self-service. Governed. Embedded.

    Chat with your business data with Looker. More than just a modern business intelligence platform, you can turn to Looker for self-service or governed BI, build your own custom applications with trusted metrics, or even bring Looker modeling to your existing BI environment.
  • 10

    ZDT

    Zhongwen Development Tool - helping to study Mandarin Chinese

    .... It is also possible to search any word lists entered by the user using the flashcard plugin. * an Annotator plugin. The Annotator converts the Chinese characters into pinyin, using the Alphabet with accents. On the ZDT website, flashcard lists are available for some commonly used books, such as Hanyu Jiaocheng (汉语教程). User discussion can be found on: http://www.chinese-forums.com/index.php?/forum/35-zdt-flashcards-forum/ You also may choose to use Tickets -> Support (see above).
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    ZORE is a syntax-based Chinese (Zh) ORE system, which can extract relations and semantic patterns from Chinese text. ZORE identifies relation candidates from auto- matically parsed dependency trees, and then extracts relations with their semantic patterns iteratively through a novel double propagation algorithm. Empirical results on two data sets show the effectiveness of the proposed system. This software source is under GPL (v.3), and a separate commercial license issued by the authors for non...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Welsh Natural Language Toolkit

    Welsh Natural Language Toolkit

    WNLT is a suite of open source natural language modules for the Welsh

    The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    FindAKanji

    a tool to learn about Chinese (and Japanese) characters

    The basic principle behind this program is 'Learning by Doing'. The user is encouraged to extend a database with acquired information about Chinese characters and Japanese Words. This information may turn out useful in the integrated text editor, for both reading and writing. So there are multiple different uses for this program: - Learn about Chinese characters. Gain, store and memorize knowledge about specific characters and words. - Input devices for Chinese (and Japanese) characters...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    phpMyAdmin

    phpMyAdmin

    A software tool to bring MySQL to the Web

    phpMyAdmin is a tool written in PHP intended to handle the administration of MySQL over the Web. Currently it can create and drop databases, create/drop/alter tables, delete/edit/add columns, execute any SQL statement, manage indexes on columns.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    I started this project because I wanted a dead-simple blog. One that didn't need a database, used flat text files, and looked nice. It's main advantage is that it only requires PHP 5 and write permissions. There is no setup, just unzip and copy.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 16

    GeoSegmenter

    A Chinese word segmenter for the geoscience domain

    GeoSegmenter is a Chinese word segmenter built specifically for the geoscience domain. It uses the conditional random fields (CRF) framework to build segmentation models. GeoSegmenter is trained with manually annotated geoscience documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Darkbot

    The IRC's Talking Robot

    [ Please read https://sourceforge.net/p/darkbot/news/2014/01/darkbots-revitalization/ ] Darkbot is a portable IRC chat robot written in the C language that can be taught responses to user inquiries, and even have conversations with them. Darkbot was originally created by Jason Hamilton as an aid for help channels on Intenet Relay Chat.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 18

    Docear

    An Academic Literature Suite

    Docear (pronounced dog-ear) is what we call an “academic literature suite”. It integrates everything you need to search, organize and create academic literature in a single application: a digital library, reference manager, PDF and file manager, note taking and mind mapping. And the best: Docear works seemlessly with many existing tools like Mendeley, Microsoft Word, and Foxit Reader. Docear is free and open source, based on Freeplane, funded by the German Federal Ministry of Technology...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Wukong

    Wukong

    Highly customizable full-text search engine

    Efficient indexing and searching (1M Weibo 500M data is indexed in 28 seconds, search response time is 1.65 milliseconds, and search QPS is 19K). Support Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Support to calculate the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine will send the text to be segmented to a word segmentation coroutine...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Bwst

    BWST is a short word for Brute-force Word-list Segmentation Technique.

    Simply, it’s a brute-force word list generating application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    EnZhDictUtf8

    English to Traditional Chinese database for MySQL for translators

    ... localhost and then i use PhpMyAdmin to search the `en_zh_word` column of the `words` database to find the chinese word for a given english word. And then if i want to know the BoPoMoFo sound for a given chinese character, i search the `zhuyin` column of the `zhuyin` table. ... The license of the translations is the license of the original publishers of the translations, usually a gpl style license like pydict or linux. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    CRF Segmenter

    An improved method for discriminating Chinese word segmenter

    CRF Segmenter is an improved method for discriminating Chinese word segmenter. We introduce some global features and context features and get almost the same performance only with much smaller corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A web search engine and crawler written in java/mysql, fulltext and vertical search, word segmentation system .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    WordSegment

    WordSegment

    wordseg project is a word segment module implemented by C#

    wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms. The following are some sentences need to be segmented: 张晓晨和付仲恺一起坐在家(西坝河东里社区)里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After above sentences be segmented by wordseg, the result as follows for each sentence: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 ( 西坝河东里社区[LOC] ) 里 的 沙发[PDT] 上 看 非 诚 勿扰 。 百度公司[ORG] 的 名字...
    Downloads: 0 This Week
    Last Update:
    See Project