Search Results for "chinese word segmentation" - Page 3

Showing 109 open source projects for "chinese word segmentation"

View related business solutions
  • Translate docs, audio, and videos in real time with Google AI Icon
    Translate docs, audio, and videos in real time with Google AI

    Make your content and apps multilingual with fast, dynamic machine translation available in thousands of language pairs.

    Google Cloud’s AI-powered APIs help you translate documents, websites, apps, audio files, videos, and more at scale with best-in-class quality and enterprise-grade control and security.
  • Digital Payments by Deluxe Payment Exchange Icon
    Digital Payments by Deluxe Payment Exchange

    A single integrated payables solution that takes manual payment processes out of the equation, helping reduce risk and cutting costs for your business

    Save time, money and your sanity. Deluxe Payment Exchange+ (DPX+) is our integrated payments solution that streamlines and automates your accounts payable (AP) disbursements. DPX+ ensures secure payments and offers suppliers alternate ways to receive funds, including mailed checks, ACH, virtual credit cards, debit cards, or eCheck payments. By simply integrating with your existing accounting software like QuickBooks®, you’ll implement efficient payment solutions for AP with ease—without costly development fees or untimely delays.
  • 1

    ZDT

    Zhongwen Development Tool - helping to study Mandarin Chinese

    .... It is also possible to search any word lists entered by the user using the flashcard plugin. * an Annotator plugin. The Annotator converts the Chinese characters into pinyin, using the Alphabet with accents. On the ZDT website, flashcard lists are available for some commonly used books, such as Hanyu Jiaocheng (汉语教程). User discussion can be found on: http://www.chinese-forums.com/index.php?/forum/35-zdt-flashcards-forum/ You also may choose to use Tickets -> Support (see above).
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    ZORE is a syntax-based Chinese (Zh) ORE system, which can extract relations and semantic patterns from Chinese text. ZORE identifies relation candidates from auto- matically parsed dependency trees, and then extracts relations with their semantic patterns iteratively through a novel double propagation algorithm. Empirical results on two data sets show the effectiveness of the proposed system. This software source is under GPL (v.3), and a separate commercial license issued by the authors for non...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Welsh Natural Language Toolkit

    Welsh Natural Language Toolkit

    WNLT is a suite of open source natural language modules for the Welsh

    The project supports the Welsh Language Technology domain with a set of NLP tools that drive innovation and advance the development of sophisticated textual analysis solutions. The WNLT project delivers four core NLP modules; a) Word Segmentation for separating text into words b) Sentence Boundary Disambiguation for finding sentence boundaries c) Part of Speech Tagger for determining the part of speech of each word d) Morphological Analyser for identifying the root form (lemma) of words...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Excellator

    Excellator

    WYSIWYG engineering calculations in MS Excel, report generation.

    The Excellator program helps in the management of basic computations in MS Excel and creation of reports in MS Word. Excellator facilitates to process a MS Excel book which is a self-contained document and is fully operable independently of Excellator. Working style is WYSIWYG - all formulae and data are displayed in the computational table. Cell addresses are not used, only variable names are used instead. Presentation http://prezi.com/6yrd0msgz9ch/?utm_campaign=share&utm_medium=copy&rc...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Free CRM Software With Something for Everyone Icon
    Free CRM Software With Something for Everyone

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
  • 5
    Anti prohibited-words software
    The software is designed for the net-surfers whose countries are strict in the control of network information. This software provides a solution to evade the detection of the prohibited-words examination system.(only available in Chinese version)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    FindAKanji

    a tool to learn about Chinese (and Japanese) characters

    The basic principle behind this program is 'Learning by Doing'. The user is encouraged to extend a database with acquired information about Chinese characters and Japanese Words. This information may turn out useful in the integrated text editor, for both reading and writing. So there are multiple different uses for this program: - Learn about Chinese characters. Gain, store and memorize knowledge about specific characters and words. - Input devices for Chinese (and Japanese...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    phpMyAdmin

    phpMyAdmin

    A software tool to bring MySQL to the Web

    phpMyAdmin is a tool written in PHP intended to handle the administration of MySQL over the Web. Currently it can create and drop databases, create/drop/alter tables, delete/edit/add columns, execute any SQL statement, manage indexes on columns.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    I started this project because I wanted a dead-simple blog. One that didn't need a database, used flat text files, and looked nice. It's main advantage is that it only requires PHP 5 and write permissions. There is no setup, just unzip and copy.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9

    GeoSegmenter

    A Chinese word segmenter for the geoscience domain

    GeoSegmenter is a Chinese word segmenter built specifically for the geoscience domain. It uses the conditional random fields (CRF) framework to build segmentation models. GeoSegmenter is trained with manually annotated geoscience documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Automated RMM Tools | RMM Software Icon
    Automated RMM Tools | RMM Software

    Proactively monitor, manage, and support client networks with ConnectWise Automate

    Out-of-the-box scripts. Around-the-clock monitoring. Unmatched automation capabilities. Start doing more with less and exceed service delivery expectations.
  • 10
    odt2braille
    odt2braille is a Braille extension to OpenOffice.org Writer. odt2braille enables authors to print documents to a Braille embosser and to export documents as Braille files. The Braille output is well-formatted and highly customizable.
    Leader badge
    Downloads: 20 This Week
    Last Update:
    See Project
  • 11

    Darkbot

    The IRC's Talking Robot

    [ Please read https://sourceforge.net/p/darkbot/news/2014/01/darkbots-revitalization/ ] Darkbot is a portable IRC chat robot written in the C language that can be taught responses to user inquiries, and even have conversations with them. Darkbot was originally created by Jason Hamilton as an aid for help channels on Intenet Relay Chat.
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12

    Docear

    An Academic Literature Suite

    Docear (pronounced dog-ear) is what we call an “academic literature suite”. It integrates everything you need to search, organize and create academic literature in a single application: a digital library, reference manager, PDF and file manager, note taking and mind mapping. And the best: Docear works seemlessly with many existing tools like Mendeley, Microsoft Word, and Foxit Reader. Docear is free and open source, based on Freeplane, funded by the German Federal Ministry of Technology...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Wukong

    Wukong

    Highly customizable full-text search engine

    Efficient indexing and searching (1M Weibo 500M data is indexed in 28 seconds, search response time is 1.65 milliseconds, and search QPS is 19K). Support Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Support to calculate the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine will send the text to be segmented to a word segmentation coroutine...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    My Vocabulary

    My Vocabulary

    vocabulary builder of any language

    MyVocabulary is a software application to create the vocabulary that a person wants to know when he is studying a new language (english, spanish, chinese, german,...). Also, when a word is added to the vocabulary, the word's audio (mp3 file) is downloaded automatically from a web site.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Bwst

    BWST is a short word for Brute-force Word-list Segmentation Technique.

    Simply, it’s a brute-force word list generating application.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16

    EnZhDictUtf8

    English to Traditional Chinese database for MySQL for translators

    ... on my localhost and then i use PhpMyAdmin to search the `en_zh_word` column of the `words` database to find the chinese word for a given english word. And then if i want to know the BoPoMoFo sound for a given chinese character, i search the `zhuyin` column of the `zhuyin` table. ... The license of the translations is the license of the original publishers of the translations, usually a gpl style license like pydict or linux. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FlareGet - Download Manager

    FlareGet - Download Manager

    An advanced multi-threaded and multisegment download manager.

    FlareGet is a full featured, multi-threaded and multi-segment download manager and accelerator for Windows and Linux
    Downloads: 55 This Week
    Last Update:
    See Project
  • 18
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    CRF Segmenter

    An improved method for discriminating Chinese word segmenter

    CRF Segmenter is an improved method for discriminating Chinese word segmenter. We introduce some global features and context features and get almost the same performance only with much smaller corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A web search engine and crawler written in java/mysql, fulltext and vertical search, word segmentation system .
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    WordSegment

    WordSegment

    wordseg project is a word segment module implemented by C#

    wordseg project is a word segment module implemented by C#. It is used to segment text into tokens and to label token's attribute according its context and semantic by front-maximum matching and CRF algorithms. The following are some sentences need to be segmented: 张晓晨和付仲恺一起坐在家(西坝河东里社区)里的沙发上看非诚勿扰。 百度公司的名字源于“众里寻他千百度”这诗句。 After above sentences be segmented by wordseg, the result as follows for each sentence: 张晓晨[PER] 和 付仲恺[PER] 一起 坐 在 家 ( 西坝河东里社区[LOC] ) 里 的 沙发[PDT] 上 看 非 诚 勿扰 。 百度公司...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    CRFSharp

    CRFSharp

    CRFSharp is a .NET(C#) implementation of Conditional Random Field

    CRFSharp(aka CRF#) is a .NET(C#) implementation of Conditional Random Fields, an machine learning algorithm for learning from labeled sequences of examples. It is widely used in Natural Language Process (NLP) tasks, for example: word breaker, postagging, named entity recognized, query chunking and so on. CRF#'s mainly algorithm is the same as CRF++ written by Taku Kudo. It encodes model parameters by L-BFGS. Moreover, it has many significant improvement than CRF++, such as totally parallel...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Bibus is a bibliographic database. It uses a MySQL or SQLite database to store references. It can directly insert references in OpenOffice.org and MS Word and generate the bibliographic index.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    flash implementation of word net browser 's core control, word net can be displayed in web browser now.
    Downloads: 0 This Week
    Last Update:
    See Project