Menu

Correcting Chinese Segmentation

Anonymous
2018-11-11
2018-11-12
  • Anonymous

    Anonymous - 2018-11-11

    Hi Dr. Koichi Higuchi,
    First, let me say thanks for the great tool. I am working on a project analyzing Chinese transcripts produced by second language users. Everything is working great, but I noticed that the tools is segmenting some characters together that shouldn't really be words. Is there anyway to manually separate some of these characters so they are not read as one word?
    On another note, I'll send a citation of the project once completed.
    Thanks!

     
  • HIGUCHI Koichi

    HIGUCHI Koichi - 2018-11-12

    Hi,

    Does “force pick up” functionality work for you? With this functionality, the specified character or character string is always extracted as one word.

    Please go to “Pre-Processing”, “Select Words to Analyze” in the menu of KH Coder. Then enter Chinese characters or character strings into “force pick up” field and click OK. Finally, run pre-processing again.

    Best,

     

Anonymous
Anonymous

Add attachments
Cancel





MongoDB Logo MongoDB