Menu

Evaluating Chinese ASR

dovark
2012-10-26
2012-10-28
  • Nickolay V. Shmyrev

    Hi

    Most Chinese texts do not have word bounaries and words are usually just one or two characters. So the systems are just trained on a character streams and the error rate is evaluated as CER since reference is also not split on words. And word split is not a trivial task itself.

    CER is not exactly the same value as WER but it's used in most evaluations so this is just a common practice.

     
  • dovark

    dovark - 2012-10-27

    Thanks. Would SER (syllable error rate) be a better/worse statistic than CER for continuous speech recognition? Or is SER equivalent to phoneme-error-rate in English?

     
  • Nickolay V. Shmyrev

    Or is SER equivalent to phoneme-error-rate in English?

    I think SER is not directly equivalent to Manadarin CER because of number of entries in language model and different probability distributions. We can not compare 60k words or symbols encoding words in a language to 1000 most common English syllables given they have very different distribution patterns.

     

Log in to post a comment.