I'm planning to build a Korean dictionary for hunspell. I had some success with the hunspell command, but it required a converter between Hangul Unicode syllables and jamo, like this:
$ echo korean_words | ./syl2jamo | hunspell -d ko | ./jamo2syl
Modern Korean is usually written with precomposed Hangul syllable codes (U+AC00-U+D7A3 in Unicode). But these are a bad choice for calculating word edit distances. Hangul jamo codes (the block starting at U+1100, U+1100-U+11FF) are much better and can be processed in the same way as Western scripts, since each jamo code represents its own sound/keystroke.
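To illustrate the edit-distance point, here is a rough Python sketch. It assumes Unicode NFD normalization as a stand-in for the syllable-to-jamo step (NFD decomposes precomposed Hangul syllables into conjoining jamo); `to_jamo` and `edit_distance` are hypothetical helper names, not part of hunspell.

```python
import unicodedata

def to_jamo(text: str) -> str:
    # NFD splits each precomposed Hangul syllable into its conjoining jamo.
    # Assumption: used here only as a stand-in for a syllable-to-jamo converter.
    return unicodedata.normalize("NFD", text)

def edit_distance(a: str, b: str) -> int:
    # Plain Levenshtein distance over code points (insert, delete, substitute).
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # substitute or match
        prev = cur
    return prev[-1]

# "한" and "할" differ only in the final consonant; "한" and "꿀" share no jamo.
for other in ("할", "꿀"):
    print(other,
          edit_distance("한", other),                    # distance on syllables
          edit_distance(to_jamo("한"), to_jamo(other)))  # distance on jamo
```

On syllable codes both pairs are one substitution apart, so a near miss and an unrelated syllable look equally close. On jamo the two cases become distinguishable.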
Can hunspell convert Hangul syllable characters into jamo characters and vice versa, so that internally it processes Hangul text only as jamo codes? This would be a good first step toward implementing Korean language support.
Conversion is simple:
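As a minimal sketch in Python, assuming the arithmetic decomposition of precomposed syllables defined in the Unicode Standard (19 initial consonants, 21 vowels, and 27 final consonants plus "no final"). The function names are borrowed from the pipeline above for illustration only; this is not the code of those tools, and it does not handle archaic jamo or compatibility jamo (U+3131 and up).

```python
S_BASE, L_BASE, V_BASE, T_BASE = 0xAC00, 0x1100, 0x1161, 0x11A7
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28
N_COUNT = V_COUNT * T_COUNT   # 588 syllables per initial consonant
S_COUNT = L_COUNT * N_COUNT   # 11172 precomposed syllables in total

def syl2jamo(text: str) -> str:
    """Decompose precomposed syllables into conjoining jamo."""
    out = []
    for ch in text:
        s = ord(ch) - S_BASE
        if 0 <= s < S_COUNT:
            out.append(chr(L_BASE + s // N_COUNT))              # initial
            out.append(chr(V_BASE + (s % N_COUNT) // T_COUNT))  # vowel
            t = s % T_COUNT
            if t:                                               # final, if any
                out.append(chr(T_BASE + t))
        else:
            out.append(ch)  # non-Hangul text passes through unchanged
    return "".join(out)

def jamo2syl(text: str) -> str:
    """Recompose initial + vowel (+ final) jamo runs into syllables."""
    out, i, n = [], 0, len(text)
    while i < n:
        c = ord(text[i])
        if (L_BASE <= c < L_BASE + L_COUNT and i + 1 < n
                and V_BASE <= ord(text[i + 1]) < V_BASE + V_COUNT):
            l, v, t = c - L_BASE, ord(text[i + 1]) - V_BASE, 0
            i += 2
            if i < n and T_BASE < ord(text[i]) < T_BASE + T_COUNT:
                t = ord(text[i]) - T_BASE
                i += 1
            out.append(chr(S_BASE + (l * V_COUNT + v) * T_COUNT + t))
        else:
            out.append(text[i])  # stray jamo or other text is left as-is
            i += 1
    return "".join(out)

word = "한글"
jamo = syl2jamo(word)
print([f"U+{ord(c):04X}" for c in jamo])
print(jamo2syl(jamo) == word)
```

Both directions are table-free integer arithmetic on code points, so the same logic should be straightforward to port to C++ if it were done inside hunspell.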