|
From: E <oth...@ao...> - 2014-01-07 09:22:11
|
Hello, This is probably off-topic but I hope someone here knows answer. I am converting ARPA LM to FST using "transducersaurus". The command that I use is C*min(det(L*(G))) My problem is, the *.fst.txt file that is created has output symbols at unexpected locations. I expect the output symbols to be <eps> at all phonemes in the word except last phoneme and output symbol = "word" for only the last phoneme. But what I'm seeing is that sometimes output symbol = "word" even for phonemes that are not last phonemes in the word. I wonder what the issue is and how to properly identify word end boundary in an FST. Let me know if sharing my files will help in diagnosis. Thanks, Ethan |