Trying to use the g2p-seq2seq program. Should I format the cmudict.dict to the SPHINX format before using it as input to train in g2p-seq2seq? or does it take care of that? Just to elaborate, do I need to remove stress markers and variant markers and convert spaces between words and phones to tabs?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi:
Trying to use the g2p-seq2seq program. Should I format the cmudict.dict to the SPHINX format before using it as input to train in g2p-seq2seq? or does it take care of that? Just to elaborate, do I need to remove stress markers and variant markers and convert spaces between words and phones to tabs?
Yes
No