CMU Sphinx / Forums / Help: Hebrew model for letters

Speech Recognition Toolkit

Hebrew model for letters

Forum: Help

Created: 2023-08-09

Updated: 2023-08-10

Alex Rudnicky - 2023-08-10

Current speech speech systems can use speech-to-letter models for decoding the audio signal, together with a language model to detect legal sequences. Look into into speech-to-vec 2 models; thesecan generate a feature space that can then be trained to classify tokens for specific languages. With either approach you will need to have at least some annotated speech to map the audio to symbols. One rule of thumb is ~50 instances for each symbol, but this can vary accordining to the end task. Trying for a uniform distribution over symbols is a good idea.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Hebrew model for letters