Menu

Single Word Classification and Corpus

Help
Gizmoguy
2016-10-18
2016-10-18
  • Gizmoguy

    Gizmoguy - 2016-10-18

    Hi all,

    I would like to add limited vocabulary speech recognition to a larger project. For my purposes I need to classify a speech sample but it is not necessary to decode it to phonemes or a word. As I do not know much about speech recognition, my initial approach would be clustering MFCC features, but I need a speech corpus of single words with multiple speakers for each word, which I have so far been unable to find.

    If anyone can provide information on a good technique for my purpose or a freely-available corpus, I would be grateful.

    Thanks.

     
    • Nickolay V. Shmyrev

      This is called voice activity detection or VAD

      You can download a database for training here http://www.openslr.org/17/

      You can read paper about it here https://arxiv.org/pdf/1510.08484v1.pdf

       
      • Gizmoguy

        Gizmoguy - 2016-10-18

        Thanks for the prompt reply.

        It is not speech/non-speech I need to classify, but I need to classify the same word being spoken across different speakers.

         

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.