Reducing size of pocketsphinx for keyword spotting

Help
Ming Chen
2014-08-11
2014-08-12
  • Ming Chen

    Ming Chen - 2014-08-11

    I have been using the keyword spotting mode of pocketsphinx for a while. I'm wondering what is the best way to reduce as much of the source code/ project in terms of size as possible to make pocketsphinx work as a light-weight keyword detector. Any ideas are appreciated.

    Regards

     
    • Nickolay V. Shmyrev

      You can

      1) Drop unused searches like ngram search and fsg search, drop language model code from sphinxbase

      2) Compress the dictionary into CART-WFST

      3) Quantize the model to 4 bit to make it smaller.

      4) Retrain the model to reduce amount of senones.

      That should give you quite significant reduction in size. It is possible to fit everything in 2mb probably.

      Other steps depends on your requirements on vocabulary, size and accuracy.

       
      • Ming Chen

        Ming Chen - 2014-08-12

        Thanks for the advice.

        2) Compress the dictionary into CART-WFST

        May I know what is CART-WFST? Can I simply keep only the words I am going to detect in the dictionary?

        3) Quantize the model to 4 bit to make it smaller.

        What document and code should I refer to to quantize the AM?

        4) Retrain the model to reduce amount of senones.

        Is it the part that require a new set of recordings?

        Regards

         
        Last edit: Ming Chen 2014-08-12

Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:





No, thanks