Hi -
I'm trying to compile Sphinx3 for Win32. Unfortunately, the source files do not include a .sln file. Please advise.
Regards,
Use pocketsphinx
Thanks for your reply. I've got several questions:
Does PocketSphinx perform phoneme recognition?
Are there any tutorials for PocketSphinx? The only tutorials that I've found are for Sphinx3.
I notice that SphinxTrain and SphinxBase appear to contain the same binaries as PocketSphinx. Are SphinxTrain and SphinxBase supersets of PocketSphinx?
I am looking to adapt the acoustic models provided to British English. Which of the three projects should I be using?
Can adaptation be performed simultaneously using data from more than one speaker, or even incrementally? That is, can an adapted model be readapted at a later time?
No
http://cmusphinx.sourceforge.net/wiki/tutorial
You can learn about the package hierarchy from the tutorial:
http://cmusphinx.sourceforge.net/wiki/tutorialoverview
You need all three
Yes
Last edit: Nickolay V. Shmyrev 2012-12-29
Thanks. To recap, bearing in mind that I am interested in phoneme recognition, should I stick to SphinxTrain, SphinxBase and Sphinx3 (I recall reading somewhere that Sphinx4 does not perform phoneme recognition), or has all the functionality of Sphinx3 been moved to SphinxTrain and SphinxBase?
Last edit: videohead 2012-12-29
As a follow-up, the Phoneme Recognition page of the wiki http://cmusphinx.sourceforge.net/wiki/phonemerecognition references sphinx3_decoder.exe. No similarly named binary (e.g. decoder.exe) is contained in SphinxTrain, SphinxBase or PocketSphinx.
Phoneme recognition is almost never accurate; for that reason, you most likely do not need it.
No
I am aware that the lack of text constraints makes phoneme recognition inaccurate. Unfortunately, my application requires phoneme transcriptions. The emphasis is not merely on what the speaker intends to say but on what is actually said, for example elided or mispronounced words. I guess I should give Sphinx3 a try, see just how accurate it is, and go on from there. Can you provide the *.sln file for Sphinx3? It is missing from the download.
Thanks
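To make the "intended versus actually said" comparison concrete, here is a minimal sketch of how a recognized phoneme sequence could be aligned against the expected pronunciation to flag elisions and mispronunciations. It is plain Python and does not call any Sphinx API. The function name `align_phonemes`, the ARPAbet-style phone symbols and the "recognized" sequence are all assumptions made up for illustration, not output from a real decoder.

```python
def align_phonemes(expected, actual):
    """Align two phoneme sequences using Levenshtein edit distance.

    Returns a list of (operation, expected_phone, actual_phone) tuples,
    where operation is "match", "substitution", "elision" or "insertion".
    """
    n, m = len(expected), len(actual)
    # cost[i][j] = edit distance between expected[:i] and actual[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0 if expected[i - 1] == actual[j - 1] else 1
            cost[i][j] = min(
                cost[i - 1][j - 1] + sub,  # match or substitution
                cost[i - 1][j] + 1,        # expected phone was dropped
                cost[i][j - 1] + 1,        # extra phone was spoken
            )

    # Trace back from the bottom-right corner to recover the operations.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = 0 if expected[i - 1] == actual[j - 1] else 1
            if cost[i][j] == cost[i - 1][j - 1] + sub:
                kind = "match" if sub == 0 else "substitution"
                ops.append((kind, expected[i - 1], actual[j - 1]))
                i -= 1
                j -= 1
                continue
        if i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("elision", expected[i - 1], None))
            i -= 1
        else:
            ops.append(("insertion", None, actual[j - 1]))
            j -= 1
    ops.reverse()
    return ops


if __name__ == "__main__":
    # Hypothetical example: "probably" spoken as "probly".
    expected = ["P", "R", "AA", "B", "AH", "B", "L", "IY"]
    recognized = ["P", "R", "AA", "B", "L", "IY"]  # made-up decoder output
    for op in align_phonemes(expected, recognized):
        if op[0] != "match":
            print(op)
```

Uniform edit costs are the simplest choice here. Since phoneme recognition is noisy, a real application would probably need to weight confusable phones differently and treat single differences with caution.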
The .sln file is available in Subversion. For more details, see:
https://sourceforge.net/p/cmusphinx/code/
Thanks.