I'm using pocketsphinx to convert an audio file recorded on a Motorola Droid.
Since the Droid can only upload M4A files, I'm converting the file to a wav
with mplayer. When I put the wav into pocketsphinx, I get this output:
If, however, I record a wav file in Audacity directly on my desktop,
pocketsphinx works. This issue is happening consistently among ~10
Droid-generated files and ~10 Audacity-generated files. The conversion is
taking place on a 64-bit Ubuntu 10.04 machine.
An example file that doesn't work (from Droid) is here:
http://zachrattner.com/wav/test.wav
An example file that does work (from Audacity) is here:
http://zachrattner.com/wav/test2.wav
If anyone could shed some light on what I'm doing wrong, I'd appreciate it.
Thanks,
Zach
Hi Zach,
Sphinx input files should be uncompressed 16-bit PCM, while your test file is
a 32-bit IEEE float file. Change the output options in mplayer to 16-bit PCM.
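
To see what a converted file actually contains before handing it to pocketsphinx, the header can be read directly. Below is a minimal Python sketch, assuming a standard RIFF/WAVE layout. The names `read_wav_format` and `format_problems` are made up for illustration and are not part of pocketsphinx. The header is parsed by hand because Python's standard `wave` module only handles integer PCM and refuses IEEE float files.

```python
import struct

# Format tags defined by the WAVE format: 1 = integer PCM, 3 = IEEE float.
WAVE_FORMAT_PCM = 1
WAVE_FORMAT_IEEE_FLOAT = 3


def read_wav_format(path):
    """Return the main fields of the 'fmt ' chunk of a RIFF/WAVE file."""
    with open(path, "rb") as f:
        riff, _size, wave_id = struct.unpack("<4sI4s", f.read(12))
        if riff != b"RIFF" or wave_id != b"WAVE":
            raise ValueError("not a RIFF/WAVE file")
        while True:
            header = f.read(8)
            if len(header) < 8:
                raise ValueError("no 'fmt ' chunk found")
            chunk_id, chunk_size = struct.unpack("<4sI", header)
            if chunk_id == b"fmt ":
                body = f.read(chunk_size)
                if len(body) < 16:
                    raise ValueError("'fmt ' chunk is too short")
                tag, channels, rate, _byte_rate, _align, bits = struct.unpack(
                    "<HHIIHH", body[:16]
                )
                return {
                    "format_tag": tag,
                    "channels": channels,
                    "sample_rate": rate,
                    "bits_per_sample": bits,
                }
            # Skip other chunks; chunk data is padded to an even length.
            f.seek(chunk_size + (chunk_size & 1), 1)


def format_problems(info, expected_rate=16000):
    """List the ways a file differs from 16-bit mono PCM at expected_rate.

    expected_rate is an assumption: it should match the acoustic model in use.
    """
    problems = []
    if info["format_tag"] != WAVE_FORMAT_PCM:
        problems.append("format tag %d is not integer PCM" % info["format_tag"])
    if info["bits_per_sample"] != 16:
        problems.append("%d bits per sample, expected 16" % info["bits_per_sample"])
    if info["channels"] != 1:
        problems.append("%d channels, expected mono" % info["channels"])
    if info["sample_rate"] != expected_rate:
        problems.append(
            "%d Hz, expected %d Hz" % (info["sample_rate"], expected_rate)
        )
    return problems

# Usage (hypothetical file name):
#   print(format_problems(read_wav_format("test.wav")))
```

An empty list means the file is 16-bit mono PCM at the expected rate. It does not say anything about whether the audio itself is good enough to recognize.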
Thanks marekl0. Now I have an 8kHz, stereo, PCM 16-bit WAV file and I'm still
getting the same error. Do you know if there are any other limitations on what
kind of wav I can use?
Thanks,
Zach
In general you should use uncompressed 16-bit PCM mono WAV. Most acoustic
models are trained on 16kHz audio, so if you use another sampling frequency
you have to make sure that the acoustic model you are using was created for
that frequency.
Marek
Moreover (I forgot to add), your current pocketsphinx settings are for 16kHz
files (see the -samprate option).
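
The mono requirement mentioned above can be met with a small script. Here is a rough Python sketch that averages the channels of a 16-bit PCM WAV into one channel using only the standard library. The function name `downmix_to_mono` is hypothetical. The sketch does not resample: the sample rate is copied unchanged and returned, so it still has to agree with the acoustic model and with the value given to -samprate.

```python
import array
import sys
import wave


def downmix_to_mono(src_path, dst_path):
    """Write a mono copy of a 16-bit PCM WAV file and return its sample rate."""
    with wave.open(src_path, "rb") as src:
        if src.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples")
        channels = src.getnchannels()
        rate = src.getframerate()
        raw = src.readframes(src.getnframes())

    samples = array.array("h")
    samples.frombytes(raw)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian

    if channels == 1:
        mono = samples
    else:
        # Samples are interleaved per frame; average each frame's channels.
        mono = array.array(
            "h",
            (
                sum(samples[i:i + channels]) // channels
                for i in range(0, len(samples), channels)
            ),
        )

    out = array.array("h", mono)
    if sys.byteorder == "big":
        out.byteswap()  # back to little-endian for writing

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(rate)
        dst.writeframes(out.tobytes())
    return rate

# Usage (hypothetical file names):
#   rate = downmix_to_mono("droid_stereo.wav", "droid_mono.wav")
#   # rate should equal the value passed to pocketsphinx via -samprate
```

Averaging the channels is one simple way to downmix. Keeping only the left or right channel would also give a mono file.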