I have been trying to find a way to get the timestamps of each word in an
utterance for some time now. I believe this is done in the C implementation of
pocketsphinx using:
void ps_seg_frames(ps_seg_t *seg, int *out_sf, int *out_ef);
Is it possible to access this function through the Python bindings, or to find
the timestamps in any other way in Python?
Also, is there any documentation on the Python bindings API?
Hello, this feature is not implemented yet. There was a patch here, but it
seems to have been lost in the SourceForge outage:
https://sourceforge.net/tracker/?func=detail&atid=301904&aid=3480256&group_id=1904
It should be relatively easy to add this binding.
As for documentation on the Python bindings API: no.
How might one attempt to add these bindings?
The patch implementing this feature has just been committed to the Subversion
repository. Download and compile the snapshot to get this functionality.
See pocketsphinx/python/ps_test_seg.py for details on how to use it.
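Note that ps_seg_frames reports a start and end frame for each word, not a time
in seconds, so the frame numbers still need converting to get real timestamps.
Here is a minimal sketch of that conversion. It assumes the default frame rate
of 100 frames per second and treats the end frame as inclusive. The helper name
frames_to_seconds and the sample segment tuples are made up for illustration;
they are not part of the pocketsphinx bindings, and the layout of the values
the bindings actually return may differ.

```python
# Hypothetical helper for illustration; not part of the pocketsphinx bindings.
DEFAULT_FRAME_RATE = 100  # frames per second (assumed default; adjust if you changed -frate)


def frames_to_seconds(start_frame, end_frame, frame_rate=DEFAULT_FRAME_RATE):
    """Convert an inclusive [start_frame, end_frame] span to (start, end) in seconds."""
    if frame_rate <= 0:
        raise ValueError("frame_rate must be positive")
    start = start_frame / frame_rate
    # The end frame is treated as inclusive, so the word ends when that frame ends.
    end = (end_frame + 1) / frame_rate
    return start, end


# Made-up sample data in the form (word, start_frame, end_frame).
segments = [("hello", 12, 45), ("world", 46, 90)]
for word, sf, ef in segments:
    start, end = frames_to_seconds(sf, ef)
    print(f"{word}: {start:.2f}s to {end:.2f}s")
```

If you decode with a different frame rate, pass it as the third argument
instead of relying on the default.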