Hello, my english is not very good, sorry for that. I will try to explain
myself. First a small briefing:
Platform: Android + PocketSphinx.
Language Model Parameters: voxforge_es_sphinx.cd_cont_1500 - v.0.1.1.
Dictionary: 244 Words.
Grammar: JSGF.
When I started working with Android + PocketSphinix the Decoder takes about
one/three second to load all stuff using "hub4wsj_sc_8k" and that was great.
Now I am using the VoxForge Language Model Parameters for spanish and I am
getting about five/seven seconds at time of load the Decoder. That is too much
time.
I know that my dictionary and JSGF file loads very quickly. So the problem is
inside the Language Model Paremeters.
Question is: why those differences between loading time?
Thanks in advance.
Regards,
Lilo.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Not really an answer but you only need to encounter that 5s/7s once. One thing
that I did is just hide that 5s in the loading screen they initially see when
running the app.
The spanish voxforge audio is also available to train your own model.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-07
@eliasmajic thank you very much for your reply. By the nature of the
application, I can not wait the initial seven seconds. I know that I can add
all heavy load in a service executed at system boot and avoid the loading in
each launch of the application (this will be my last alternative).
About to train a new model I have doubts. Actually I am creating the
Dictionary and the JSGF in a dinamic way. So I don't know how to train a model
without a static content. Due to my bad english... my knowledges about that
are very poor. For that reason I like to know why one model is loaded faster
than other.
One more time, sorry for my bad english. I am not an smart monkey! I am an
average human :)
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-08
At this time, just with testing purposes I am trying to train a new Acoustic
Model using VoxForge stuff for spanish. But I'm getting the following Warnings
and Errors:
Initially, Dictionary and Transcription files was in UTF-8 but I was getting
"broken" characters in the log output.. so I set both files to ISO-8859-15.
Then all characters like áéíóú was showed fine, but I have the same errors.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
WARNING: This word: VACíA was in the transcript file, but is not in the
dictionary ( TODO ESTABA LO MISMO QUE UNA HORA ANTES CUANDO EL Té HUMEABA
EN LA TAZA DE OJEDA AHORA VACíA Y BLANQUEABAN SOBRE LA MESA LOS PLIEGOS
). Do cases match?
Dictionary might require resorting after you converted it to another encoding.
It's recommended to use UTF-8 , not 8859-15.
The warning just tells you that the word is missing. All you need to do is to
make sure word is there.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-08
Thank you for your reply. Initially I was using the original file (encoded
with UTF-8) but I got the following warnings:
This seems to be a mistake in the original voxforge data. I think it's easy to
fix it in the archive itself and then train.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-08
I think that both (Dictionary and Transcription) files are not mistaken in the
original VoxForge, I can read them without broken characters. How can I double
check it?
Please, sorry for by bad english.. maybe I misunderstood something. Thank you!
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Maybe the training script that creates prompts using awk breaks the text. You
might want to set locale on your machine to en_US.UTF-8 or to C. It could
affect awk behaviour.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-08
Same result as before. But don't worry I don't like to bother you.
Just one thing: the idea to create a new acoustic model is due to I am getting
a "Decoder" load time of seven seconds using PocketSphinx + Android (please
check my post number 1 and 3). But I think that it will be the same, due to it
will be only a new compilation using the same base. So... it is there any way
to reuse the current VoxForge in Spanish doing it like "hub4wsj_sc_8k" that
takes only 3 seconds to load? Maybe I am asking silly questions...
Just one thing: the idea to create a new acoustic model is due to I am
getting a "Decoder" load time of seven seconds using PocketSphinx + Android
(please check my post number 1 and 3). But I think that it will be the same,
due to it will be only a new compilation using the same base. So... it is
there any way to reuse the current VoxForge in Spanish doing it like
"hub4wsj_sc_8k" that takes only 3 seconds to load? Maybe I am asking silly
questions..
.
As said above, you need to train semi-continuous model from the Voxforge data.
Also you can convert mdef file to binary format with
pocketsphinx_mdef_convert. Then it will load faster.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Anonymous
-
2012-02-08
Thank you! I am already using a mdef file in binary format. There is about one
second difference with the mdef text file. So, I will try to create a new
model using a different computer.
Really, thank you very much for your time.
I was trying to edit my first post to put everything in order, but I can not.
I will maintain you updated with my progress.. if you need something from me,
just poke me :)
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello, my english is not very good, sorry for that. I will try to explain
myself. First a small briefing:
Platform: Android + PocketSphinx.
Language Model Parameters: voxforge_es_sphinx.cd_cont_1500 - v.0.1.1.
Dictionary: 244 Words.
Grammar: JSGF.
When I started working with Android + PocketSphinix the Decoder takes about
one/three second to load all stuff using "hub4wsj_sc_8k" and that was great.
Now I am using the VoxForge Language Model Parameters for spanish and I am
getting about five/seven seconds at time of load the Decoder. That is too much
time.
I know that my dictionary and JSGF file loads very quickly. So the problem is
inside the Language Model Paremeters.
Question is: why those differences between loading time?
Thanks in advance.
Regards,
Lilo.
Not really an answer but you only need to encounter that 5s/7s once. One thing
that I did is just hide that 5s in the loading screen they initially see when
running the app.
The spanish voxforge audio is also available to train your own model.
@eliasmajic thank you very much for your reply. By the nature of the
application, I can not wait the initial seven seconds. I know that I can add
all heavy load in a service executed at system boot and avoid the loading in
each launch of the application (this will be my last alternative).
About to train a new model I have doubts. Actually I am creating the
Dictionary and the JSGF in a dinamic way. So I don't know how to train a model
without a static content. Due to my bad english... my knowledges about that
are very poor. For that reason I like to know why one model is loaded faster
than other.
One more time, sorry for my bad english. I am not an smart monkey! I am an
average human :)
At this time, just with testing purposes I am trying to train a new Acoustic
Model using VoxForge stuff for spanish. But I'm getting the following Warnings
and Errors:
Initially, Dictionary and Transcription files was in UTF-8 but I was getting
"broken" characters in the log output.. so I set both files to ISO-8859-15.
Then all characters like áéíóú was showed fine, but I have the same errors.
Dictionary might require resorting after you converted it to another encoding.
It's recommended to use UTF-8 , not 8859-15.
The warning just tells you that the word is missing. All you need to do is to
make sure word is there.
Thank you for your reply. Initially I was using the original file (encoded
with UTF-8) but I got the following warnings:
It should be:
Dictionary extraction:
Transcription extraction:
This seems to be a mistake in the original voxforge data. I think it's easy to
fix it in the archive itself and then train.
I think that both (Dictionary and Transcription) files are not mistaken in the
original VoxForge, I can read them without broken characters. How can I double
check it?
Please, sorry for by bad english.. maybe I misunderstood something. Thank you!
Maybe the training script that creates prompts using awk breaks the text. You
might want to set locale on your machine to en_US.UTF-8 or to C. It could
affect awk behaviour.
Same result as before. But don't worry I don't like to bother you.
Just one thing: the idea to create a new acoustic model is due to I am getting
a "Decoder" load time of seven seconds using PocketSphinx + Android (please
check my post number 1 and 3). But I think that it will be the same, due to it
will be only a new compilation using the same base. So... it is there any way
to reuse the current VoxForge in Spanish doing it like "hub4wsj_sc_8k" that
takes only 3 seconds to load? Maybe I am asking silly questions...
Thank you Mr. Nickolay.
.
As said above, you need to train semi-continuous model from the Voxforge data.
Also you can convert mdef file to binary format with
pocketsphinx_mdef_convert. Then it will load faster.
Thank you! I am already using a mdef file in binary format. There is about one
second difference with the mdef text file. So, I will try to create a new
model using a different computer.
Really, thank you very much for your time.
I was trying to edit my first post to put everything in order, but I can not.
I will maintain you updated with my progress.. if you need something from me,
just poke me :)