Re: [Kaldi-users] Integration of Kaldi into a voice-to-voice translator

You have to be careful to give the online decoder the right options,
corresponding to the features that you used.  Check whether there are any
non-default options in your MFCC or PLP extraction, for instance, and
that the LDA window is set correctly.

Dan
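
One way to carry out the check described above is to diff the option set used at training time against the one passed at decoding time. The following is only a minimal sketch, assuming both sets can be written as `--name=value` lines (the layout of Kaldi config files). The helper names `parse_kaldi_conf` and `diff_options` are hypothetical, and the sample values are illustrative, not taken from any recipe in this thread.

```python
def parse_kaldi_conf(text):
    """Parse '--name=value' lines into a dict, ignoring comments and blanks."""
    options = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        name, _, value = line.lstrip("-").partition("=")
        options[name] = value
    return options


def diff_options(train_conf, decode_conf):
    """Return {name: (train_value, decode_value)} for options that differ.

    A value of None means the option is absent on that side, so the
    program's default would apply there.
    """
    train = parse_kaldi_conf(train_conf)
    decode = parse_kaldi_conf(decode_conf)
    return {
        name: (train.get(name), decode.get(name))
        for name in sorted(set(train) | set(decode))
        if train.get(name) != decode.get(name)
    }


# Illustrative values only; substitute the contents of your own config files.
TRAIN = "--use-energy=false   # non-default\n--sample-frequency=16000\n"
DECODE = "--sample-frequency=8000\n"

if __name__ == "__main__":
    for name, (a, b) in diff_options(TRAIN, DECODE).items():
        print(f"{name}: training={a!r} decoding={b!r}")
```

An option reported with None on one side is not necessarily a mismatch; it only means that side relies on the default, which still has to be compared by hand against the explicit value on the other side.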

On Tue, May 14, 2013 at 10:10 AM, Mailing list used for User Communication
and Updates <kal...@li...> wrote:

> As I said, on my data I get results like 70% WER. This is due to some bug,
> which I have not fixed yet.
>
> In the online demo I tried to use our test data (quite general in terms
> of LM) and test it against the online demo,
> because as far as I understood, the online demo was trained on the same
> data that are supplied as the test data.
>
> Basically, I was asking for the numbers because I need to know whether I
> still have a bug in my script or whether this is just normal performance.
>
> Thanks
>
> Ondra
>
>
>
>
> On 14 May 2013 15:39, Mailing list used for User Communication and Updates
> <kal...@li...> wrote:
>
>> Hi,
>>
>> I don't have specific numbers to share (as I said the online decoder
>> is not rigorously tested as far as I know). In principle I think the
>> online decoder should perform slightly worse than the offline versions
>> due to things like online CMN computation.
>>
>> What do you mean exactly when you say that the results for the online
>> demo are not so good? As far as I remember the WER for the test WAV
>> files was about 7%, which is not so bad as speech recognition goes
>> today. Of course the WER in this case is so low because the language
>> model is relatively small and perfectly matches the test files (the LM
>> is trained on the text corresponding to the audio book from which the
>> test utterances were taken - not the best practice of course).
>>
>> Vassil
>>
>> On Tue, May 14, 2013 at 2:59 PM, Mailing list used for User
>> Communication and Updates <kal...@li...> wrote:
>> > Hi Vassil,
>> >
>> > May I ask about the performance (in terms of WER/SER) of the online
>> > decoder?
>> >
>> > What is the expected WER for online-wav-gmm-decode-faster,
>> > if I reached the scores in the table below with gmm-latgen-faster?
>> > I would like to use the same trained models for
>> > online-wav-gmm-decode-faster as I used for gmm-latgen-faster
>> > (and lattice-best-path).
>> >
>> > exp             RT coef         WER
>> > tri3b_fmmi_b    11.33336625     19.42
>> > tri2b_mpe       1.2307175       21.22
>> > mono            2.8952          49.88
>> > tri3b_mmi       1.24198625      20.01
>> > tri1            2.398165        28.13
>> > tri2b_mmi       1.65662125      20.34
>> > tri2a           2.15838         28.43
>> > tri2b           1.451105        27.96
>> > tri3b           1.0596075       27.79
>> > tri3b_fmmi_c    11.4361875      19.13
>> > tri2b_mmi_b0.05 2.0089575       19.64
>> > tri3b_fmmi_d    16.1227625      19.5
>> >
>> > The names and the core settings of experiments are the same as in
>> > egs/voxforge/s5/run.sh
>> >
>> >
>> > With online-wav-gmm-decode-faster
>> > I reached really bad scores due to bugs (which I still need to fix),
>> > but even in the online demo the results are not so good.
>> >
>> > Thanks for any help
>> >
>> > Ondra Platek
>> >
>> >
>> > On 26 April 2013 15:35, Mailing list used for User Communication and
>> > Updates <kal...@li...> wrote:
>> >>
>> >> Hi,
>> >> as for taking input from a microphone you might want to have a look at
>> >> the (optional) online decoder. There is a simple demo script in
>> >> egs/voxforge/online-demo. There is also a version that can decode .wav
>> >> files without a separate feature extraction step. The code is not
>> >> extensively tested and lacks some polish and flexibility, but perhaps
>> >> you can use it as an example.
>> >>
>> >> Vassil
>> >>
>> >> On Fri, Apr 26, 2013 at 3:55 PM, Mailing list used for User
>> >> Communication and Updates <kal...@li...> wrote:
>> >> > Dear all,
>> >> > I'm working on the development of an R&D voice-to-voice module and I
>> >> > would like to use Kaldi for the transcription part (we made some tests
>> >> > with PocketSphinx but found that Kaldi is better suited to our task).
>> >> >
>> >> > The problem is that I'm not really an expert in speech-to-text (Kaldi
>> >> > models will be provided by another team) and I did not find scripts (or
>> >> > plugins) allowing an easy integration of Kaldi: how to manage streams
>> >> > (with GStreamer, for example)? How can we decode a single wav file? etc.
>> >> >
>> >> > Has anyone else already handled these problems? Are there any scripts
>> >> > to facilitate this task?
>> >> >
>> >> > Thanks in advance for your reply,
>> >> >
>> >> > Best regards
>> >> >
>> >> > _______________________________________________
>> >> > Kaldi-users mailing list
>> >> > Kal...@li...
>> >> > https://lists.sourceforge.net/lists/listinfo/kaldi-users
>> >> >