From: Mailing l. u. f. U. C. a. U. <kal...@li...> - 2013-05-14 16:10:00
|
You have to be careful to give the online decoder the right options, corresponding to the features that you used. See if there are any non-default options in your MFCC or PLP extraction, for instance, and check that the LDA window is set correctly.

Dan

On Tue, May 14, 2013 at 10:10 AM, Mailing list used for User Communication and Updates <kal...@li...> wrote:
> As I said, on my data I get results like 70% WER. This is due to some bug
> that I have not fixed yet.
>
> For the online demo, I tried to use our test set (quite general data in terms
> of LM) and test it against the online demo, because as far as I understood,
> the online demo was trained on the same data that are supplied as the test
> data.
>
> Basically, I was asking for the numbers because I need to know whether I
> still have a bug in my script or whether this is just normal performance.
>
> Thanks
>
> Ondra
>
> On 14 May 2013 15:39, Mailing list used for User Communication and Updates
> <kal...@li...> wrote:
>
>> Hi,
>>
>> I don't have specific numbers to share (as I said, the online decoder
>> is not rigorously tested as far as I know). In principle I think the
>> online decoder should perform slightly worse than the offline versions
>> due to things like online CMN computation.
>>
>> What do you mean exactly when you say that the results for the online
>> demo are not so good? As far as I remember the WER for the test WAV
>> files was about 7%, which is not so bad as today's speech recognition
>> goes. Of course the WER in this case is so low because the language
>> model is relatively small and perfectly matches the test files (the LM
>> is trained on the text corresponding to the audio book from which the
>> test utterances were taken - not the best practice of course).
>>
>> Vassil
>>
>> On Tue, May 14, 2013 at 2:59 PM, Mailing list used for User
>> Communication and Updates <kal...@li...> wrote:
>> > Hi Vassil,
>> >
>> > May I ask about the performance (in terms of WER/SER) of the online
>> > decoder?
>> >
>> > What is the expected WER for online-wav-gmm-decode-faster,
>> > given the scores I reached with gmm-latgen-faster in the table below?
>> > I would like to use the same trained models for
>> > online-wav-gmm-decode-faster as I used for gmm-latgen-faster
>> > (and lattice-best-path).
>> >
>> > exp               RT coef       WER
>> > tri3b_fmmi_b      11.33336625   19.42
>> > tri2b_mpe         1.2307175     21.22
>> > mono              2.8952        49.88
>> > tri3b_mmi         1.24198625    20.01
>> > tri1              2.398165      28.13
>> > tri2b_mmi         1.65662125    20.34
>> > tri2a             2.15838       28.43
>> > tri2b             1.451105      27.96
>> > tri3b             1.0596075     27.79
>> > tri3b_fmmi_c      11.4361875    19.13
>> > tri2b_mmi_b0.05   2.0089575     19.64
>> > tri3b_fmmi_d      16.1227625    19.5
>> >
>> > The names and the core settings of the experiments are the same as in
>> > egs/voxforge/s5/run.sh
>> >
>> > With online-wav-gmm-decode-faster
>> > I reached really bad scores due to bugs (which I still need to fix),
>> > but the results in the online demo are not so nice either.
>> >
>> > Thanks for any help
>> >
>> > Ondra Platek
>> >
>> > On 26 April 2013 15:35, Mailing list used for User Communication and
>> > Updates <kal...@li...> wrote:
>> >>
>> >> Hi,
>> >> as for taking input from a microphone you might want to have a look at
>> >> the (optional) online decoder. There is a simple demo script in
>> >> egs/voxforge/online-demo. There is also a version that can decode .wav
>> >> files without a separate feature extraction step. The code is not
>> >> extensively tested and lacks some polish and flexibility, but perhaps
>> >> you can use it as an example.
>> >>
>> >> Vassil
>> >>
>> >> On Fri, Apr 26, 2013 at 3:55 PM, Mailing list used for User
>> >> Communication and Updates <kal...@li...> wrote:
>> >> > Dear all,
>> >> > I'm working on the development of an R&D voice-to-voice module and I
>> >> > would like to use Kaldi for the transcription part (we made some tests
>> >> > with PocketSphinx but we found that Kaldi is better suited to our task).
>> >> >
>> >> > The problem is that I'm not really an expert in speech-to-text (Kaldi
>> >> > models will be provided by another team) and I did not find scripts (or
>> >> > plugins) allowing an easy integration of Kaldi: how to manage streams
>> >> > (with GStreamer, for example)? How can we decode a single wav file? etc.
>> >> >
>> >> > Has anyone else already handled these problems? Are there any scripts
>> >> > to facilitate this task?
>> >> >
>> >> > Thanks in advance for your reply,
>> >> >
>> >> > Best regards
>> >> >
>> >> > _______________________________________________
>> >> > Kaldi-users mailing list
>> >> > Kal...@li...
>> >> > https://lists.sourceforge.net/lists/listinfo/kaldi-users
|
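Dan's advice at the top of this thread (give the online decoder options that match the ones used for feature extraction) can be checked mechanically. The following is only a rough sketch, not part of Kaldi: it assumes the options are kept in Kaldi-style config text with one `--name=value` per line and `#` comments, and the helper names `parse_kaldi_config` and `diff_options`, as well as the example option values, are made up for illustration. It does not know any tool's default values, so an option missing on one side is simply reported as `None` and still has to be compared against the default by hand.

```python
def parse_kaldi_config(text):
    """Parse '--name=value' lines into a dict, skipping blanks and '#' comments."""
    options = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        if not line.startswith("--"):
            raise ValueError(f"unexpected config line: {raw!r}")
        name, _, value = line[2:].partition("=")
        options[name.strip()] = value.strip()
    return options


def diff_options(train_text, online_text):
    """Return {name: (train_value, online_value)} for each option that differs.

    An option that is absent on one side is reported as None on that side.
    """
    train = parse_kaldi_config(train_text)
    online = parse_kaldi_config(online_text)
    return {
        name: (train.get(name), online.get(name))
        for name in sorted(set(train) | set(online))
        if train.get(name) != online.get(name)
    }


# Hypothetical example contents; real values depend on your own setup.
TRAIN_CONF = """
--use-energy=false
--num-mel-bins=23
"""
ONLINE_CONF = """
--use-energy=false   # same as training
--num-mel-bins=40
--sample-frequency=8000
"""

mismatches = diff_options(TRAIN_CONF, ONLINE_CONF)
for name, (train_value, online_value) in mismatches.items():
    print(f"--{name}: training={train_value} online={online_value}")
```

Any option this reports is a candidate explanation for a large WER gap between offline and online decoding, although it only covers what is written in the two config texts, not options passed directly on the command line.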