From: Sakriani S. <ss...@is...> - 2013-11-18 04:34:25
Hi Daniel and Vassil,

Thank you for the response.

We started from the voxforge setup. Since we need to run our ASR on a
server and receive the input directly from the microphone, we currently use:
onlinebin/online-audio-server-decode-faster
with the following parameters:
beam=15.0
rt-min=0.5
rt-max=3.0

Our old system was based on the Julius decoder, but compared with Julius,
the response of Kaldi online decoding is really slow. Please let us know
how to make sure it stays within real time.

Thank you.
-Sakti-

On 2013/11/17 1:04, Vassil Panayotov wrote:
> Yes, it has some basic/experimental logic to automatically tweak the
> beam according to the current decoding real-time
> factor (--rt-min/--rt-max parameters), but I don't think anyone has
> tested how well it works in practice (my impression is that it's far
> from perfect).
> If this doesn't work for the original poster for some reason, the only
> other options that wouldn't require changing the code would be to
> tweak the beam or the maximum active states.
>
> Vassil
>
> On Sat, Nov 16, 2013 at 5:46 PM, Daniel Povey <dp...@gm...> wrote:
>> When you say the online decoding, you mean e.g. the voxforge setup?
>> The stuff in onlinebin/? I thought the online decoder had a
>> mechanism to make sure it stays within real time.
>> Dan
>>
>>
>> On Fri, Nov 15, 2013 at 12:18 AM, Sakriani Sakti <ss...@is...> wrote:
>>> Dear Kaldi-developers,
>>>
>>> First of all, we would like to join the Kaldi-users mailing list. Please
>>> kindly add me to the Kaldi-users mailing list.
>>>
>>> We are currently building a simultaneous speech translation system, in
>>> which the ASR component is built with Kaldi. For simultaneous speech
>>> translation, we really need real-time ASR decoding. Currently, the
>>> online decoding in Kaldi seems to respond very slowly, and we couldn't
>>> find a way to multi-thread the decoding as in the offline case. Is there
>>> any way to make the online decoding a little faster?
>>>
>>> Another thing: the ctm format file mainly outputs the recognized words
>>> with time information. Is there any way to get more detailed information,
>>> such as phones with time information or HMM states with time information,
>>> as in HTK decoding output?
>>>
>>> It would be great if you could help us with this.
>>> Thank you.
>>> Sincerely yours,
>>> Sakriani Sakti
>>>
>>> _______________________________________________
>>> Kaldi-developers mailing list
>>> Kal...@li...
>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers
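
For readers following the thread, the idea Vassil describes (tweaking the beam according to the current real-time factor, bounded by --rt-min/--rt-max) can be illustrated with a small sketch. This is not Kaldi's actual implementation: the function names, the 5% step size, and the beam limits below are all assumptions made for illustration. The real-time factor is taken here as processing time divided by audio duration.

```python
def real_time_factor(processing_seconds, audio_seconds):
    """Real-time factor: time spent decoding divided by audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def adjust_beam(beam, rtf, rt_min=0.5, rt_max=3.0,
                step=0.05, min_beam=1.0, max_beam=15.0):
    """Hypothetical beam update rule (NOT Kaldi's code).

    Narrow the beam when decoding runs slower than rt_max,
    widen it when decoding runs faster than rt_min,
    and leave it unchanged in between.
    """
    if rtf > rt_max:
        beam *= 1.0 - step   # too slow: prune more aggressively
    elif rtf < rt_min:
        beam *= 1.0 + step   # plenty of headroom: prune less
    # Keep the beam inside the assumed limits.
    return max(min_beam, min(max_beam, beam))


if __name__ == "__main__":
    beam = 15.0
    # (processing seconds, audio seconds) for a few hypothetical batches
    for proc, audio in [(8.0, 2.0), (7.0, 2.0), (3.0, 2.0), (0.5, 2.0)]:
        rtf = real_time_factor(proc, audio)
        beam = adjust_beam(beam, rtf)
        print(f"rtf={rtf:.2f} beam={beam:.3f}")
```

Under this sketch's rule, an upper bound of 3.0 means the beam is not narrowed until decoding takes more than three times the audio duration, so a lower upper bound or a smaller starting beam would make it react sooner. Whether the same holds for the actual decoder would need to be checked against its source.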