From: Vassil P. <vas...@gm...> - 2015-07-05 08:05:25
|
BTW, when preparing LibriSpeech, I've noticed that the quality of MP3 conversion can vary substantially, depending on the particular tool used. For example the output of mpg123(or maybe it was mpg321) was very noisy and the ASR WER was 10-15% absolute higher than when alternative MP3 decoders were used. When converting to 16kHz .wav ffmpeg cuts off the frequencies higher than 7kHz. So eventually I settled for mplayer. It preserves the frequency content in the 7-8kHz range and as far as I could tell the audio sounded a bit "closer" to the original recording, although I'm not sure if there is any measurable difference in ASR performance b/w ffmpeg and mplayer produced .wav-s. The versions of the tools I've tried were those shipped with Ubuntu 10.04 and 12.04, so the issues may be fixed in the more recent releases. Vassil On Fri, Jul 3, 2015 at 9:53 PM, Daniel Povey <dp...@gm...> wrote: > The sampling rate is critical, but the bitrate is not really critical- > just make sure it sounds OK without super-obvious artifacts. Vassil > (cc'd) will know what bitrate he encoded the Librispeech data with, > but matching this exactly is probably not important. > Dan > > > On Fri, Jul 3, 2015 at 10:45 AM, Jonathan L <jon...@gm...> > wrote: > > The data I want to train on is in MP3 format at a 128kbps bitrate and a > > 44.1kHz sample rate. The LibriSpeech data has a 16kHz sample rate, but > > doesn't seem to have a specified bitrate, When I convert the MP3 files > into > > 16kHz sample-rate WAV files, what bitrate should I convert them to? > > > > Is there anything else I should consider when converting the speech > files? > > > > On Mon, Jun 29, 2015 at 12:24 PM, Vijayaditya Peddinti > > <p.v...@gm...> wrote: > >> > >> You need to provide the egs directory, not exp directory. You can check > >> stage -3 of steps/nnet2/train_multisplice_accel2.sh to see how egs > directory > >> can be created from the alignment and data directories. > >> The context variables necessary for creating these examples can be found > >> in nnet_ms_a_online/conf/splice.conf file. > >> > >> Vijay > >> > >> On Mon, Jun 29, 2015 at 9:14 AM, Jonathan L <jon...@gm...> > >> wrote: > >>> > >>> The train_more*.sh scripts accept an 'exp' directory instead of a > >>> 'data/train' directory. Is there another script that would accept the > >>> 'data/train' directory as input instead? > >>> > >>> On Mon, Jun 29, 2015 at 12:08 PM, Vijayaditya Peddinti > >>> <p.v...@gm...> wrote: > >>>> > >>>> See the scripts steps/nnet2/train_more*.sh > >>>> > >>>> Vijay > >>>> > >>>> On Mon, Jun 29, 2015 at 9:02 AM, Jonathan L < > jon...@gm...> > >>>> wrote: > >>>>> > >>>>> I'm looking to further train an existing LibriSpeech nnet2_a_online > >>>>> model on a new dataset. > >>>>> > >>>>> I have prepared the files for this new dataset inside a data/train > >>>>> directory, as described in the Data Preparation tutorial. I want to > keep the > >>>>> nnet2_a_online model initialized to the parameters it learned from > training > >>>>> on LibriSpeech, but continue its training on this new dataset. Is > there a > >>>>> script that would allow me to specify the nnet2_a_online model and > the > >>>>> dataset's data/train directory as input in order to output a model > that has > >>>>> been trained more on this new dataset? > >>>>> > >>>>> > >>>>> > ------------------------------------------------------------------------------ > >>>>> Monitor 25 network devices or servers for free with OpManager! > >>>>> OpManager is web-based network management software that monitors > >>>>> network devices and physical & virtual servers, alerts via email & > sms > >>>>> for fault. Monitor 25 devices for free with no restriction. Download > >>>>> now > >>>>> http://ad.doubleclick.net/ddm/clk/292181274;119417398;o > >>>>> _______________________________________________ > >>>>> Kaldi-users mailing list > >>>>> Kal...@li... > >>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-users > >>>>> > >>>> > >>> > >> > > > > > > > ------------------------------------------------------------------------------ > > Don't Limit Your Business. Reach for the Cloud. > > GigeNET's Cloud Solutions provide you with the tools and support that > > you need to offload your IT needs and focus on growing your business. > > Configured For All Businesses. Start Your Cloud Today. > > https://www.gigenetcloud.com/ > > _______________________________________________ > > Kaldi-users mailing list > > Kal...@li... > > https://lists.sourceforge.net/lists/listinfo/kaldi-users > > > |