|
From: Jan T. <af...@ce...> - 2014-07-11 15:48:24
|
BTW, it might be the case that you might need to downsample to 44.1kHz and 16bits per sample. I think you can do this without any fear about accuracy/performance of the recognizer. y. On Fri, Jul 11, 2014 at 11:40 AM, Jan Trmal <af...@ce...> wrote: > Hi, type 65534 is some microsoft specific type used to add some specific extensions and previously unsupported features to the wave format. > > you can try to use sox to convert/sanitize the wav. > > y. > > > > On Fri, Jul 11, 2014 at 11:24 AM, Zibo Meng <mzb...@gm...> wrote: > >> Hi, >> >> I got another problem. >> >> When I tried make_mfcc.sh to create the feats.scp files it did not work. >> >> I checked the log file where it said some thing like: >> >> compute-mfcc-feats --verbose=2 --config=conf/mfcc.conf >> scp,p:exp/make_mfcc/train/wav_data.1.scp ark:- >> ERROR (compute-mfcc-feats:Read():wave-reader.cc:144) WaveData: can read >> only PCM data, audio_format is not 1: 65534 >> WARNING (compute-mfcc-feats:Read():feat/wave-reader.h:148) Exception >> caught in WaveHolder object (reading). >> WARNING (compute-mfcc-feats:LoadCurrent():util/kaldi-table-inl.h:232) >> TableReader: failed to load object from 'test.wav' >> >> Then I checked the attributes of my test.wav file which were as follows: >> Input File : 'test.wav' >> Channels : 1 >> Sample Rate : 48000 >> Precision : 24-bit >> Duration : 00:03:30.09 = 10084224 samples ~ 15756.6 CDDA sectors >> File Size : 30.3M >> Bit Rate : 1.15M >> Sample Encoding: 24-bit Signed Integer PCM >> >> Can you tell me what should I modify to my audio files. Thank you so much! >> >> Best, >> >> Zibo >> >> >> >> On Thu, Jul 10, 2014 at 3:37 PM, Zibo Meng <mzb...@gm...> wrote: >> >>> Hi, >>> >>> I am preparing the data for dnn training using my own data set. I >>> followed the instruction on http://kaldi.sourceforge.net/data_prep.html. >>> >>> I created the file "text" as the first 3 lines: >>> S002-U-000300-000470 OH >>> S002-U-000470-000630 I'D >>> S002-U-000630-000870 LIKE >>> >>> the wav.scp file: >>> S002-U <path to the corresponding wav file> >>> S002-O <path to the corresponding wav file> >>> S003-U <path to the corresponding wav file> >>> >>> and the utt2spk file: >>> S002-U-000300-000470 002-U >>> S002-U-000470-000630 002-U >>> S002-U-000630-000870 002-U >>> >>> Then I used utt2spk_to_spk2utt.pl to create the spk2utt file. >>> Everything went well until I tried to use the mak_mfcc.sh to create the >>> feats.scp file where I got the error message like: >>> >>> utils/validate_data_dir.sh: file data/utt2spk is not in sorted order or >>> has duplicates >>> >>> seems like my utt2spk file could not pass through the validation. >>> >>> Can any body help me out of here? Thank you so much. >>> >>> Best, >>> >>> Zibo >>> >> >> >> >> ------------------------------------------------------------------------------ >> >> >> _______________________________________________ >> Kaldi-users mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-users >> >> > |