From: Lee B. <lb...@nc...> - 2012-09-15 20:09:06
|
I am a student trying to come up to speed with both speech recognition and HMM's and the use of Kaldi. I was trying to run thru the kaldi tutorial using the LDC switchboard data. When running steps/compute_cmvn_stats.sh data/train exp/make_mfcc/train /mnt/kaldi/egs/swbd_experiment/s5/mfcc_dir the step compute-cmvn-stats --spk2utt=ark:data/train/spk2utt scp:data/train/feats.scp ark,scp:/mnt/kaldi/egs/swbd_experiment/s5/mfcc_dir/cmvn_train.ark,/mnt/kaldi/egs/swbd_experiment/s5/mfcc_dir/cmvn_train.scp fails with the following message ERROR (compute-cmvn-stats:SplitFilename():kaldi-io.cc:504) Cannot get offset from filename /mnt/kaldi/egs/swbd_experiment/s5/mfcc_dir/raw_mfcc_train.1.ark:4295001228 (possibly you compiled in 32-bit and have a >32-bit byte offset into a file; you'll have to compile 64-bit. ERROR (compute-cmvn-stats:SplitFilename():kaldi-io.cc:504) Cannot get offset from filename /mnt/kaldi/egs/swbd_experiment/s5/mfcc_dir/raw_mfcc_train.1.ark:4295001228 (possibly you compiled in 32-bit and have a >32-bit byte offset into a file; you'll have to compile 64-bit. In digging thru the install scripts, I couldnt see any references to compiling with 64-bit. I am running under cygwin. So 1) is there an FAQ that would cover these sorts of issues 2) could I get some guidance on how to get around this issue -- Regards Lee Baker |