You can subscribe to this list here.
2011 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(4) |
Jul
|
Aug
|
Sep
(1) |
Oct
(4) |
Nov
(1) |
Dec
(14) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2012 |
Jan
(1) |
Feb
(8) |
Mar
|
Apr
(1) |
May
(3) |
Jun
(13) |
Jul
(7) |
Aug
(11) |
Sep
(6) |
Oct
(14) |
Nov
(16) |
Dec
(1) |
2013 |
Jan
(3) |
Feb
(8) |
Mar
(17) |
Apr
(21) |
May
(27) |
Jun
(11) |
Jul
(11) |
Aug
(21) |
Sep
(39) |
Oct
(17) |
Nov
(39) |
Dec
(28) |
2014 |
Jan
(36) |
Feb
(30) |
Mar
(35) |
Apr
(17) |
May
(22) |
Jun
(28) |
Jul
(23) |
Aug
(41) |
Sep
(17) |
Oct
(10) |
Nov
(22) |
Dec
(56) |
2015 |
Jan
(30) |
Feb
(32) |
Mar
(37) |
Apr
(28) |
May
(79) |
Jun
(18) |
Jul
(35) |
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
|
From: Daniel P. <dp...@gm...> - 2013-05-09 02:07:32
|
Thanks. This issue has been fixed. I suspect you are using an out-of-date version of Kaldi. If you have done "svn up" and are still at revision 2256 or thereabouts, it means you have not updated your repository to the "new" sourceforge. See instructions at kaldi.sf.net on how to change. Dan On Wed, May 8, 2013 at 9:51 PM, Chao Weng <cw...@gm...> wrote: > Hi All, > > I just finished setting up the GPGPU environment and running the > experiments in the Kaldi. In the beginning, I could not compile through all > the cuda related programs with "undefined references" link errors, now I > found a workaround by putting the cuda related ld flags "-lcublas" and > "-lcudart" in the end (i.e. after -lpthread). My system is Ubuntu 12.04 > with gcc 4.6.3. Just sending this in case you have the same problem. > > Bests, > Chao > > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Chao W. <cw...@gm...> - 2013-05-09 01:51:35
|
Hi All, I just finished setting up the GPGPU environment and running the experiments in the Kaldi. In the beginning, I could not compile through all the cuda related programs with "undefined references" link errors, now I found a workaround by putting the cuda related ld flags "-lcublas" and "-lcudart" in the end (i.e. after -lpthread). My system is Ubuntu 12.04 with gcc 4.6.3. Just sending this in case you have the same problem. Bests, Chao |
From: Daniel P. <dp...@gm...> - 2013-05-08 14:34:44
|
Actually, the problem might also be that the "make install" part of the OpenFST installation failed, because there should be a file openfst-1.3.2/lib/openfst.a Type "make openfst_tgt" in the tools/ directory and see what happens Dan On Wed, May 8, 2013 at 10:30 AM, Daniel Povey <dp...@gm...> wrote: > There should be a soft link from "openfst" to "openfst-1.3.2". See if it > exists and try to figure out why not, if it does not. > Dan > > > On Wed, May 8, 2013 at 10:14 AM, Mailing list used for User Communication > and Updates <kal...@li...> wrote: > >> >> >> On 05/08/2013 10:10 AM, Mailing list used for User Communication and >> Updates wrote: >> > On Wed, May 8, 2013 at 2:52 PM, Mailing list used for User >> > Communication and Updates <kal...@li...> wrote: >> >> Using Opensuse 12.3 64 bit, kernel 3.9. >> >> >> >> 1. I issued svn co svn://svn.code.sf.net/p/kaldi/code/trunkkaldi-trunk >> >> 2. download appeared to finish correctly >> >> 3. examined INSTALL, went into dir /tools and did make, finished >> without >> >> error as far as I can tell >> >> 4. then ../src and ./configure, which stops at >> >> >> >> $ ./configure >> >> Configuring ... >> >> Checking OpenFST library in /home/colin/kaldi-trunk/tools/openfst ... >> >> Checking OpenFst library was patched. >> >> Doing OS specific configurations ... >> >> ***configure failed: Static OpenFST library not found: See >> >> ../tools/INSTALL *** >> > >> > Could you check to see if you have the following file >> > kaldi-trunk/tools/openfst/lib/libfst.a? Normally typing make in the >> > tools directory should create it, unless there was some error that >> > didn't get reported. >> > >> >> $ find ./ -name libfst.a >> ./tools/openfst-1.3.2/lib64/libfst.a >> ./tools/openfst-1.3.2/src/lib/.libs/libfst.a >> >> -- >> --- >> Colin Beckingham >> 613-454-5369 >> http://www.it4gh.com >> >> >> ------------------------------------------------------------------------------ >> Learn Graph Databases - Download FREE O'Reilly Book >> "Graph Databases" is the definitive new guide to graph databases and >> their applications. This 200-page book is written by three acclaimed >> leaders in the field. The early access version is available now. >> Download your free book today! http://p.sf.net/sfu/neotech_d2d_may >> _______________________________________________ >> Kaldi-users mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-users >> > > |
From: Arnab G. <ar...@gm...> - 2013-05-08 14:33:55
|
On Wed, May 8, 2013 at 3:14 PM, Mailing list used for User Communication and Updates <kal...@li...> wrote: >>> Using Opensuse 12.3 64 bit, kernel 3.9. >>> $ ./configure >>> Configuring ... >>> Checking OpenFST library in /home/colin/kaldi-trunk/tools/openfst ... >>> Checking OpenFst library was patched. >>> Doing OS specific configurations ... >>> ***configure failed: Static OpenFST library not found: See >>> ../tools/INSTALL *** > > $ find ./ -name libfst.a > ./tools/openfst-1.3.2/lib64/libfst.a > ./tools/openfst-1.3.2/src/lib/.libs/libfst.a OK, a simple fix will be to create a symlink for libfst.a in openfst-1.3.2/lib. Also, tools/openfst-1.3.2 itself should be symlinked as tools/openfst -- if not, let us know. But I am surprised that the directory is lib64 an not lib. This could be an OS-specific issue that I cannot recreate. -Arnab |
From: Daniel P. <dp...@gm...> - 2013-05-08 14:30:55
|
There should be a soft link from "openfst" to "openfst-1.3.2". See if it exists and try to figure out why not, if it does not. Dan On Wed, May 8, 2013 at 10:14 AM, Mailing list used for User Communication and Updates <kal...@li...> wrote: > > > On 05/08/2013 10:10 AM, Mailing list used for User Communication and > Updates wrote: > > On Wed, May 8, 2013 at 2:52 PM, Mailing list used for User > > Communication and Updates <kal...@li...> wrote: > >> Using Opensuse 12.3 64 bit, kernel 3.9. > >> > >> 1. I issued svn co svn://svn.code.sf.net/p/kaldi/code/trunk kaldi-trunk > >> 2. download appeared to finish correctly > >> 3. examined INSTALL, went into dir /tools and did make, finished without > >> error as far as I can tell > >> 4. then ../src and ./configure, which stops at > >> > >> $ ./configure > >> Configuring ... > >> Checking OpenFST library in /home/colin/kaldi-trunk/tools/openfst ... > >> Checking OpenFst library was patched. > >> Doing OS specific configurations ... > >> ***configure failed: Static OpenFST library not found: See > >> ../tools/INSTALL *** > > > > Could you check to see if you have the following file > > kaldi-trunk/tools/openfst/lib/libfst.a? Normally typing make in the > > tools directory should create it, unless there was some error that > > didn't get reported. > > > > $ find ./ -name libfst.a > ./tools/openfst-1.3.2/lib64/libfst.a > ./tools/openfst-1.3.2/src/lib/.libs/libfst.a > > -- > --- > Colin Beckingham > 613-454-5369 > http://www.it4gh.com > > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-users mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-users > |
From: Arnab G. <ar...@gm...> - 2013-05-08 14:10:47
|
On Wed, May 8, 2013 at 2:52 PM, Mailing list used for User Communication and Updates <kal...@li...> wrote: > Using Opensuse 12.3 64 bit, kernel 3.9. > > 1. I issued svn co svn://svn.code.sf.net/p/kaldi/code/trunk kaldi-trunk > 2. download appeared to finish correctly > 3. examined INSTALL, went into dir /tools and did make, finished without > error as far as I can tell > 4. then ../src and ./configure, which stops at > > $ ./configure > Configuring ... > Checking OpenFST library in /home/colin/kaldi-trunk/tools/openfst ... > Checking OpenFst library was patched. > Doing OS specific configurations ... > ***configure failed: Static OpenFST library not found: See > ../tools/INSTALL *** Could you check to see if you have the following file kaldi-trunk/tools/openfst/lib/libfst.a? Normally typing make in the tools directory should create it, unless there was some error that didn't get reported. > 5. Checked tools install, it just tells me to do make. I did a search on > "openfst*" to find a library or executable on the system, but outside > the download dirs nothing available. No, it's not going to be in the systems directories, but just in the tools directory. > 6. attempted to compile as root (bad, but I thought maybe compile needed > to be able to write somewhere) no difference. You don't need to do anything as root. > 7. attempted to define --fst-root but no change to error. That's only helpful if you already have a OpenFST installation that you want to use. > Any suggestions? > > -- > --- > Colin Beckingham > http://www.it4gh.com > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-users mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-users |
From: Daniel P. <dp...@gm...> - 2013-05-07 17:46:46
|
Sorry-- you'll have to learn some kind of UNIX if you want to do this. Dan On Tue, May 7, 2013 at 10:16 AM, Arnab Ghoshal <ar...@gm...> wrote: > I doubt anyone is maintaining Kaldi for Windows. Did you try to install > using cygwin? Which part of the installation didn't work? > -Arnab > > > On Tue, May 7, 2013 at 9:54 AM, ZEBIBA ABDULKERIM < > zeb...@ya...> wrote: > >> Hello Dear >> >> i am trying to use kaldi for speeech recognition and unable to install in >> window 7 so is there any link or manual for this? >> >> kind regards >> zebiba >> >> >> ------------------------------------------------------------------------------ >> Learn Graph Databases - Download FREE O'Reilly Book >> "Graph Databases" is the definitive new guide to graph databases and >> their applications. This 200-page book is written by three acclaimed >> leaders in the field. The early access version is available now. >> Download your free book today! http://p.sf.net/sfu/neotech_d2d_may >> _______________________________________________ >> Kaldi-developers mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> >> > > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Arnab G. <ar...@gm...> - 2013-05-07 14:17:16
|
I doubt anyone is maintaining Kaldi for Windows. Did you try to install using cygwin? Which part of the installation didn't work? -Arnab On Tue, May 7, 2013 at 9:54 AM, ZEBIBA ABDULKERIM < zeb...@ya...> wrote: > Hello Dear > > i am trying to use kaldi for speeech recognition and unable to install in > window 7 so is there any link or manual for this? > > kind regards > zebiba > > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Arnab G. <ar...@gm...> - 2013-05-07 14:11:03
|
You could call gmm-init-mono with the --bimary=false flag in which case the model file is saved as ASCII. Or you could use gmm-copy with the same flag. BTW, HCompV doesn't do what you said it does. You probably meant HHEd. But in general, it's best to understand Kaldi by itself and not try to map everything to the HTK world. -Arnab On Tue, May 7, 2013 at 12:11 PM, Arif Khan <ife...@gm...> wrote: > Hi, > > I am using kaldi for Aurora 2 experiments which are based on word based > training and recognition and not Monophones. > > My question is after executing the following command. > "Usage: gmm-init-mono <topology-in> <dim> <model-out> <tree-out> \n". > How can we view the contents of the model file "<model-out>" created. > > > In HTK we have CompV module, which copies the topology of hmm for different > phones/words from a single topology file and are > saved in a text format and we can view the hmm for different phones/words. > Do the same things happens after the gmm-inint-mono command. How can we view > this. > > Any help will be highly appreciated. > > Thanks, > Arif > > > > ------------------------------------------------------------------------------ > Learn Graph Databases - Download FREE O'Reilly Book > "Graph Databases" is the definitive new guide to graph databases and > their applications. This 200-page book is written by three acclaimed > leaders in the field. The early access version is available now. > Download your free book today! http://p.sf.net/sfu/neotech_d2d_may > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |
From: Arif K. <ife...@gm...> - 2013-05-07 11:11:30
|
Hi, I am using kaldi for Aurora 2 experiments which are based on word based training and recognition and not Monophones. My question is after executing the following command. "Usage: gmm-init-mono <topology-in> <dim> <model-out> <tree-out> \n". How can we view the contents of the model file "<model-out>" created. In HTK we have CompV module, which copies the topology of hmm for different phones/words from a single topology file and are saved in a text format and we can view the hmm for different phones/words. Do the same things happens after the gmm-inint-mono command. How can we view this. Any help will be highly appreciated. Thanks, Arif |
From: ZEBIBA A. <zeb...@ya...> - 2013-05-07 08:54:50
|
Hello Dear i am trying to use kaldi for speeech recognition and unable to install in window 7 so is there any link or manual for this? kind regards zebiba |
From: Daniel P. <dp...@gm...> - 2013-05-03 02:27:20
|
As written, it's primarily a command line tool, but it's designed in such a way that it should be possible to take the code and put it into other apps. But at the present time, you would have to have a fair amount of both speech expertise and familiarity with Kaldi, in order to be able to do this. Dan On Thu, May 2, 2013 at 10:25 PM, Wesley Jackson <fel...@gm...>wrote: > > Thanks Dan, I was able to modify the script to recognize my Darwin > version. Also, when I checked out 'kaldi-stable', this wasn't a problem; I > must have originally downloaded a different version. > > Also, I was wondering if Kaldi is primarily a command-line tool or if it > can be used in XCode apps, or communicate with web apps? > > Thank you! > > *Wes Jackson* > wes...@br... > fel...@gm... > Mobile: *559•760•5017* > *www.spectralliaisons.com* > > Research Associate > Brain Plasticity, Inc. > > On May 2, 2013, at 6:15 PM, Daniel Povey <dp...@gm...> wrote: > > It's possible that it will work for that version but the configuration > script just doesn't know that, so you could possibly just modify it to > accept this version and treat it as an earlier one. Arnab may know better. > Dan > > > On Wed, May 1, 2013 at 2:55 PM, Wes Jackson <fel...@gm...>wrote: > >> >> Hello, I installed tools, but when I try to configure /src, I get an >> error saying 'OS X version '10.8' not supported'. Is there anything I can >> do to install Kaldi for Mac? >> >> Thank you! >> >> -- >> >> *Wes Jackson* >> >> wes...@br... >> >> *www.spectralliaisons.com* >> >> >> Research Associate >> >> Brain Plasticity, Inc. >> * >> * >> >> >> ------------------------------------------------------------------------------ >> Get 100% visibility into Java/.NET code with AppDynamics Lite >> It's a free troubleshooting tool designed for production >> Get down to code-level detail for bottlenecks, with <2% overhead. >> Download for free and get started troubleshooting in minutes. >> http://p.sf.net/sfu/appdyn_d2d_ap2 >> _______________________________________________ >> Kaldi-developers mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> >> > > |
From: Wesley J. <fel...@gm...> - 2013-05-03 02:25:53
|
Thanks Dan, I was able to modify the script to recognize my Darwin version. Also, when I checked out 'kaldi-stable', this wasn't a problem; I must have originally downloaded a different version. Also, I was wondering if Kaldi is primarily a command-line tool or if it can be used in XCode apps, or communicate with web apps? Thank you! Wes Jackson wes...@br... fel...@gm... Mobile: 559•760•5017 www.spectralliaisons.com Research Associate Brain Plasticity, Inc. On May 2, 2013, at 6:15 PM, Daniel Povey <dp...@gm...> wrote: > It's possible that it will work for that version but the configuration script just doesn't know that, so you could possibly just modify it to accept this version and treat it as an earlier one. Arnab may know better. > Dan > > > On Wed, May 1, 2013 at 2:55 PM, Wes Jackson <fel...@gm...> wrote: > > Hello, I installed tools, but when I try to configure /src, I get an error saying 'OS X version '10.8' not supported'. Is there anything I can do to install Kaldi for Mac? > > Thank you! > > -- > Wes Jackson > > wes...@br... > > www.spectralliaisons.com > > > > Research Associate > > Brain Plasticity, Inc. > > > > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite > It's a free troubleshooting tool designed for production > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://p.sf.net/sfu/appdyn_d2d_ap2 > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Daniel P. <dp...@gm...> - 2013-05-03 01:18:18
|
If you look in the standard recipes, there is a prototype topology somewhere e.g. in conf/topo.proto or somewhere like that that you could edit if you want to have more than 3 states. Regarding things like adding states for silence and short pause-- it sounds to me like you are trying to "literally translate" a HTK recipe. This is not a good idea, as Kaldi is a different toolkit with different ways of doing things. It will just do the right thing with regard to the number of Gaussians per state, so you don't have to really do anything. Dan On Thu, May 2, 2013 at 8:58 AM, Arif Khan <ife...@gm...> wrote: > Hi, > > I want to run Aurora 2.0 experiments with Kaldi. For that I need to model > 18 state hmm for digits. and 5 state hmm for silence and 3 state hmm for > short pause (between two consecutive digits). > > My question is: > a) How we tie/share a state among different hmm models. > > b) I want to use iterative training, therefore will have to manipulate hmm > in the middle of training for adding states for short pause and silence and > increasing Gaussian mixtures per state, so How > we modify hmm during the middle of training. In HTK we use HHED module to > do this. > > Any help or reference material would be highly appreciated. > > > Best regards, > Arif > > > > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite > It's a free troubleshooting tool designed for production > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://p.sf.net/sfu/appdyn_d2d_ap2 > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Daniel P. <dp...@gm...> - 2013-05-03 01:15:39
|
It's possible that it will work for that version but the configuration script just doesn't know that, so you could possibly just modify it to accept this version and treat it as an earlier one. Arnab may know better. Dan On Wed, May 1, 2013 at 2:55 PM, Wes Jackson <fel...@gm...> wrote: > > Hello, I installed tools, but when I try to configure /src, I get an error > saying 'OS X version '10.8' not supported'. Is there anything I can do to > install Kaldi for Mac? > > Thank you! > > -- > > *Wes Jackson* > > wes...@br... > > *www.spectralliaisons.com* > > > Research Associate > > Brain Plasticity, Inc. > * > * > > > ------------------------------------------------------------------------------ > Get 100% visibility into Java/.NET code with AppDynamics Lite > It's a free troubleshooting tool designed for production > Get down to code-level detail for bottlenecks, with <2% overhead. > Download for free and get started troubleshooting in minutes. > http://p.sf.net/sfu/appdyn_d2d_ap2 > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Arif K. <ife...@gm...> - 2013-05-02 12:58:41
|
Hi, I want to run Aurora 2.0 experiments with Kaldi. For that I need to model 18 state hmm for digits. and 5 state hmm for silence and 3 state hmm for short pause (between two consecutive digits). My question is: a) How we tie/share a state among different hmm models. b) I want to use iterative training, therefore will have to manipulate hmm in the middle of training for adding states for short pause and silence and increasing Gaussian mixtures per state, so How we modify hmm during the middle of training. In HTK we use HHED module to do this. Any help or reference material would be highly appreciated. Best regards, Arif |
From: Wes J. <fel...@gm...> - 2013-05-01 18:55:24
|
Hello, I installed tools, but when I try to configure /src, I get an error saying 'OS X version '10.8' not supported'. Is there anything I can do to install Kaldi for Mac? Thank you! -- *Wes Jackson* wes...@br... *www.spectralliaisons.com* Research Associate Brain Plasticity, Inc. * * |
From: Arnab G. <ar...@gm...> - 2013-04-26 11:14:44
|
On Fri, Apr 26, 2013 at 8:52 AM, Sakriani Sakti <ss...@is...> wrote: > Dear Kaldi Developers, > > I'm very new to Kaldi. Currently, I finished re-do the wsj recipe. > My questions are: > - How to generate the Kaldi result in ctm format? Look at trunk/egs/swbd/s5b/local/score_sclite.sh > - How to read the *.ali files? Use ./bin/show-alignments > - If I want to try is to re-train the WSJ model with new data, and > we already have mlf files (phone and word with time information) of > new data generated from HTK, is there any way to convert it to Kaldi > alignment files format? Your best bet will be to extract just the raw word transcript (forget about the time information) and run the kaldi recipe from start. You could use ./featbin/copy-feats to convert HTK features to Kaldi format. > It would be great if you could help me for this. > Thank you. > Sincerely yours, > Sakriani Sakti > > ------------------------------------------------------------------------------ > Try New Relic Now & We'll Send You this Cool Shirt > New Relic is the only SaaS-based application performance monitoring service > that delivers powerful full stack analytics. Optimize and monitor your > browser, app, & servers with just a few lines of code. Try New Relic > and get this awesome Nerd Life shirt! http://p.sf.net/sfu/newrelic_d2d_apr > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers |
From: Sakriani S. <ss...@is...> - 2013-04-26 08:10:26
|
Dear Kaldi Developers, I'm very new to Kaldi. Currently, I finished re-do the wsj recipe. My questions are: - How to generate the Kaldi result in ctm format? - How to read the *.ali files? - If I want to try is to re-train the WSJ model with new data, and we already have mlf files (phone and word with time information) of new data generated from HTK, is there any way to convert it to Kaldi alignment files format? It would be great if you could help me for this. Thank you. Sincerely yours, Sakriani Sakti |
From: Arnab G. <ar...@gm...> - 2013-04-19 12:30:17
|
OK, used -1 to mean "take till end of file". Instead of using a bool to accept larger segments at the end it's now a float (time in secs) till which an overshooting segment will be accepted. -Arnab On Thu, Apr 18, 2013 at 7:38 PM, Nagendra Kumar Goel <nag...@go...> wrote: > Arnab, > I prefer not to use soxi as its an overkill sometimes. Sometimes the data > may not even be in wav format (sure will convert before using feature > extraction but that's a different pipe). > How about if we make the syntax requirements more strict - like require the > value to be exactly -1. The only issue will be that it's loaded as float, > but we could take the difference and require that to be very small. This > will help you catch bugs in your scripts early on while keeping me safe. > > I recall earlier there was some data that had incorrect segmentation (like > end time was rounded off), causing scripts to unnecessarily fail for some > segments. However that data has been cleaned up. > > Nagendra > > -----Original Message----- > From: Arnab Ghoshal [mailto:ar...@gm...] > Sent: Thursday, April 18, 2013 2:03 PM > To: Nagendra Kumar Goel > Cc: Daniel Povey; kal...@li... > Subject: Re: [Kaldi-developers] extract-segments > > The reason I don't like the special value is that there is a check to reject > segments that are too small. This is a command line option and is visible to > the user. The special value (in the current code it's really an interval) is > hidden and one can only know about it by reading the code. But the hidden > option has a higher priority than the visible option. So while it is > reasonable for a user to expect any segments with invalid start and end > times (i.e. start >= end) to be rejected, sometimes the whole file may > actually get included instead. > This is, in fact, how we found the problem-- a scripting bug caused some end > times to be 0, which went undetected till some process way down the line > died due to a very big segment that shouldn't have been there. > > There is also an option to accept invalid end times (false by default) and I > am not sure what is the reason to have that functionality. > > The way I would have solved your particular problem is to get the start > (which will be 0) and end times for the single utterance files, while > keeping the segments format unchanged. You could use soxi to get the end > time. > > Let me know if this works for you. > > -Arnab > > On Thu, Apr 18, 2013 at 6:35 PM, Nagendra Kumar Goel > <nag...@go...> wrote: >> I have been using this to mix in data that is segmented with data that >> is sentence by sentence files. I didn't care if its 0 or -1. >> >> Is there a specific reason you don't like it? It solves a real problem >> for me. >> >> >> >> From: Daniel Povey [mailto:dp...@gm...] >> Sent: Thursday, April 18, 2013 1:32 PM >> To: Arnab Ghoshal; Nagendra Kumar Goel >> Cc: kal...@li... >> Subject: Re: [Kaldi-developers] extract-segments >> >> >> >> I think Nagendra may have been using this, he should chime in. >> Dan >> >> >> >> >> >> On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> wrote: >> >> Hi all, >> >> we just noticed that there is an (unmentioned) assumption in >> extract-segments.cc that an end time of (0, -1] in the segments file >> means "include till the end of the file". But there are additional >> logical bugs that causes an end time of 0 to have the same effect. I >> do not like having this special value of the end time and plan to >> remove it. But is there anybody who has a good reason to keep such a >> functionality? >> >> -Arnab >> >> ---------------------------------------------------------------------- >> -------- Precog is a next-generation analytics platform capable of >> advanced analytics on semi-structured data. The platform includes APIs >> for building apps and a phenomenal toolset for data science. >> Developers can use our toolset for easy data analysis & visualization. >> Get a free account! >> http://www2.precog.com/precogplatform/slashdotnewsletter >> _______________________________________________ >> Kaldi-developers mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> >> > |
From: Daniel P. <dp...@gm...> - 2013-04-18 18:49:08
|
-1 is exactly representable as float. I'd say make it exactly -1. It's not pretty, but I don't see it hurting anyone either. Dan On Thu, Apr 18, 2013 at 2:38 PM, Nagendra Kumar Goel < nag...@go...> wrote: > Arnab, > I prefer not to use soxi as its an overkill sometimes. Sometimes the > data > may not even be in wav format (sure will convert before using feature > extraction but that's a different pipe). > How about if we make the syntax requirements more strict - like require the > value to be exactly -1. The only issue will be that it's loaded as float, > but we could take the difference and require that to be very small. This > will help you catch bugs in your scripts early on while keeping me safe. > > I recall earlier there was some data that had incorrect segmentation (like > end time was rounded off), causing scripts to unnecessarily fail for some > segments. However that data has been cleaned up. > > Nagendra > > -----Original Message----- > From: Arnab Ghoshal [mailto:ar...@gm...] > Sent: Thursday, April 18, 2013 2:03 PM > To: Nagendra Kumar Goel > Cc: Daniel Povey; kal...@li... > Subject: Re: [Kaldi-developers] extract-segments > > The reason I don't like the special value is that there is a check to > reject > segments that are too small. This is a command line option and is visible > to > the user. The special value (in the current code it's really an interval) > is > hidden and one can only know about it by reading the code. But the hidden > option has a higher priority than the visible option. So while it is > reasonable for a user to expect any segments with invalid start and end > times (i.e. start >= end) to be rejected, sometimes the whole file may > actually get included instead. > This is, in fact, how we found the problem-- a scripting bug caused some > end > times to be 0, which went undetected till some process way down the line > died due to a very big segment that shouldn't have been there. > > There is also an option to accept invalid end times (false by default) and > I > am not sure what is the reason to have that functionality. > > The way I would have solved your particular problem is to get the start > (which will be 0) and end times for the single utterance files, while > keeping the segments format unchanged. You could use soxi to get the end > time. > > Let me know if this works for you. > > -Arnab > > On Thu, Apr 18, 2013 at 6:35 PM, Nagendra Kumar Goel > <nag...@go...> wrote: > > I have been using this to mix in data that is segmented with data that > > is sentence by sentence files. I didn't care if its 0 or -1. > > > > Is there a specific reason you don't like it? It solves a real problem > > for me. > > > > > > > > From: Daniel Povey [mailto:dp...@gm...] > > Sent: Thursday, April 18, 2013 1:32 PM > > To: Arnab Ghoshal; Nagendra Kumar Goel > > Cc: kal...@li... > > Subject: Re: [Kaldi-developers] extract-segments > > > > > > > > I think Nagendra may have been using this, he should chime in. > > Dan > > > > > > > > > > > > On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> > wrote: > > > > Hi all, > > > > we just noticed that there is an (unmentioned) assumption in > > extract-segments.cc that an end time of (0, -1] in the segments file > > means "include till the end of the file". But there are additional > > logical bugs that causes an end time of 0 to have the same effect. I > > do not like having this special value of the end time and plan to > > remove it. But is there anybody who has a good reason to keep such a > > functionality? > > > > -Arnab > > > > ---------------------------------------------------------------------- > > -------- Precog is a next-generation analytics platform capable of > > advanced analytics on semi-structured data. The platform includes APIs > > for building apps and a phenomenal toolset for data science. > > Developers can use our toolset for easy data analysis & visualization. > > Get a free account! > > http://www2.precog.com/precogplatform/slashdotnewsletter > > _______________________________________________ > > Kaldi-developers mailing list > > Kal...@li... > > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > > > > > |
From: Nagendra K. G. <nag...@go...> - 2013-04-18 18:38:42
|
Arnab, I prefer not to use soxi as its an overkill sometimes. Sometimes the data may not even be in wav format (sure will convert before using feature extraction but that's a different pipe). How about if we make the syntax requirements more strict - like require the value to be exactly -1. The only issue will be that it's loaded as float, but we could take the difference and require that to be very small. This will help you catch bugs in your scripts early on while keeping me safe. I recall earlier there was some data that had incorrect segmentation (like end time was rounded off), causing scripts to unnecessarily fail for some segments. However that data has been cleaned up. Nagendra -----Original Message----- From: Arnab Ghoshal [mailto:ar...@gm...] Sent: Thursday, April 18, 2013 2:03 PM To: Nagendra Kumar Goel Cc: Daniel Povey; kal...@li... Subject: Re: [Kaldi-developers] extract-segments The reason I don't like the special value is that there is a check to reject segments that are too small. This is a command line option and is visible to the user. The special value (in the current code it's really an interval) is hidden and one can only know about it by reading the code. But the hidden option has a higher priority than the visible option. So while it is reasonable for a user to expect any segments with invalid start and end times (i.e. start >= end) to be rejected, sometimes the whole file may actually get included instead. This is, in fact, how we found the problem-- a scripting bug caused some end times to be 0, which went undetected till some process way down the line died due to a very big segment that shouldn't have been there. There is also an option to accept invalid end times (false by default) and I am not sure what is the reason to have that functionality. The way I would have solved your particular problem is to get the start (which will be 0) and end times for the single utterance files, while keeping the segments format unchanged. You could use soxi to get the end time. Let me know if this works for you. -Arnab On Thu, Apr 18, 2013 at 6:35 PM, Nagendra Kumar Goel <nag...@go...> wrote: > I have been using this to mix in data that is segmented with data that > is sentence by sentence files. I didn't care if its 0 or -1. > > Is there a specific reason you don't like it? It solves a real problem > for me. > > > > From: Daniel Povey [mailto:dp...@gm...] > Sent: Thursday, April 18, 2013 1:32 PM > To: Arnab Ghoshal; Nagendra Kumar Goel > Cc: kal...@li... > Subject: Re: [Kaldi-developers] extract-segments > > > > I think Nagendra may have been using this, he should chime in. > Dan > > > > > > On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> wrote: > > Hi all, > > we just noticed that there is an (unmentioned) assumption in > extract-segments.cc that an end time of (0, -1] in the segments file > means "include till the end of the file". But there are additional > logical bugs that causes an end time of 0 to have the same effect. I > do not like having this special value of the end time and plan to > remove it. But is there anybody who has a good reason to keep such a > functionality? > > -Arnab > > ---------------------------------------------------------------------- > -------- Precog is a next-generation analytics platform capable of > advanced analytics on semi-structured data. The platform includes APIs > for building apps and a phenomenal toolset for data science. > Developers can use our toolset for easy data analysis & visualization. > Get a free account! > http://www2.precog.com/precogplatform/slashdotnewsletter > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Nagendra K. G. <nag...@go...> - 2013-04-18 18:03:57
|
I have been using this to mix in data that is segmented with data that is sentence by sentence files. I didn't care if its 0 or -1. Is there a specific reason you don't like it? It solves a real problem for me. From: Daniel Povey [mailto:dp...@gm...] Sent: Thursday, April 18, 2013 1:32 PM To: Arnab Ghoshal; Nagendra Kumar Goel Cc: kal...@li... Subject: Re: [Kaldi-developers] extract-segments I think Nagendra may have been using this, he should chime in. Dan On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> wrote: Hi all, we just noticed that there is an (unmentioned) assumption in extract-segments.cc that an end time of (0, -1] in the segments file means "include till the end of the file". But there are additional logical bugs that causes an end time of 0 to have the same effect. I do not like having this special value of the end time and plan to remove it. But is there anybody who has a good reason to keep such a functionality? -Arnab ---------------------------------------------------------------------------- -- Precog is a next-generation analytics platform capable of advanced analytics on semi-structured data. The platform includes APIs for building apps and a phenomenal toolset for data science. Developers can use our toolset for easy data analysis & visualization. Get a free account! http://www2.precog.com/precogplatform/slashdotnewsletter _______________________________________________ Kaldi-developers mailing list Kal...@li... https://lists.sourceforge.net/lists/listinfo/kaldi-developers |
From: Arnab G. <ar...@gm...> - 2013-04-18 18:03:11
|
The reason I don't like the special value is that there is a check to reject segments that are too small. This is a command line option and is visible to the user. The special value (in the current code it's really an interval) is hidden and one can only know about it by reading the code. But the hidden option has a higher priority than the visible option. So while it is reasonable for a user to expect any segments with invalid start and end times (i.e. start >= end) to be rejected, sometimes the whole file may actually get included instead. This is, in fact, how we found the problem-- a scripting bug caused some end times to be 0, which went undetected till some process way down the line died due to a very big segment that shouldn't have been there. There is also an option to accept invalid end times (false by default) and I am not sure what is the reason to have that functionality. The way I would have solved your particular problem is to get the start (which will be 0) and end times for the single utterance files, while keeping the segments format unchanged. You could use soxi to get the end time. Let me know if this works for you. -Arnab On Thu, Apr 18, 2013 at 6:35 PM, Nagendra Kumar Goel <nag...@go...> wrote: > I have been using this to mix in data that is segmented with data that is > sentence by sentence files. I didn’t care if its 0 or -1. > > Is there a specific reason you don’t like it? It solves a real problem for > me. > > > > From: Daniel Povey [mailto:dp...@gm...] > Sent: Thursday, April 18, 2013 1:32 PM > To: Arnab Ghoshal; Nagendra Kumar Goel > Cc: kal...@li... > Subject: Re: [Kaldi-developers] extract-segments > > > > I think Nagendra may have been using this, he should chime in. > Dan > > > > > > On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> wrote: > > Hi all, > > we just noticed that there is an (unmentioned) assumption in > extract-segments.cc that an end time of (0, -1] in the segments file > means "include till the end of the file". But there are additional > logical bugs that causes an end time of 0 to have the same effect. I > do not like having this special value of the end time and plan to > remove it. But is there anybody who has a good reason to keep such a > functionality? > > -Arnab > > ------------------------------------------------------------------------------ > Precog is a next-generation analytics platform capable of advanced > analytics on semi-structured data. The platform includes APIs for building > apps and a phenomenal toolset for data science. Developers can use > our toolset for easy data analysis & visualization. Get a free account! > http://www2.precog.com/precogplatform/slashdotnewsletter > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Daniel P. <dp...@gm...> - 2013-04-18 17:39:12
|
Perhaps the condition should be "< 0" rather than "<= 0" ? If that is the problem. Dan On Thu, Apr 18, 2013 at 1:35 PM, Nagendra Kumar Goel < nag...@go...> wrote: > I have been using this to mix in data that is segmented with data that is > sentence by sentence files. I didn’t care if its 0 or -1. **** > > Is there a specific reason you don’t like it? It solves a real problem for > me.**** > > ** ** > > *From:* Daniel Povey [mailto:dp...@gm...] > *Sent:* Thursday, April 18, 2013 1:32 PM > *To:* Arnab Ghoshal; Nagendra Kumar Goel > *Cc:* kal...@li... > *Subject:* Re: [Kaldi-developers] extract-segments**** > > ** ** > > I think Nagendra may have been using this, he should chime in. > Dan**** > > ** ** > > ** ** > > On Thu, Apr 18, 2013 at 1:30 PM, Arnab Ghoshal <ar...@gm...> wrote:* > *** > > Hi all, > > we just noticed that there is an (unmentioned) assumption in > extract-segments.cc that an end time of (0, -1] in the segments file > means "include till the end of the file". But there are additional > logical bugs that causes an end time of 0 to have the same effect. I > do not like having this special value of the end time and plan to > remove it. But is there anybody who has a good reason to keep such a > functionality? > > -Arnab > > > ------------------------------------------------------------------------------ > Precog is a next-generation analytics platform capable of advanced > analytics on semi-structured data. The platform includes APIs for building > apps and a phenomenal toolset for data science. Developers can use > our toolset for easy data analysis & visualization. Get a free account! > http://www2.precog.com/precogplatform/slashdotnewsletter > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers**** > > ** ** > |