kaldi-developers Mailing List for Kaldi (Page 15)

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Hello,

first of all let me thank you for bringing cutting-edge speech recognition
to the mortals!

I am using Kaldi to jump-start training of recurrent neural networks for
phoneme recognition on Timit and to compare results between Kaldi decoders
and the recurrent net based ones.

The s5 recipe for Timit ships with two scorers: sclite and basic. Sclite
tends to compute lower error rates, which I attribute to different scoring
of errors relating to the silence token. However, for scoring it requires
not only the decoded phoneme sequence, but also the timing of each phoneme.
Since my decoder doesn't align the decoded phones precisely in time, I was
using the basic scoring script.

I have two questions:
1. am I correct about the differences between the two scorers' computed
error rates to different handling of the silence token? I rescored models
obtained using the standard recipe and they get consistently higher error
rates using the basic scorer.
2. Do you have any intuitions on how precise the phone timing information
needs to be for the sclite scorer to work? Is the timing quality part of
the score or is it only used to save on computations?

Sincerely,
Jan Chorowski

2011	Jan	Feb	Mar	Apr	May	Jun (4)	Jul	Aug	Sep (1)	Oct (4)	Nov (1)	Dec (14)
2012	Jan (1)	Feb (8)	Mar	Apr (1)	May (3)	Jun (13)	Jul (7)	Aug (11)	Sep (6)	Oct (14)	Nov (16)	Dec (1)
2013	Jan (3)	Feb (8)	Mar (17)	Apr (21)	May (27)	Jun (11)	Jul (11)	Aug (21)	Sep (39)	Oct (17)	Nov (39)	Dec (28)
2014	Jan (36)	Feb (30)	Mar (35)	Apr (17)	May (22)	Jun (28)	Jul (23)	Aug (41)	Sep (17)	Oct (10)	Nov (22)	Dec (56)
2015	Jan (30)	Feb (32)	Mar (37)	Apr (28)	May (79)	Jun (18)	Jul (35)	Aug	Sep (1)	Oct	Nov	Dec

kaldi-developers Mailing List for Kaldi (Page 15)

kaldi-developers — Kaldi Developers