Title
On the Relationship Between Bayes Risk and Word Error Rate in ASR
Abstract
Recently, a number of (approximate) approaches emerged in speech processing, which try to overcome the known lack of match between symbol level evaluation measures (e.g., word error rate) and the standard string (symbol sequence) cost (e.g., sentence error)-based Bayes decision rule, by using symbol level cost functions for Bayes decision rule. Nevertheless, experiments show that for a majority of test samples both decision rules still give equal decisions, especially at lower error rates. In this paper, analytic evidence for these observations is provided. A set of conditions is presented, for which Bayes decision rule with symbol level and string level cost function leads to the same decisions. Furthermore, the case of word error cost represented by the Levenshtein (edit) distance is investigated, which upon others covers the important case of speech recognition. A Hamming distance-based upper bound to the Levenshtein cost function is discussed. This cost function relates to former, word-posterior based decision rules, and the corresponding efficient decision rule is shown to be strongly related to Bayes decision rule with the Levenshtein cost. The analytic results are verified experimentally, and their quantitative effect is studied by experiments on four different well-known large vocabulary automatic speech recognition tasks.
Year
DOI
Venue
2011
10.1109/TASL.2010.2091635
Audio, Speech, and Language Processing, IEEE Transactions
Keywords
Field
DocType
Bayes methods,decision theory,speech processing,speech recognition,ASR,Bayes risk,Hamming distance-based upper bound,Levenshtein cost function,Levenshtein distance,edit distance,sentence error-based Bayes decision rule,speech processing,standard string cost,string level cost function,symbol level cost functions,symbol level evaluation measures,symbol sequence cost,vocabulary automatic speech recognition tasks,word error cost,word error rate,word-posterior based decision rules,Bayes decision rule,Bayes risk,Hamming cost,Levenshtein cost,cost function,edit distance
Decision rule,Edit distance,Pattern recognition,Computer science,Word error rate,Levenshtein distance,Speech recognition,Hamming distance,Artificial intelligence,Decision theory,Bayes error rate,Bayes' theorem
Journal
Volume
Issue
ISSN
19
5
1558-7916
Citations 
PageRank 
References 
4
0.44
9
Authors
4
Name
Order
Citations
PageRank
Ralf Schlüter11337136.18
Nussbaum-Thom, M.240.44
Hermann Ney340.44
Markus Nußbaum-Thom4737.00