Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing
Institute of Electrical and Electronics Engineers (IEEE)
Ragni, A., Gales, M., Rose, O., Knill, K., Kastanos, A., Li, Q., & Ness, P. (2022). Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition. IEEE/ACM Transactions on Audio Speech and Language Processing. https://doi.org/10.1109/TASLP.2022.3161153
Accurate confidence measures for predictions from machine learning techniques play a critical role in the deployment and training of many speech and language processing applications. For example, confidence scores are important when making use of automatically generated transcriptions in training automatic speech recognition (ASR) systems, as well as in downstream applications such as information retrieval and conversational assistants. Previous work on improving confidence scores for these systems has focused on two main directions: designing features correlated with improved confidence prediction, and employing sequence models to account for the importance of contextual information. Few studies, however, have explored incorporating contextual information more broadly, such as from the future in addition to the past, or making use of multiple alternative hypotheses in addition to the most likely one. This article introduces two general approaches for encapsulating contextual information from lattices. Experimental results illustrating the importance of increasing contextual information for estimating confidence scores are presented on a range of limited-resource languages where word error rates range between 30% and 60%. The results show that the novel approaches provide significant gains in the accuracy of confidence estimation.
Speech recognition, confidence, recurrent neural network, attention, graph structures
All authors were supported in part by the ALTA Institute, Cambridge University. A. Ragni and M. Gales were also supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via Air Force Research Laboratory (AFRL) contract # FA8650-17-C-9117.
Cambridge Assessment (unknown)
External DOI: https://doi.org/10.1109/TASLP.2022.3161153
This record's URL: https://www.repository.cam.ac.uk/handle/1810/335029
All Rights Reserved
Licence URL: http://www.rioxx.net/licenses/all-rights-reserved