Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks

Ragni, A; Li, Q; Gales, MJF; Wang, Y

Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/287923

Repository DOI

https://doi.org/10.17863/CAM.35236

Files

Accepted version (249 KB)

Type

Conference Object

Authors

Ragni, A

Li, Q

Gales, MJF

Wang, Y

Abstract

The standard approach to assess reliability of automatic speech transcriptions is through the use of confidence scores. If accurate, these scores provide a flexible mechanism to flag transcription errors for upstream and downstream applications. One challenging type of errors that recognisers make are deletions. These errors are not accounted for by the standard confidence estimation schemes and are hard to rectify in the upstream and downstream processing. High deletion rates are prominent in limited resource and highly mismatched training/testing conditions studied under IARPA Babel and Material programs. This paper looks at the use of bidirectional recurrent neural networks to yield confidence estimates in predicted as well as deleted words. Several simple schemes are examined for combination. To assess usefulness of this approach, the combined confidence score is examined for untranscribed data selection that favours transcriptions with lower deletion errors. Experiments are conducted using IARPA Babel/Material program languages.

Keywords

confidence score, deletion error, bidirectional recurrent neural network

Journal Title

2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings

Conference Name

2018 IEEE Spoken Language Technology Workshop (SLT)

Journal ISSN

2639-5479

Publisher

IEEE

Publisher DOI

https://doi.org/10.1109/SLT.2018.8639678

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

ALTA Institute, Cambridge University; The Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA) via Air Force Research Laboratory (AFRL)

Collections

Cambridge University Research Outputs