Spoken language 'grammatical error correction'

Accepted version
Peer-reviewed

Type

Conference Object

Authors

Lu, Y 
Gales, MJF 
Wang, Y 

Abstract

Spoken language ‘grammatical error correction’ (GEC) is an important mechanism to help learners of a foreign language, here English, improve their spoken grammar. GEC is challenging for non-native spoken language because of interruptions from disfluent speech events, such as repetitions and false starts, and because it is difficult to define strictly what is acceptable in spoken language. Furthermore, there is little labelled data to train models. One way to mitigate the impact of speech events is to use a disfluency detection (DD) model. Removing the detected disfluencies brings the speech transcript closer to written language, which has significantly more labelled training data. This paper considers two types of approaches to leveraging DD models to boost spoken GEC performance. The first is sequential: a separately trained DD model acts as a pre-processing module, providing a more structured input to the GEC model. The second is to train the DD and GEC models in an end-to-end fashion, simultaneously optimising both modules. Embeddings enable end-to-end models to have a richer information flow. Experimental results show that DD effectively regulates GEC input; end-to-end training works well when fine-tuned on limited labelled in-domain data; and improving DD by incorporating acoustic information helps improve spoken GEC.
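
The sequential approach described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the rule-based detector `toy_disfluency_tags`, the placeholder `identity_gec`, and all other names below are hypothetical stand-ins for trained DD and GEC models, chosen only to show how the output of a DD stage feeds the GEC stage.

```python
from typing import Callable, List

Tokens = List[str]


def toy_disfluency_tags(tokens: Tokens) -> List[bool]:
    """Hypothetical stand-in for a trained DD model.

    Flags a token as disfluent if it is a filled pause or if it is
    immediately repeated by the next token (a crude repetition rule).
    """
    fillers = {"um", "uh", "er"}
    tags = []
    for i, tok in enumerate(tokens):
        is_filler = tok.lower() in fillers
        is_repeat = i + 1 < len(tokens) and tok.lower() == tokens[i + 1].lower()
        tags.append(is_filler or is_repeat)
    return tags


def remove_disfluencies(tokens: Tokens, tags: List[bool]) -> Tokens:
    """Drop every token that the DD stage marked as disfluent."""
    return [tok for tok, disfluent in zip(tokens, tags) if not disfluent]


def identity_gec(tokens: Tokens) -> Tokens:
    """Placeholder for a GEC model (assumption): returns its input unchanged."""
    return list(tokens)


def sequential_pipeline(
    tokens: Tokens,
    dd: Callable[[Tokens], List[bool]],
    gec: Callable[[Tokens], Tokens],
) -> Tokens:
    """Sequential DD then GEC: the DD stage cleans the transcript first."""
    tags = dd(tokens)
    fluent = remove_disfluencies(tokens, tags)
    return gec(fluent)


transcript = "i i um went to to the shop".split()
cleaned = sequential_pipeline(transcript, toy_disfluency_tags, identity_gec)
```

In this sketch the two stages are independent callables, which mirrors the separately trained, pre-processing arrangement; the end-to-end variant would instead optimise both modules jointly and is not shown here.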

Keywords

grammatical error correction, disfluency detection

Journal Title

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Conference Name

Interspeech 2020

Journal ISSN

2308-457X
1990-9772

Volume Title

2020-October

Publisher

ISCA

Rights

All rights reserved

Sponsorship

Cambridge Assessment (unknown)