On the evaluation and application of neural language models for grammatical error detection
Abstract
Neural language models (NLMs) have become a core component of many downstream applications in natural language processing, including data-driven automatic grammatical error detection (GED). This thesis explores whether information from NLMs can transfer positively to GED in the domain of English as a second language (ESL) learning, and examines whether NLMs encode and make use of linguistic signals that would facilitate robust and generalisable GED performance.
First, I investigate whether information from different types of neural language model can be transferred to models for GED. I evaluate five models against three publicly available ESL benchmarks and report results showing positive transfer effects, to the extent that fine-grained error detection using a single model is becoming viable. Second, I carry out a causal investigation into whether NLM-GED models make use of robust linguistic signals during inference; in theory, such signals would enable them to generalise across different data distributions. The results show a high degree of linear encoding of noun number within each model’s token-level contextual representations, but they also show markedly varying error detection performance across model types and across in- and out-of-domain datasets. Taken together, these results indicate that the models employ different strategies for error detection. Third, I re-frame the typically downstream GED task as an evaluation framework to test whether pre-trained NLMs implicitly encode information about grammatical errors as an artefact of their language modelling objective. I present results illustrating stark differences between masked language models and autoregressive language models: while the former seemingly encode much more information related to the detection of grammatical errors, the results also present evidence of a brittle encoding across different syntactic constructions.
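To make the probing component of the second investigation concrete, the sketch below shows one common way such an analysis is set up; it is an illustrative example rather than code from the thesis. It trains a linear classifier (a probe) on token-level contextual representations from an off-the-shelf masked language model to predict noun number. The model name (bert-base-cased), the toy sentences, and the probe design are assumptions made for illustration; the thesis's own models, data, and experimental setup may differ.

```python
# Hedged sketch (not from the thesis): a minimal linear probe for noun number,
# trained on token-level contextual representations from a masked language model.
# Model name, probe design, and the toy data below are illustrative assumptions.
import numpy as np
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModel.from_pretrained("bert-base-cased")
model.eval()

# Toy examples: (sentence, character offset of the target noun, label).
# Label 1 = plural noun, 0 = singular noun.
examples = [
    ("The dogs chase the ball.", 4, 1),
    ("The dog chases the ball.", 4, 0),
    ("Several ideas were discussed.", 8, 1),
    ("The idea was discussed.", 4, 0),
]

features, labels = [], []
with torch.no_grad():
    for text, char_idx, label in examples:
        enc = tokenizer(text, return_tensors="pt")
        hidden = model(**enc).last_hidden_state[0]   # (seq_len, hidden_dim)
        tok_idx = enc.char_to_token(0, char_idx)     # map character offset to subword token
        features.append(hidden[tok_idx].numpy())
        labels.append(label)

# A simple linear classifier over frozen representations: high accuracy on
# held-out tokens would suggest noun number is linearly decodable.
probe = LogisticRegression(max_iter=1000).fit(np.array(features), labels)
print("train accuracy:", probe.score(np.array(features), labels))
```

High accuracy on held-out tokens from a probe of this kind is the sort of evidence behind the claim of linear encoding of noun number, although, as the paragraph above notes, strong linear decodability does not by itself guarantee that the GED models rely on that signal during inference.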
Altogether, this thesis presents a holistic analysis of NLMs: how they might be applied to GED, whether they utilise linguistic information to enable robust inference, and whether their pre-training objective implicitly imbues them with knowledge about grammaticality.
