Elastic weight consolidation for better bias inoculation

Thorne, J; Vlachos, A

Elastic weight consolidation for better bias inoculation

Published version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/324367

Repository DOI

https://doi.org/10.17863/CAM.71822

Files

Published version (402.03 KB)

Type

Conference Object

Authors

Thorne, J

Vlachos, Andreas

https://orcid.org/0000-0003-2123-5071

Abstract

The biases present in training datasets have been shown to affect models for sentence pair classification tasks such as natural language inference (NLI) and fact verification. While fine-tuning models on additional data has been used to mitigate them, a common issue is that of catastrophic forgetting of the original training dataset. In this paper, we show that elastic weight consolidation (EWC) allows fine-tuning of models to mitigate biases while being less susceptible to catastrophic forgetting. In our evaluation on fact verification and NLI stress tests, we show that fine-tuning with EWC dominates standard fine-tuning, yielding models with lower levels of forgetting on the original (biased) dataset for equivalent gains in accuracy on the fine-tuning (unbiased) dataset.

Keywords

cs.CL, cs.CL, cs.LG

Journal Title

EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference

Conference Name

EACL 2021 - 16th Conference of the European Chapter of the Association for Computational Linguistics

Publisher

Association for Computational Linguistics

Publisher DOI

https://doi.org/10.17863/CAM.71822

Rights

Attribution 4.0 International

Sponsorship

European Commission Horizon 2020 (H2020) ERC (865958)
European Commission Horizon 2020 (H2020) ERC (965576)

Collections

Cambridge University Research Outputs