We will be undertaking essential maintenance work on Apollo's infrastructure on Thursday 14 August and Friday 15 August, therefore expect intermittent access to Apollo's content and search interface during that time. Please also note that Apollo's "Request a copy" service will be temporarily disabled while we undertake this work.
Repository logo
 

Developing a sentence level fairness metric using word embeddings

Published version
Peer-reviewed

Repository DOI


Change log

Abstract

Fairness is a principal social value that is observable in civilisations around the world. Yet, a fairness metric for digital texts that describe even a simple social interaction, e.g., ‘The boy hurt the girl’ has not been developed. We address this by employing word embeddings that use factors found in a new social psychology literature review on the topic. We use these factors to build fairness vectors. These vectors are used as sentence level measures, whereby each dimension reflects a fairness component. The approach is employed to approximate human perceptions of fairness. The method leverages a pro-social bias within word embeddings, for which we obtain an F1 = 79.8 on a list of sentences using the Universal Sentence Encoder (USE). A second approach, using principal component analysis (PCA) and machine learning (ML), produces an F1 = 86.2. Repeating these tests using Sentence Bidirectional Encoder Representations from Transformers (SBERT) produces an F1 = 96.9 and F1 = 100 respectively. Improvements using subspace representations are further suggested. By proposing a first-principles approach, the paper contributes to the analysis of digital texts along an ethical dimension.

Description

Acknowledgements: This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the NGI TRUST grant agreement no. 825618. The Psychometrics Centre, Cambridge Judge Business School Small Grants Scheme, and Isaac Newton Trust. Some of the work was conducted at the Social Decision-Making Laboratory, University of Cambridge. We would like to express our gratitude to Dr Joseph Watson for his valuable feedback. We would also like to thank the two anonymous reviewers for their critical review of the paper.

Journal Title

International Journal of Digital Humanities

Conference Name

Journal ISSN

2524-7832
2524-7840

Volume Title

5

Publisher

Springer Science and Business Media LLC

Rights and licensing

Except where otherwised noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/
Sponsorship
NGI Trust (825618)
The Psychometrics Centre, Cambridge Judge Business School Small Grants Scheme (The Psychometrics Centre, Cambridge Judge Business School Small Grants Scheme)
The Isaac Newton Trust (The Isaac Newton Trust)
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the NGI TRUST grant agreement no. 825618. The Psychometrics Centre, Cambridge Judge Business School Small Grants Scheme, and Isaac Newton Trust