First the worst: Finding better gender translations during beam search
Findings of ACL 2022
Saunders, D., Sallis, R., & Byrne, W. First the worst: Finding better gender translations during beam search. Findings of ACL 2022. https://doi.org/10.17863/CAM.82514
Generating machine translations via beam search seeks the most likely output under a model. However, beam search has been shown to amplify demographic biases exhibited by a model. We aim to address this, focusing on gender bias resulting from systematic errors in grammatical gender translation. Almost all prior work on this problem adjusts the training data or the model itself. By contrast, our approach changes only the inference procedure. We explore two techniques: constraining beam search to improve gender diversity in n-best lists, and reranking n-best lists using gender features obtained from the source sentence. Combining these strongly improves WinoMT gender translation accuracy for three language pairs without additional bilingual data or retraining. We also demonstrate our approach’s utility for consistently gendering named entities, and its flexibility to handle new gendered language beyond the binary.
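To make the reranking idea in the abstract concrete, here is a minimal toy sketch in Python. It is not the authors' implementation: the pronoun table, the Spanish marker sets, and the function names `source_gender`, `gender_score`, and `rerank` are all hypothetical, chosen only for illustration. It assumes an n-best list of (hypothesis, model log-probability) pairs and promotes hypotheses whose gendered words agree with a gender cue found in the source sentence.

```python
from typing import List, Optional, Tuple

# Hypothetical toy lexicons, for illustration only (not from the paper).
SOURCE_GENDER_CUES = {"she": "f", "her": "f", "he": "m", "his": "m", "him": "m"}
TARGET_GENDER_MARKERS = {"f": {"la", "doctora"}, "m": {"el", "doctor"}}


def _tokens(sentence: str) -> List[str]:
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    return [tok.strip(".,!?;:") for tok in sentence.lower().split()]


def source_gender(source: str) -> Optional[str]:
    """Return the gender of the first pronoun cue in the source, if any."""
    for tok in _tokens(source):
        if tok in SOURCE_GENDER_CUES:
            return SOURCE_GENDER_CUES[tok]
    return None


def gender_score(hypothesis: str, gender: str) -> int:
    """Count target tokens that match the gender, minus those that conflict."""
    score = 0
    for tok in _tokens(hypothesis):
        for marker_gender, markers in TARGET_GENDER_MARKERS.items():
            if tok in markers:
                score += 1 if marker_gender == gender else -1
    return score


def rerank(source: str, nbest: List[Tuple[str, float]]) -> List[Tuple[str, float]]:
    """Sort by gender agreement first, then by model log-probability."""
    gender = source_gender(source)
    if gender is None:
        # No cue in the source: fall back to the model's own ordering.
        return sorted(nbest, key=lambda item: item[1], reverse=True)
    return sorted(
        nbest,
        key=lambda item: (gender_score(item[0], gender), item[1]),
        reverse=True,
    )


nbest = [
    ("el doctor terminó su trabajo", -1.0),
    ("la doctora terminó su trabajo", -1.4),
]
best = rerank("The doctor finished her work.", nbest)[0][0]
```

In this toy case the model prefers the masculine hypothesis, but the source pronoun "her" moves the feminine hypothesis to the top. Reranking can only help if a correctly gendered hypothesis is present in the n-best list, which is why the abstract pairs it with constraints that increase gender diversity during beam search.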
EPSRC grants EP/M508007/1 and EP/N509620/1 and performed using resources from the Cambridge Tier-2 system operated by the University of Cambridge Research Computing Service funded by EPSRC Tier-2 capital grant EP/P020259/1.
External DOI: https://doi.org/10.17863/CAM.82514
This record's URL: https://www.repository.cam.ac.uk/handle/1810/335074
All Rights Reserved
Licence URL: http://www.rioxx.net/licenses/all-rights-reserved