Repository logo
 

A psychometric framework for evaluating and shaping personality traits in large language models

Published version
Peer-reviewed

Repository DOI


Change log

Authors

Abstract

The advent of large language models (LLMs) has revolutionized natural language processing, enabling the generation of coherent and contextually relevant human-like text. As LLMs increasingly power conversational agents used by the general public worldwide, the synthetic personality traits embedded in these models by virtue of training on large amounts of human data are becoming increasingly important to evaluate. The style in which LLMs respond can mimic different human personality traits. Here, as these patterns can be a key factor determining the effectiveness of communication, we present a comprehensive psychometric methodology for administering and validating personality tests on widely used LLMs, as well as for shaping personality in the generated text of such LLMs. Applying this method to 18 LLMs, we found that: personality measurements in the outputs of some LLMs under specific prompting configurations are reliable and valid; evidence of reliability and validity of synthetic LLM personality is stronger for larger and instruction-fine-tuned models; and personality in LLM outputs can be shaped along desired dimensions to mimic specific human personality profiles. We discuss the application and ethical implications of the measurement and shaping method, in particular regarding responsible artificial intelligence.

Description

Funder: Luning Sun gratefully acknowledges financial support from Invesco through their philanthropic donation to Cambridge Judge Business School.


Funder: Stephen Fitz gratefully acknowledges support from the University of Chicago Research Computing Center, which provided compute credits essential to this research.

Journal Title

Nature Machine Intelligence

Conference Name

Journal ISSN

2522-5839
2522-5839

Volume Title

7

Publisher

Nature Research

Rights and licensing

Except where otherwised noted, this item's license is described as http://creativecommons.org/licenses/by-nc-nd/4.0/
Sponsorship
EPSRC (EP/T022159/1)
G.S.-G. is supported by the Bill & Melinda Gates Foundation through a Gates Cambridge Scholarship (OPP1144). S.F. is supported by the University of Chicago Research Computing Center. Inference for open models used compute resources provided by the Cambridge Service for Data Driven Discovery (CSD3) at the University of Cambridge, made possible by Tier-2 funding from the EPSRC (EP/T022159/1) and DiRAC funding from STFC (https://www.dirac.ac.uk).