Latent human traits in the language of social media: An open-vocabulary approach
View / Open Files
Authors
Kulkarni, Vivek
Kern, Margaret L
Kosinski, Michal
Matz, Sandra
Ungar, Lyle
Skiena, Steven
Schwartz, H Andrew
Publication Date
2018-11Journal Title
PLoS One
ISSN
1932-6203
Volume
13
Issue
11
Pages
e0201703-e0201703
Language
eng
Type
Article
This Version
VoR
Metadata
Show full item recordCitation
Kulkarni, V., Kern, M. L., Stillwell, D., Kosinski, M., Matz, S., Ungar, L., Skiena, S., & et al. (2018). Latent human traits in the language of social media: An open-vocabulary approach. PLoS One, 13 (11), e0201703-e0201703. https://doi.org/10.1371/journal.pone.0201703
Abstract
Over the past century, personality theory and research has successfully identified core sets of characteristics that consistently describe and explain fundamental differences in the way people think, feel and behave. Such characteristics were derived through theory, dictionary analyses, and survey research using explicit self-reports. The availability of social media data spanning millions of users now makes it possible to automatically derive characteristics from behavioral data-language use-at large scale. Taking advantage of linguistic information available through Facebook, we study the process of inferring a new set of potential human traits based on unprompted language use. We subject these new traits to a comprehensive set of evaluations and compare them with a popular five factor model of personality. We find that our language-based trait construct is often more generalizable in that it often predicts non-questionnaire-based outcomes better than questionnaire-based traits (e.g. entities someone likes, income and intelligence quotient), while the factors remain nearly as stable as traditional factors. Our approach suggests a value in new constructs of personality derived from everyday human language use.
Keywords
Female, Humans, Language, Male, Social Media
Identifiers
External DOI: https://doi.org/10.1371/journal.pone.0201703
This record's URL: https://www.repository.cam.ac.uk/handle/1810/296740