Repository logo
 

Robot-Led Vision Language Model Wellbeing Assessment of Children

Accepted version
Peer-reviewed

Change log

Abstract

This study presents a novel robot-led approach to assessing children’s mental wellbeing using a Vision Language Model (VLM). Inspired by the Child Apperception Test (CAT), the social robot NAO presented children with pictorial stimuli to elicit their verbal narratives of the images, which were then evaluated by a VLM in accordance with CAT assessment guidelines. The VLM’s assessments were systematically compared to those provided by a trained psychologist. The results reveal that while the VLM demonstrates moderate reliability in identifying cases with no wellbeing concerns, its ability to accurately classify assessments with wellbeing concerns remains limited. Moreover, although the model’s performance was generally consistent when prompted with varying demographic factors such as age and gender, a significantly higher false positive rate was observed for girls, indicating potential sensitivity to gender attribute. These findings highlight both the promise and the challenges of integrating VLMs into robot-led assessments of children's wellbeing.

Description

Keywords

Journal Title

RO-MAN

Conference Name

34th IEEE International Conference on Robot and Human Interactive Communication (IEEE RO-MAN 2025)

Journal ISSN

Volume Title

Publisher

IEEE

Publisher DOI

Rights and licensing

Except where otherwised noted, this item's license is described as Attribution 4.0 International
Sponsorship
Engineering and Physical Sciences Research Council (EP/R030782/1)
N. I. Abbasi is supported by the W.D. Armstrong Trust PhD Studentship and the Cambridge Trusts. F. I. Dogan, G. Laban, and H. Gunes have been supported by the EPSRC project ARoEQ under grant ref. EP/R030782/1. All research at the Department of Psychiatry (University of Cambridge) is supported by the NIHR Cambridge Biomedical Research Centre (BRC-1215-20014, particularly T. Ford) and NIHR Applied Research Collaboration East of England (P. Jones, J. Anderson).