Repository logo
 

How Much Speech Data Is Needed for Tracking Language Change in Alzheimer's Disease? A Comparison of Random Length, 5-Min, and 1-Min Spontaneous Speech Samples.

Accepted version
Peer-reviewed

No Thumbnail Available

Type

Article

Change log

Authors

Petti, Ulla 
Baker, Simon 
Korhonen, Anna 
Robin, Jessica 

Abstract

INTRODUCTION: Changes in speech can act as biomarkers of cognitive decline in Alzheimer's disease (AD). While shorter speech samples would promote data collection and analysis, the minimum length of informative speech samples remains debated. This study aims to provide insight into the effect of sample length in analyzing longitudinal recordings of spontaneous speech in AD by comparing the original random length, 5- and 1-minute-long samples. We hope to understand whether capping the audio improves the accuracy of the analysis, and whether an extra 4 min conveys necessary information. METHODS: 110 spontaneous speech samples were collected from decades of Youtube videos of 17 public figures, 9 of whom eventually developed AD. 456 language features were extracted and their text-length-sensitivity, comparability, and ability to capture change over time were analyzed across three different sample lengths. RESULTS: Capped audio files had advantages over the random length ones. While most extracted features were statistically comparable or highly correlated across the datasets, potential effects of sample length should be acknowledged for some features. The 5-min dataset presented the highest reliability in tracking the evolution of the disease, suggesting that the 4 extra minutes do convey informative data. CONCLUSION: Sample length seems to play an important role in extracting the language feature values from speech and tracking disease progress over time. We highlight the importance of further research into optimal sample length and standardization of methods when studying speech in AD.

Description

Keywords

Alzheimer’s disease, Dementia, Digital biomarkers, Digital health, Language, Natural language processing, Speech

Journal Title

Digit Biomark

Conference Name

Journal ISSN

2504-110X
2504-110X

Volume Title

Publisher

S. Karger AG
Sponsorship
Economic and Social Research Council (2275526)
ESRC (ES/P000738/1)
Economic and Social Research Council (ESRC) Cambridge Doctoral Training Partnership (DTP) grant number ES/P000738/1.

Version History

Now showing 1 - 2 of 2
VersionDateSummary
2024-01-11 15:48:07
Published version added
1*
2023-08-08 23:32:02
* Selected version