Dataset Bias in Deception Detection
View / Open Files
Journal Title
Proceedings of the 26th International Conference on Pattern Recognition
Conference Name
The 26th International Conference on Pattern Recognition
Type
Conference Object
This Version
AM
Metadata
Show full item recordCitation
Mambreyan, A., Punskaya, E., & Gunes, H. Dataset Bias in Deception Detection. Proceedings of the 26th International Conference on Pattern Recognition https://doi.org/10.17863/CAM.85707
Abstract
With the advances in Machine Learning, lie detection technology gained significant attention. In recent years, several multi-modal techniques achieved as high as 99% accuracy results using the Real-life Trial dataset with only 121 data points. This led to considerable media hype and research interest in lie detection with machine learning. In this paper, we analyze the effect of dataset bias in deception detection. More specifically, we train a classifier to predict the sex of the identity appearing in the video. On a test data point, we use the sex predictor to predict sex which we use as a proxy for predicting deception, predicting
lie for females and truth for males. This lie predictor simulates a classifier that uses nothing but dataset bias. Nevertheless, we find that the performance of this biased classifier is comparable to those of state-of-the-art papers. More specifically, when using IDT features, our biased classifier achieves 64.6% and 59.3% AUC while a classifier trained normally on truth/lie labels achieves 57.4% accuracy and 69.3% AUC. We perform similar experiments on the Bag-of-Lies dataset and show that it too is biased with respect to sex. In addition, we apply the state-of-
the-art techniques on an unbiased dataset and show that their performance is no better than chance. Our experiments strongly
suggest that the results of recent deception detection techniques
can be explained by the bias inherent in the datasets.
Sponsorship
Engineering and Physical Sciences Research Council (EP/R030782/1)
European Commission Horizon 2020 (H2020) Societal Challenges (826232)
Identifiers
External DOI: https://doi.org/10.17863/CAM.85707
This record's URL: https://www.repository.cam.ac.uk/handle/1810/338298
Statistics
Total file downloads (since January 2020). For more information on metrics see the
IRUS guide.
Recommended or similar items
The current recommendation prototype on the Apollo Repository will be turned off on 03 February 2023. Although the pilot has been fruitful for both parties, the service provider IKVA is focusing on horizon scanning products and so the recommender service can no longer be supported. We recognise the importance of recommender services in supporting research discovery and are evaluating offerings from other service providers. If you would like to offer feedback on this decision please contact us on: support@repository.cam.ac.uk