Psychometric scaling of TID2013 dataset
2018 10th International Conference on Quality of Multimedia Experience, QoMEX 2018
Mikhailiuk, A., Perez-Ortiz, M., & Mantiuk, R. (2018). Psychometric scaling of TID2013 dataset. 2018 10th International Conference on Quality of Multimedia Experience, QoMEX 2018. https://doi.org/10.1109/QoMEX.2018.8463376
TID2013 is a subjective image quality assessment dataset with a wide range of distortion types and over 3000 images. The dataset has proven to be a challenging test for objective quality metrics. The dataset's mean opinion scores were obtained by collecting pairwise comparison judgments using the Swiss tournament system and averaging the votes of observers. However, this approach differs from the usual analysis of multiple pairwise comparisons, which involves psychometric scaling of the comparison data using either Thurstone or Bradley-Terry models. In this paper we investigate how quality scores change when they are computed using such psychometric scaling instead of averaging vote counts. To properly scale TID2013 quality scores, we conduct four additional experiments of two different types, which we found necessary to produce a common quality scale: comparisons with reference images, and cross-content comparisons. We demonstrate in a fifth, validation experiment that the two additional types of comparisons are necessary and, in conjunction with psychometric scaling, improve the consistency of quality scores, especially across images depicting different contents.
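To illustrate the difference between averaging vote counts and psychometric scaling, the following is a minimal sketch of Bradley-Terry scaling fitted with a standard minorization-maximization iteration. It is not the authors' implementation: the function name `bradley_terry_scores` and the toy win matrix are hypothetical, and the sketch assumes every condition has at least one win and at least one comparison.

```python
import math


def bradley_terry_scores(wins, iterations=500):
    """Estimate Bradley-Terry quality scores from pairwise comparison counts.

    wins[i][j] is the number of times condition i was preferred over j.
    Assumes every condition wins at least once (otherwise its strength
    collapses to zero and the logarithm below is undefined).
    Returns zero-mean log-strengths; higher means better perceived quality.
    """
    n = len(wins)
    strength = [1.0] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total_wins = sum(wins[i][j] for j in range(n) if j != i)
            # Each pair contributes its comparison count divided by the
            # sum of the two current strengths.
            denom = sum(
                (wins[i][j] + wins[j][i]) / (strength[i] + strength[j])
                for j in range(n)
                if j != i
            )
            updated.append(total_wins / denom)
        # Strengths are only defined up to a common factor, so rescale.
        total = sum(updated)
        strength = [s * n / total for s in updated]
    logs = [math.log(s) for s in strength]
    mean_log = sum(logs) / n
    return [v - mean_log for v in logs]


# Hypothetical toy data: three conditions, ten comparisons per pair.
toy_wins = [
    [0, 7, 9],
    [3, 0, 6],
    [1, 4, 0],
]

scores = bradley_terry_scores(toy_wins)
# Vote-count averaging, for contrast: fraction of comparisons won.
vote_fractions = [sum(row) / 20 for row in toy_wins]
print(scores)
print(vote_fractions)
```

Unlike the vote fractions, the scaled scores lie on an interval scale in which differences map to preference probabilities under the model's assumptions. Placing images of different contents on one such scale additionally requires comparisons that link them, which is the role of the cross-content and reference comparisons described in the abstract.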
European Commission Horizon 2020 (H2020) ERC (725253)
External DOI: https://doi.org/10.1109/QoMEX.2018.8463376
This record's URL: https://www.repository.cam.ac.uk/handle/1810/279248