CAM3D


Change log

Authors

Anis, Marwa 
Baltrusaitis, Tadas 

Description

Cam3D consists of 108 labelled videos of 12 mental states, featuring spontaneous facial expressions and hand gestures. The videos were labelled using crowd-sourcing (inter-rater reliability κ = 0.45).

We used three different sensors for data collection: Microsoft Kinect sensors, HD cameras, and the microphones built into the HD cameras. After the initial data collection, the videos were segmented so that each segment showed a single event, such as a change in facial expression, a head or body posture movement, or a hand gesture. From the videos for which participants gave public consent, a total of 451 segments were collected, with a mean duration of 6 seconds.

Labelling was based on context-free observer judgment, with the public segments labelled through community crowd-sourcing. Out of the 451 segmented videos, we wanted to extract those that can reliably be described as belonging to one of the 24 emotion groups from the Baron-Cohen taxonomy. Of the 2916 labels collected, 122 did not appear in the taxonomy and were excluded from the analysis. The remaining 2794 labels were grouped into one of the 24 groups plus agreement, disagreement, and neutral. To filter out non-emotional segments, we kept only the segments on whose label 60% or more of the raters agreed, resulting in 108 segments in total. The most common label given to a video segment was taken as its ground truth.
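
For illustration only, the following Python sketch shows how an agreement filter and majority-vote ground truth of this kind could be computed from crowd labels; the data structure, segment names, and threshold handling are assumptions made for the example, not the scripts used to build the dataset.

    from collections import Counter

    # Hypothetical input: crowd labels per segment, already mapped onto the
    # 24 Baron-Cohen emotion groups plus agreement, disagreement and neutral.
    segment_labels = {
        "segment_001": ["thinking", "thinking", "unsure", "thinking", "thinking"],
        "segment_002": ["happy", "surprised", "unsure", "neutral", "bored"],
    }

    AGREEMENT_THRESHOLD = 0.6  # keep segments where at least 60% of raters agree

    ground_truth = {}
    for segment, labels in segment_labels.items():
        most_common_label, count = Counter(labels).most_common(1)[0]
        if count / len(labels) >= AGREEMENT_THRESHOLD:
            # The most common label is taken as the segment's ground truth.
            ground_truth[segment] = most_common_label

    print(ground_truth)  # -> {'segment_001': 'thinking'}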

The data is categorized by the ground-truth label and divided into seven folders. For each video segment, we provide the colour video, camera parameters, colour images and their corresponding aligned depth images.

Version

Software / Usage instructions

Each segment is provided as a video file that can be opened with any video player, with the depth and colour information also provided as image files. The Happy, Unsure and Thinking folders are split into multiple parts due to the file size limit.
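
As a rough illustration of how the colour and aligned depth image files might be read together, here is a minimal Python sketch using OpenCV; the folder layout and file naming (colour_*.png / depth_*.png) are assumptions made for the example and will likely need adapting to the actual archive contents.

    from pathlib import Path

    import cv2  # OpenCV, used here only to read the image files

    # Hypothetical layout: one folder per segment containing colour frames
    # (e.g. colour_0001.png) and pixel-aligned depth frames (e.g. depth_0001.png).
    segment_dir = Path("Thinking/segment_001")

    for colour_path in sorted(segment_dir.glob("colour_*.png")):
        depth_path = segment_dir / colour_path.name.replace("colour", "depth")
        colour = cv2.imread(str(colour_path), cv2.IMREAD_COLOR)
        # Depth images are often stored as 16-bit; IMREAD_UNCHANGED preserves that.
        depth = cv2.imread(str(depth_path), cv2.IMREAD_UNCHANGED)
        if colour is None or depth is None:
            continue  # skip frames whose counterpart is missing or unreadable
        # colour and depth now hold aligned arrays for the same frame.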

Keywords

Hand-over-face, Facial expressions, Hand gestures, Multi-modal dataset, Natural expressions, Video segments

Publisher

Relationships
Supplements: