Large Scale Labelled Video Data Augmentation for Semantic Segmentation in Driving Scenarios
Proceedings - 2017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017
MetadataShow full item record
Budvytis, I., Sauer, P., Roddick, T., Breen, K., & Cipolla, R. (2017). Large Scale Labelled Video Data Augmentation for Semantic Segmentation in Driving Scenarios. Proceedings - 2017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017, 2018-January 230-237. https://doi.org/10.1109/ICCVW.2017.36
In this paper we present an analysis of the effect of large scale video data augmentation for semantic segmentation in driving scenarios. Our work is motivated by a strong correlation between the high performance of most recent deep learning based methods and the availability of large volumes of ground truth labels. To generate additional labelled data, we make use of an occlusion-aware and uncertainty-enabled label propagation algorithm. As a result we increase the availability of high-resolution labelled frames by a factor of 20, yielding in a 6.8% to 10.8% rise in average classification accuracy and/or IoU scores for several semantic segmentation networks. Our key contributions include: (a) augmented CityScapes and CamVid datasets providing 56.2K and 6.5K additional labelled frames of object classes respectively, (b) detailed empirical analysis of the effect of the use of augmented data as well as (c) extension of proposed framework to instance segmentation.
External DOI: https://doi.org/10.1109/ICCVW.2017.36
This record's URL: https://www.repository.cam.ac.uk/handle/1810/274298