Repository logo
 

Real-time factored ConvNets: Extracting the x factor in human parsing

Accepted version
Peer-reviewed

Type

Conference Object

Change log

Authors

Charles, J 
Budvytis, I 

Abstract

© 2017. The copyright of this document resides with its authors. We propose a real-time and lightweight multi-task style ConvNet (termed a Factored ConvNet) for human body parsing in images or video. Factored ConvNets have isolated areas which perform known sub-tasks, such as object localization or edge detection. We call this area and sub-task pair an X factor. Unlike multi-task ConvNets which have independent tasks, the Factored ConvNet’s sub-task has direct effect on the main task outcome. In this paper we show how to isolate the X factor of foreground/background (f/b) subtraction from the main task of segmenting human body images into 31 different body part types. Knowledge of this X factor leads to a number of benefits for the Factored ConvNet: 1) Ease of network transfer to other image domains, 2) ability to personalize to humans in video and 3) easy model performance boosts. All achieved by either efficient network update or replacement of the X factor whilst avoiding catastrophic forgetting of previously learnt body part dependencies and structure. We show these benefits on a large dataset of images and also on YouTube videos.

Description

Keywords

Journal Title

British Machine Vision Conference 2017, BMVC 2017

Conference Name

British Machine Vision Conference 2017, BMVC 2017

Journal ISSN

Volume Title

Publisher

British Machine Vision Association
Sponsorship
SeeQuestor