Improving Interpretability and Regularization in Deep Learning

Wu, C; Gales, MJF; Ragni, A; Karanasou, P; Sim, KC

Improving Interpretability and Regularization in Deep Learning

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/274201

Repository DOI

https://doi.org/10.17863/CAM.21302

Files

Accepted version (2.13 MB)

Type

Article

Authors

Wu, Chunyang

https://orcid.org/0000-0002-0269-3555

Gales, MJF

Ragni, A

Karanasou, P

Sim, KC

Abstract

IEEE Deep learning approaches yield state-of-the-art performance in a range of tasks, including automatic speech recognition. However, the highly distributed representation in a deep neural network (DNN) or other network variations are difficult to analyse, making further parameter interpretation and regularisation challenging. This paper presents a regularisation scheme acting on the activation function output to improve the network interpretability and regularisation. The proposed approach, referred to as activation regularisation, encourages activation function outputs to satisfy a target pattern. By defining appropriate target patterns, different learning concepts can be imposed on the network. This method can aid network interpretability and also has the potential to reduce over-fitting. The scheme is evaluated on several continuous speech recognition tasks: the Wall Street Journal continuous speech recognition task, eight conversational telephone speech tasks from the IARPA Babel program and a U.S. English broadcast news task. On all the tasks, the activation regularisation achieved consistent performance gains over the standard DNN baselines.

Keywords

Activation regularisation, interpretability, visualisation, neural network, deep learning

Journal Title

IEEE/ACM Transactions on Audio Speech and Language Processing

Journal ISSN

2329-9290
2329-9304

Volume Title

26

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publisher DOI

https://doi.org/10.1109/TASLP.2017.2774919

Rights

http://www.rioxx.net/licenses/all-rights-reserved

Sponsorship

IARPA (4912046943)
Cambridge Assessment (unknown)

Collections

Scholarly Works - Engineering