Structural Priors in Deep Neural Networks

cam.restriction: thesis_access_open
cam.supervisor: Cipolla, Roberto
cam.supervisor: Criminisi, Antonio
cam.supervisor.orcid: Cipolla, Roberto [0000-0002-8999-2151]
cam.supervisor.orcid: Criminisi, Antonio [0000-0001-7976-3374]
cam.thesis.funding: false
dc.contributor.author: Ioannou, Yani Andrew
dc.contributor.orcid: Ioannou, Yani Andrew [0000-0002-9797-5888]
dc.date.accessioned: 2018-08-22T10:19:41Z
dc.date.available: 2018-08-22T10:19:41Z
dc.date.issued: 2018-10-20
dc.date.submitted: 2017-09-28
dc.date.updated: 2018-08-21T00:46:57Z
dc.description.abstract: Deep learning has in recent years come to dominate the previously separate fields of research in machine learning, computer vision, natural language understanding and speech recognition. Despite breakthroughs in training deep networks, there remains a lack of understanding of both the optimization and structure of deep networks. The approach advocated by many researchers in the field has been to train monolithic networks with excess complexity and strong regularization, an approach that leaves much to be desired in efficiency. Instead, we propose that carefully designing networks in consideration of our prior knowledge of the task and learned representation can improve the memory and compute efficiency of state-of-the-art networks, and even improve generalization: what we propose to denote as structural priors. We present two such novel structural priors for convolutional neural networks, and evaluate them in state-of-the-art image classification CNN architectures. The first method exploits our knowledge of the low-rank nature of most filters learned for natural images by structuring a deep network to learn a collection of mostly small, low-rank filters. The second addresses the channel extents of convolutional filters, learning filters with limited channel extents. The size of these channel-wise basis filters increases with the depth of the model, giving a novel sparse connection structure that resembles a tree root. Both methods are found to improve the generalization of these architectures while also decreasing their size and increasing the efficiency of their training and test-time computation. Finally, we present work towards conditional computation in deep neural networks, moving towards a method of automatically learning structural priors in deep networks. We propose a new discriminative learning model, conditional networks, which jointly exploits the accurate representation learning capabilities of deep neural networks and the efficient conditional computation of decision trees. Conditional networks yield smaller models, and offer test-time flexibility in the trade-off of computation vs. accuracy.
dc.description.sponsorship: Funded by a Microsoft Research PhD Scholarship
dc.identifier.doi: 10.17863/CAM.26357
dc.identifier.uri: https://www.repository.cam.ac.uk/handle/1810/278976
dc.language.iso: en
dc.publisher.college: Jesus College
dc.publisher.department: Engineering
dc.publisher.institution: University of Cambridge
dc.rights: All Rights Reserved
dc.rights.uri: https://www.rioxx.net/licenses/all-rights-reserved/
dc.subject: Deep Learning
dc.subject: Neural Networks
dc.subject: Machine Learning
dc.subject: Computer Vision
dc.subject: Structural Priors
dc.subject: Filter Groups
dc.subject: Low Rank
dc.subject: Convolution
dc.subject: Deep Neural Networks
dc.subject: Convolutional Neural Networks
dc.subject: CNN
dc.subject: Efficient
dc.title: Structural Priors in Deep Neural Networks
dc.type: Thesis
dc.type.qualificationlevel: Doctoral
dc.type.qualificationname: Doctor of Philosophy (PhD)
dc.type.qualificationtitle: Doctor of Philosophy in Information Engineering
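
The abstract describes two structural priors: spatially low-rank filters, and channel-wise filter groups whose limited channel extents give a root-like connection structure. The following is a minimal PyTorch sketch of both ideas under assumed shapes (64 channels, 8 groups, 32 x 32 feature maps); it is an illustration, not code from the thesis.

    import torch
    import torch.nn as nn

    # Structural prior 1: low-rank spatial filters. A full 3x3 convolution
    # is replaced by a 3x1 followed by a 1x3 convolution, so each effective
    # filter is rank-1 in its spatial extent.
    low_rank = nn.Sequential(
        nn.Conv2d(64, 64, kernel_size=(3, 1), padding=(1, 0)),
        nn.Conv2d(64, 64, kernel_size=(1, 3), padding=(0, 1)),
    )

    # Structural prior 2: filter groups with limited channel extents. Each
    # 3x3 filter sees only 64/8 = 8 input channels; a 1x1 convolution then
    # mixes information across the groups.
    rooted = nn.Sequential(
        nn.Conv2d(64, 64, kernel_size=3, padding=1, groups=8),
        nn.Conv2d(64, 64, kernel_size=1),
    )

    x = torch.randn(1, 64, 32, 32)  # dummy feature map
    print(low_rank(x).shape, rooted(x).shape)  # both: [1, 64, 32, 32]

For these assumed shapes, a full 3x3 convolution has 64 x 64 x 9 = 36,864 weights, the low-rank pair has 2 x 64 x 64 x 3 = 24,576, and the grouped pair has 64 x 8 x 9 + 64 x 64 = 8,704, illustrating the kind of parameter saving the abstract refers to.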

Files

Original bundle (2 files)

Name: thesisv2.pdf
Size: 15.29 MB
Format: Adobe Portable Document Format
Description: Thesis - minor changes to correct formatting
Licence: https://www.rioxx.net/licenses/all-rights-reserved/

Name: thesis.pdf
Size: 15.3 MB
Format: Adobe Portable Document Format
Description: Thesis - original
Licence: https://www.rioxx.net/licenses/all-rights-reserved/