Show simple item record

dc.contributor.advisor  Grant, Christan
dc.contributor.author  Graham, Austin
dc.date.accessioned  2019-08-02T16:14:53Z
dc.date.available  2019-08-02T16:14:53Z
dc.date.issued  2019-08
dc.identifier.uri  https://hdl.handle.net/11244/321112
dc.description.abstract  Complex deep learning objectives such as object detection and saliency, semantic segmentation, sequence-to-sequence translation, and others have given rise to training processes that require increasing amounts of time and computational resources. Human-in-the-loop approaches have addressed parts of this problem; one remaining pain point is model hyperparameter search, since common search methods carry high time costs and require iteratively training several models. Several algorithms have been proposed to manipulate a neural network's architecture and alleviate this cost; however, these algorithms require their own parameter tuning to achieve the desired performance and provide little to no intuition about how an architectural change may affect overall performance. In this thesis, I present EigenRemove and WeakExpand for the removal and addition of weights, providing a human-in-the-loop solution to the architecture search problem in both classical feedforward and convolutional neural network layers. EigenRemove yields results comparable to or better than the more popular Minimum Weight Selection pruning strategy, improving final test accuracy by 2-3% at larger compression rates on the VGG16 network. WeakExpand is compared with a trivial Zero Weight Expansion approach, in which new connections are assigned zero weight. WeakExpand produces final test accuracies on VGG16 comparable to those of Zero Weight Expansion, while providing new trainable weights rather than the dead weights produced by Zero Weight Expansion. Finally, I propose heuristics outlining how a user may apply WeakExpand and EigenRemove to achieve a desired effect based on the current state of their network's training.
dc.language  en
dc.rights  Attribution 4.0 International
dc.rights.uri  https://creativecommons.org/licenses/by/4.0/
dc.subject  machine learning
dc.subject  neural networks
dc.subject  human in the loop
dc.subject  singular value decomposition
dc.title  Applied High-Order Singular Value Decomposition for Weight Compression and Expansion in Deep Neural Networks
dc.contributor.committeeMember  McGovern, Amy
dc.contributor.committeeMember  Hougen, Dean
dc.date.manuscript  2019-07
dc.thesis.degree  Master of Science
ou.group  Gallogly College of Engineering::School of Computer Science
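
The abstract above contrasts three mechanisms: magnitude-based pruning (Minimum Weight Selection), SVD-based weight removal (the linear-algebra idea behind EigenRemove), and a Zero Weight Expansion baseline for growing layers. The NumPy sketch below is a rough illustration of those general ideas only; the function names (magnitude_prune, svd_truncate, zero_expand) and toy layer shapes are hypothetical stand-ins for this example and do not reproduce the thesis's actual EigenRemove or WeakExpand algorithms.

```python
# Minimal NumPy sketch of the ideas compared in the abstract.
# All names here are illustrative, not the thesis's implementation.
import numpy as np


def magnitude_prune(W, keep_ratio):
    """Minimum Weight Selection baseline: zero out the smallest-magnitude weights."""
    k = max(1, int(W.size * keep_ratio))
    threshold = np.sort(np.abs(W), axis=None)[-k]  # k-th largest magnitude
    return np.where(np.abs(W) >= threshold, W, 0.0)


def svd_truncate(W, rank):
    """SVD-based compression: keep only the top singular components of W,
    the core step behind SVD-style weight removal."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    return (U[:, :rank] * s[:rank]) @ Vt[:rank, :]


def zero_expand(W, new_units):
    """Zero Weight Expansion baseline: grow a layer with all-zero columns.
    The abstract notes that weights added this way tend to stay 'dead'."""
    return np.hstack([W, np.zeros((W.shape[0], new_units))])


rng = np.random.default_rng(0)
W = rng.standard_normal((64, 32))                # a toy dense-layer weight matrix
print(magnitude_prune(W, keep_ratio=0.5).shape)  # (64, 32), half the entries zeroed
print(svd_truncate(W, rank=8).shape)             # (64, 32), but rank at most 8
print(zero_expand(W, new_units=16).shape)        # (64, 48), 16 new zero columns
```

Truncating singular values gives a principled notion of which directions in a weight matrix carry the least information; the high-order SVD of the title extends this intuition from 2-D dense-layer matrices to the multi-dimensional kernel tensors of convolutional layers.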

