TY - GEN
T1 - Self-compression in bayesian neural networks
AU - Carannante, Giuseppina
AU - Dera, Dimah
AU - Rasool, Ghulam
AU - Bouaynaya, Nidhal C.
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/9
Y1 - 2020/9
N2 - Machine learning models have achieved human-level performance on various tasks. This success comes at a high cost of computation and storage overhead, which makes machine learning algorithms difficult to deploy on edge devices. Typically, one has to partially sacrifice accuracy in favor of an increased performance quantified in terms of reduced memory usage and energy consumption. Current methods compress the networks by reducing the precision of the parameters or by eliminating redundant ones. In this paper, we propose a new insight into network compression through the Bayesian framework. We show that Bayesian neural networks automatically discover redundancy in model parameters, thus enabling self-compression, which is linked to the propagation of uncertainty through the layers of the network. Our experimental results show that the network architecture can be successfully compressed by deleting parameters identified by the network itself while retaining the same level of accuracy.
AB - Machine learning models have achieved human-level performance on various tasks. This success comes at a high cost of computation and storage overhead, which makes machine learning algorithms difficult to deploy on edge devices. Typically, one has to partially sacrifice accuracy in favor of an increased performance quantified in terms of reduced memory usage and energy consumption. Current methods compress the networks by reducing the precision of the parameters or by eliminating redundant ones. In this paper, we propose a new insight into network compression through the Bayesian framework. We show that Bayesian neural networks automatically discover redundancy in model parameters, thus enabling self-compression, which is linked to the propagation of uncertainty through the layers of the network. Our experimental results show that the network architecture can be successfully compressed by deleting parameters identified by the network itself while retaining the same level of accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85096461924&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096461924&partnerID=8YFLogxK
U2 - 10.1109/MLSP49062.2020.9231550
DO - 10.1109/MLSP49062.2020.9231550
M3 - Conference contribution
AN - SCOPUS:85096461924
T3 - IEEE International Workshop on Machine Learning for Signal Processing, MLSP
BT - Proceedings of the 2020 IEEE 30th International Workshop on Machine Learning for Signal Processing, MLSP 2020
PB - IEEE Computer Society
T2 - 30th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2020
Y2 - 21 September 2020 through 24 September 2020
ER -