Method and apparatus for pruning neural networks

Enzo Tartaglione, Marco Grangetto, Francesco Odierna, Andrea Bragagnolo, Attilio Fiandrotti

January 2022

Abstract

The present invention relates to a method for pruning a neural network comprising a plurality of neurons, said method comprising: an initialization phase, wherein input information is fetched comprising at least parameters $(w_{ni}, b_{ni})$ related to said neural network and a dataset $(D)$ representative of a task that said neural network has to deal with, wherein said parameters $(w_{ni}, b_{ni})$ comprise a weights vector $(w_{ni})$ and/or a bias $(b_{ni})$ related to at least one neuron of said plurality of neurons; a regularization phase, wherein said neural network is trained according to a training algorithm by using said dataset $(D)$; and a thresholding phase, wherein an element $(w_{nij})$ of said weights vector $(w_{ni})$ is set to zero when its absolute value is below a given threshold $(T)$.
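The thresholding phase described in the abstract can be illustrated with a minimal sketch. It covers only the thresholding step, not the initialization or regularization phases, and it is not the claimed implementation. The function name `threshold_weights`, the example weights, and the threshold value are hypothetical and chosen purely for illustration.

```python
def threshold_weights(w, T):
    """Return a copy of the weights vector w in which every element
    whose absolute value is strictly below the threshold T is set to zero."""
    return [0.0 if abs(w_j) < T else w_j for w_j in w]


# Hypothetical weights vector w_ni of a single neuron (assumed values).
w_ni = [0.8, -0.03, 0.002, -0.5, 0.04]

# Hypothetical threshold T (assumed value, not taken from the patent).
T = 0.05

pruned = threshold_weights(w_ni, T)

# Fraction of elements that were set to zero by the thresholding step.
sparsity = sum(1 for w_j in pruned if w_j == 0.0) / len(pruned)

print(pruned)
print(sparsity)
```

In this sketch an element exactly equal to the threshold is kept, following the "below a given threshold" wording; a different treatment of the boundary case would be an equally plausible reading.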

Type

Patent

Publication

US Patent App. 17/769,707

Marco Grangetto

Full Professor

Attilio Fiandrotti

Associate Professor