Training generalizable quantized deep neural nets

Panos M. Pardalos; Charles Hernandez; Bijan Taslimi; Hung Yi; Hongcheng Liu

doi:10.1016/j.eswa.2022.118736

Research Article, April 2023

Expert Systems with Applications

Abstract

While a number of practical methods for training quantized deep learning (DL) models have been presented in the literature, there is a critical gap in the theoretical generalizability results for such approaches. Although empirical evidence often suggests that DL architectures tolerate variations in training procedures well, existing theoretical generalization analyses are often contingent on the specific design of the training algorithm, e.g., stochastic gradient descent (SGD). This specialization makes such generalizability results inapplicable to quantized DL models. To address this gap, this paper provides several almost-algorithm-independent results that ensure the generalizability of a quantized neural network at different levels of optimality. These results include a characterization of a computable, quantized local solution that ensures generalization performance, and an algorithm that provably converges to such a local solution.

Details

Volume 213

Issue Part B

Pages 118736

DOI 10.1016/j.eswa.2022.118736

ISSN 1873-6793
