Presentation 2021-03-03
The Design and Development of of Quantized Neural Networks Library for Exact Hardware Emulation
Masato Kiyama, Yasuhiro Nakahara, Motoki Amagasaki, Masahiro Iida,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Quantization is used to speed up execution time and save power when runnning Deep neural networks (DNNs) on edge devices or AI chips. To investigate the effect of quantization, we need performing inference after quantizing the weights of DNN with 32-bit floating-point numbers by a some bit width, and then quantizing them back to 32-bit floating-point numbers. This is because the DNN library can only handle floating-point numbers. However, the accuracy of the emulation does not provide accurate precision. We need accurate precision to detect overflow in MAC operations or to verify the operation on AI chips. We have developed PyParch, a DNN library that executes quantized DNNs (QNNs) with exactly the same behavior as hardware. In this paper, we describe a new proposal and implementation of PyParch. As a result of the evaluation, the accuracy of QNNs with arbitrary bit widths can be estimated for large and complex DNNs such as YOLOv5, and the overflow can be detected. We evaluated the overhead of the emulation time and found that it was 5.6 times slower for QNN and 42 times slower for QNN with overflow detection compared to the normal DNN execution time.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Deep Learning / Quantization
Paper # VLD2020-70,HWS2020-45
Date of Issue 2021-02-24 (VLD, HWS)

Conference Information
Committee HWS / VLD
Conference Date 2021/3/3(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Online
Topics (in Japanese) (See Japanese page)
Topics (in English) Design Technology for System-on-Silicon, Hardware Security, etc.
Chair Makoto Ikeda(Univ. of Tokyo) / Daisuke Fukuda(Fujitsu Labs.)
Vice Chair Yasuhisa Shimazaki(Renesas Electronics) / Makoto Nagata(Kobe Univ.) / Kazutoshi Kobayashi(Kyoto Inst. of Tech.)
Secretary Yasuhisa Shimazaki(Kyushu Univ.) / Makoto Nagata(NTT) / Kazutoshi Kobayashi(Hitachi)
Assistant / Takuma Nishimoto(Hitachi)

Paper Information
Registration To Technical Committee on Hardware Security / Technical Committee on VLSI Design Technologies
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) The Design and Development of of Quantized Neural Networks Library for Exact Hardware Emulation
Sub Title (in English)
Keyword(1) Deep Learning
Keyword(2) Quantization
1st Author's Name Masato Kiyama
1st Author's Affiliation Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
2nd Author's Name Yasuhiro Nakahara
2nd Author's Affiliation Graduate School of Science and Technology, Kumamoto University(Kumamoto Univ.)
3rd Author's Name Motoki Amagasaki
3rd Author's Affiliation Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
4th Author's Name Masahiro Iida
4th Author's Affiliation Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
Date 2021-03-03
Paper # VLD2020-70,HWS2020-45
Volume (vol) vol.120
Number (no) VLD-400,HWS-401
Page pp.pp.18-23(VLD), pp.18-23(HWS),
#Pages 6
Date of Issue 2021-02-24 (VLD, HWS)