正確なハードウェアエミュレーションのための量子化DNNライブラリの開発

木山 真人; 中原 康宏; 尼崎 太樹; 飯田 全広

Presentation	2021-03-03 The Design and Development of of Quantized Neural Networks Library for Exact Hardware Emulation Masato Kiyama, Yasuhiro Nakahara, Motoki Amagasaki, Masahiro Iida,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Quantization is used to speed up execution time and save power when runnning Deep neural networks (DNNs) on edge devices or AI chips. To investigate the effect of quantization, we need performing inference after quantizing the weights of DNN with 32-bit floating-point numbers by a some bit width, and then quantizing them back to 32-bit floating-point numbers. This is because the DNN library can only handle floating-point numbers. However, the accuracy of the emulation does not provide accurate precision. We need accurate precision to detect overflow in MAC operations or to verify the operation on AI chips. We have developed PyParch, a DNN library that executes quantized DNNs (QNNs) with exactly the same behavior as hardware. In this paper, we describe a new proposal and implementation of PyParch. As a result of the evaluation, the accuracy of QNNs with arbitrary bit widths can be estimated for large and complex DNNs such as YOLOv5, and the overflow can be detected. We evaluated the overhead of the emulation time and found that it was 5.6 times slower for QNN and 42 times slower for QNN with overflow detection compared to the normal DNN execution time.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Deep Learning / Quantization
Paper #	VLD2020-70,HWS2020-45
Date of Issue	2021-02-24 (VLD, HWS)

Conference Information
Committee	HWS / VLD
Conference Date	2021/3/3(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Online
Topics (in Japanese)	(See Japanese page)
Topics (in English)	Design Technology for System-on-Silicon, Hardware Security, etc.
Chair	Makoto Ikeda(Univ. of Tokyo) / Daisuke Fukuda(Fujitsu Labs.)
Vice Chair	Yasuhisa Shimazaki(Renesas Electronics) / Makoto Nagata(Kobe Univ.) / Kazutoshi Kobayashi(Kyoto Inst. of Tech.)
Secretary	Yasuhisa Shimazaki(Kyushu Univ.) / Makoto Nagata(NTT) / Kazutoshi Kobayashi(Hitachi)
Assistant	/ Takuma Nishimoto(Hitachi)

Paper Information
Registration To	Technical Committee on Hardware Security / Technical Committee on VLSI Design Technologies
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	The Design and Development of of Quantized Neural Networks Library for Exact Hardware Emulation
Sub Title (in English)
Keyword(1)	Deep Learning
Keyword(2)	Quantization
1st Author's Name	Masato Kiyama
1st Author's Affiliation	Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
2nd Author's Name	Yasuhiro Nakahara
2nd Author's Affiliation	Graduate School of Science and Technology, Kumamoto University(Kumamoto Univ.)
3rd Author's Name	Motoki Amagasaki
3rd Author's Affiliation	Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
4th Author's Name	Masahiro Iida
4th Author's Affiliation	Faculty of Advanced Science and Technology, Kumamoto University(Kumamoto Univ.)
Date	2021-03-03
Paper #	VLD2020-70,HWS2020-45
Volume (vol)	vol.120
Number (no)	VLD-400,HWS-401
Page	pp.pp.18-23(VLD), pp.18-23(HWS),
#Pages	6
Date of Issue	2021-02-24 (VLD, HWS)