Presentation | 2019-06-18 Theoretical Analysis of the Fixup Initialization for Fast Convergence and High Generalization Ability Yasutaka Furusho, Kazushi Ikeda, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The Fixup initialization is a new initialization method of ResNet for a fast convergence with a high learning rate of SGD and a high generalization ability. However, the reasons for its high performance are not clear. Both the learning rate and the generalization ability are affected by the loss landscape at initialization, that is, the maximum eigenvalue of the Hessian matrix. Thus, we calculated the maximum eigenvalue of the ResNet and found that the maximum eigenvalue with the Fixup initialization has at most the square root order with respect to the depth of the ResNet while that with the He initialization has the exponential order. This small eigenvalue leads to a fast convergence with a high learning rate of SGD and a high generalization ability. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Deep neural network / ResNet / Initialization / Fixup |
Paper # | NC2019-19,IBISML2019-17 |
Date of Issue | 2019-06-10 (NC, IBISML) |
Conference Information | |
Committee | NC / IBISML / IPSJ-MPS / IPSJ-BIO |
---|---|
Conference Date | 2019/6/17(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Okinawa Institute of Science and Technology |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Neurocomputing, Machine Learning Approach to Biodata Mining, and General |
Chair | Hayaru Shouno(UEC) / Hisashi Kashima(Kyoto Univ.) / Masakazu Sekijima(Tokyo Tech) / Hiroyuki Kurata(Kyutech) |
Vice Chair | Kazuyuki Samejima(Tamagawa Univ) / Masashi Sugiyama(Univ. of Tokyo) / Koji Tsuda(Univ. of Tokyo) |
Secretary | Kazuyuki Samejima(NAIST) / Masashi Sugiyama(NTT) / Koji Tsuda(Nagoya Inst. of Tech.) / (AIST) / (Nagoya Univ.) |
Assistant | Takashi Shinozaki(NICT) / Ken Takiyama(TUAT) / Tomoharu Iwata(NTT) / Shigeyuki Oba(Kyoto Univ.) |
Paper Information | |
Registration To | Technical Committee on Neurocomputing / Technical Committee on Infomation-Based Induction Sciences and Machine Learning / IPSJ Special Interest Group on Mathematical Modeling and Problem Solving / IPSJ Special Interest Group on Bioinformatics and Genomics |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Theoretical Analysis of the Fixup Initialization for Fast Convergence and High Generalization Ability |
Sub Title (in English) | |
Keyword(1) | Deep neural network |
Keyword(2) | ResNet |
Keyword(3) | Initialization |
Keyword(4) | Fixup |
1st Author's Name | Yasutaka Furusho |
1st Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
2nd Author's Name | Kazushi Ikeda |
2nd Author's Affiliation | Nara Institute of Science and Technology(NAIST) |
Date | 2019-06-18 |
Paper # | NC2019-19,IBISML2019-17 |
Volume (vol) | vol.119 |
Number (no) | NC-88,IBISML-89 |
Page | pp.pp.87-92(NC), pp.109-114(IBISML), |
#Pages | 6 |
Date of Issue | 2019-06-10 (NC, IBISML) |