Generalization Error by Langevin Equation in Singular Learning Machines

Taruhi Iwagaki; Sumio Watanabe

Summary

International Symposium on Nonlinear Theory and its Applications

2009

Session Number:B2L-D

Session:

Number:B2L-D2

Generalization Error by Langevin Equation in Singular Learning Machines

Taruhi Iwagaki, Sumio Watanabe,

pp.-

Publication Date:2009/10/18

Online ISSN:2188-5079

DOI:10.34385/proc.43.B2L-D2

PDF download (161.6KB)

Summary:

The Langevin equation implies an algorithm that can generate samples from the stationary distribution of a biased random walk, which is equivalent to the posterior distribution of Bayesian learning. The Langevin algorithm uses gradient information of the target distribution; therefore it is expected to be more efficient than the Metropolis method especially in wide parameter space of singular learning machines e.g. neural networks. In this paper, we will discuss experimental results of generalization errors of both the Langevin algorithm and the Metropolis method for neural networks with practical dimension.