Neural Comp. Sign up for ETOCS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Watanabe, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Watanabe, S.
(Neural Computation. 2001;13:899-933.)
© 2001 The MIT Press


Letter

Algebraic Analysis for Nonidentifiable Learning Machines

Sumio Watanabe

P&I Laboratory, Tokyo Institute of Technology, Yokohama, 226-8503 Japan

This article clarifies the relation between the learning curve and the algebraic geometrical structure of a nonidentifiable learning machine such as a multilayer neural network whose true parameter set is an analytic set with singular points. By using a concept in algebraic analysis, we rigorously prove that the Bayesian stochastic complexity or the free energy is asymptotically equal to {lambda}1logn-(m1-1)loglogn+ constant, where n is the number of training samples and {lambda}1 and m1 are the rational number and the natural number, which are determined as the birational invariant values of the singularities in the parameter space. Also we show an algorithm to calculate {lambda}1 and m1 based on the resolution of singularities in algebraic geometry. In regular statistical models, 2{lambda}1 is equal to the number of parameters and m1=1, whereas in nonregular models, such as multilayer networks, 2{lambda}1 is not larger than the number of parameters and m1>=1. Since the increase of the stochastic complexity is equal to the learning curve or the generalization error, the nonidentifiable learning machines are better models than the regular ones if Bayesian ensemble learning is applied.




This article has been cited by other articles:


Home page
Neural Comput.Home page
H. Wei, J. Zhang, F. Cousseau, T. Ozeki, and S.-i. Amari
Dynamics of learning near singularities in layered networks.
Neural Comput., March 1, 2008; 20(3): 813 - 843.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
S. Nakajima and S. Watanabe
Variational bayes solution of linear neural networks and its generalization performance.
Neural Comput., April 1, 2007; 19(4): 1112 - 1153.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
S.-i. Amari, H. Park, and T. Ozeki
Singularities affect dynamics of learning in neuromanifolds.
Neural Comput., May 1, 2006; 18(5): 1007 - 1065.
[Abstract] [Full Text] [PDF]


Home page
IEICE Trans Inf & SystHome page
S. NAKAJIMA and S. WATANABE
Generalization Performance of Subspace Bayes Approach in Linear Neural Networks
IEICE Trans D: Information, March 1, 2006; E89-D(3): 1128 - 1138.
[Abstract] [PDF]


Home page
Neural Comput.Home page
S.-i. Amari and H. Nakahara
Difficulty of Singularity in Population Coding
Neural Comput., April 1, 2005; 17(4): 839 - 858.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
T. Hayasaka, M. Kitahara, and S. Usui
On the Asymptotic Distribution of the Least-Squares Estimators in Unidentifiable Models
Neural Comput., January 1, 2004; 16(1): 99 - 114.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
K. Tsuda, S. Akaho, M. Kawanabe, and K.-R. Muller
Asymptotic Properties of the Fisher Kernel
Neural Comput., January 1, 2004; 16(1): 115 - 137.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
S. Watanabe and S.-i. Amari
Learning Coefficients of Layered Models When the True Distribution Mismatches the Singularities
Neural Comput., May 1, 2003; 15(5): 1013 - 1033.
[Abstract] [Full Text] [PDF]


Home page
Neural Comput.Home page
K. Hagiwara
On the Problem in Model Selection of Neural Network Regression in Overrealizable Scenario
Neural Comput., August 1, 2002; 14(8): 1979 - 2002.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
J COGNITIVE NEUROSCIENCE NEURAL COMPUTATION MIT PRESS JOURNALS
Copyright © 2001 by The MIT Press.