Neural Computation, Vol 10, 251-276, Copyright © 1998 by The MIT Press


ARTICLES

Natural Gradient Works Efficiently in Learning

Shun-ichi Amari

When a parameter space has a certain underlying structure, the ordinary gradient of a function does not represent its steepest direction, but the natural gradient does. Information geometry is used for calculating the natural gradients in the parameter space of perceptrons, the space of matrices (for blind source separation), and the space of linear dynamical systems (for blind source deconvolution). The dynamical behavior of natural gradient online learning is analyzed and is proved to be Fisher efficient, implying that it has asymptotically the same performance as the optimal batch estimation of parameters. This suggests that the plateau phenomenon, which appears in the backpropagation learning algorithm of multilayer perceptrons, might disappear or might not be so serious when the natural gradient is used. An adaptive method of updating the learning rate is proposed and analyzed.
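The abstract's first claim can be illustrated with a toy sketch. This is not taken from the article: the model (a two-parameter linear regression with unit-variance Gaussian noise), the data, and all function names below are assumptions made for illustration. For this model the Fisher information matrix is G = E[x x^T], and the natural gradient is G^{-1} times the ordinary gradient. Because the loss here is quadratic, the natural gradient step happens to coincide with a Newton step; that is a property of this special case, not of natural gradient learning in general.

```python
def solve_2x2(G, g):
    """Return G^{-1} g for a 2x2 matrix G using the closed-form inverse."""
    (a, b), (c, d) = G
    det = a * d - b * c
    return [(d * g[0] - b * g[1]) / det, (-c * g[0] + a * g[1]) / det]


def gradient(w, data):
    """Ordinary gradient of the loss 0.5 * mean((w.x - y)^2)."""
    n = len(data)
    g = [0.0, 0.0]
    for x, y in data:
        err = w[0] * x[0] + w[1] * x[1] - y
        g[0] += err * x[0] / n
        g[1] += err * x[1] / n
    return g


def fisher(data):
    """Empirical Fisher information for the assumed model: mean of x x^T."""
    n = len(data)
    G = [[0.0, 0.0], [0.0, 0.0]]
    for x, _ in data:
        for i in range(2):
            for j in range(2):
                G[i][j] += x[i] * x[j] / n
    return G


def descend(data, steps, lr, natural):
    """Run batch descent from w = (0, 0) with the ordinary or natural gradient."""
    w = [0.0, 0.0]
    G = fisher(data)
    for _ in range(steps):
        g = gradient(w, data)
        d = solve_2x2(G, g) if natural else g  # natural gradient = G^{-1} g
        w = [w[0] - lr * d[0], w[1] - lr * d[1]]
    return w


# Hypothetical noise-free data whose second input is about 100x smaller
# than the first, so the two parameters live on very different scales.
TRUE_W = [2.0, -3.0]
inputs = [(1.0, 0.01), (2.0, -0.02), (-1.0, 0.03), (0.5, 0.02)]
data = [(x, TRUE_W[0] * x[0] + TRUE_W[1] * x[1]) for x in inputs]

w_plain = descend(data, steps=50, lr=0.5, natural=False)
w_natural = descend(data, steps=50, lr=0.5, natural=True)
print("ordinary gradient:", w_plain)
print("natural gradient: ", w_natural)
```

With the same learning rate and number of steps, the natural gradient run should recover both parameters, while the ordinary gradient run barely moves the poorly scaled second parameter. The sketch is a batch version; the article itself analyzes online learning, which this example does not attempt to reproduce.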


This article has been cited by other articles:


Z. He, S. Xie, L. Zhang, and A. Cichocki
A note on Lewicki-Sejnowski gradient for learning overcomplete representations
Neural Comput., March 1, 2008; 20(3): 636-643.

H. Wei, J. Zhang, F. Cousseau, T. Ozeki, and S.-i. Amari
Dynamics of learning near singularities in layered networks
Neural Comput., March 1, 2008; 20(3): 813-843.

X. Zhang, N. Zhang, J. Lu, and T. Yahagi
Independent Component Analysis for Image Recovery Using SOM-Based Noise Detection
IEICE Trans A: Fundamentals, June 1, 2007; E90-A(6): 1125-1132.

J. Even and K. Sugimoto
Blind Identification for Systems Non-Invertible at Infinity
IEICE Trans A: Fundamentals, June 1, 2007; E90-A(6): 1133-1143.

S. Fiori
A Study on Neural Learning on Manifold Foliations: The Case of the Lie Group SU(3)
Neural Comput., April 1, 2008; 20(4): 1091-1117.

K. Miura, M. Okada, and S.-i. Amari
Estimating Spiking Irregularities Under Changing Environments
Neural Comput., October 1, 2006; 18(10): 2359-2386.

K. Ikeda
Geometric Properties of Quasi-Additive Learning Algorithms
IEICE Trans A: Fundamentals, October 1, 2006; E89-A(10): 2812-2817.

J. Basak
Online adaptive decision trees: pattern classification and function approximation
Neural Comput., September 1, 2006; 18(9): 2062-2101.

M. Tufail, M. Abe, and M. Kawamata
An Extension to the Natural Gradient Algorithm for Robust Independent Component Analysis in the Presence of Outliers
IEICE Trans A: Fundamentals, September 1, 2006; E89-A(9): 2429-2432.

S.-i. Amari, H. Park, and T. Ozeki
Singularities affect dynamics of learning in neuromanifolds
Neural Comput., May 1, 2006; 18(5): 1007-1065.

X.-L. Zhu, X.-D. Zhang, and J.-M. Ye
A generalized contrast function and stability analysis for overdetermined blind separation of instantaneous mixtures
Neural Comput., March 1, 2006; 18(3): 709-728.

S. Makino, H. Sawada, R. Mukai, and S. Araki
Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain
IEICE Trans A: Fundamentals, July 1, 2005; E88-A(7): 1640-1655.

A. Ando, M. Iwaki, K. Ono, and K. Kurozumi
Separation of Sound Sources Propagated in the Same Direction
IEICE Trans A: Fundamentals, July 1, 2005; E88-A(7): 1665-1672.

S.-i. Amari and H. Nakahara
Difficulty of Singularity in Population Coding
Neural Comput., April 1, 2005; 17(4): 839-858.

K. Zhang and L.-W. Chan
Extended Gaussianization Method for Blind Separation of Post-Nonlinear Mixtures
Neural Comput., February 1, 2005; 17(2): 425-452.

S. Ikeda, T. Tanaka, and S.-i. Amari
Stochastic Reasoning, Free Energy, and Information Geometry
Neural Comput., September 1, 2004; 16(9): 1779-1810.

J. Basak
Online Adaptive Decision Trees
Neural Comput., September 1, 2004; 16(9): 1959-1981.

J.-M. Ye, X.-L. Zhu, and X.-D. Zhang
Adaptive Blind Separation with an Unknown Number of Sources
Neural Comput., August 1, 2004; 16(8): 1641-1660.

H. Park, N. Murata, and S.-i. Amari
Improving Generalization Performance of Natural Gradient Learning Using Optimized Regularization by NIC
Neural Comput., February 1, 2004; 16(2): 355-382.

S. Zeki, R. J. Perry, and A. Bartels
The Processing of Kinetic Contours in the Brain
Cereb Cortex, February 1, 2003; 13(2): 189-202.

N. Ay
Locality of Global Stochastic Interaction in Directed Acyclic Networks
Neural Comput., December 1, 2002; 14(12): 2959-2980.

K. Hagiwara
On the Problem in Model Selection of Neural Network Regression in Overrealizable Scenario
Neural Comput., August 1, 2002; 14(8): 1979-2002.

N. N. Schraudolph
Fast Curvature Matrix-Vector Products for Second-Order Gradient Descent
Neural Comput., July 1, 2002; 14(7): 1723-1738.

M. Zlochin and Y. Baram
Manifold Stochastic Dynamics for Bayesian Learning
Neural Comput., November 1, 2001; 13(11): 2549-2572.

S. Fiori
A Theory for Learning by Weight Flow on Stiefel-Grassman Manifold
Neural Comput., July 1, 2001; 13(7): 1625-1647.

M.-a. Sato
Online Model Selection Based on the Variational Bayes
Neural Comput., July 1, 2001; 13(7): 1649-1681.

S.-i. Amari
Estimating Functions of Independent Component Analysis for Temporally Correlated Signals
Neural Comput., September 1, 2000; 12(9): 2083-2107.

S.-i. Amari, H. Park, and K. Fukumizu
Adaptive Method of Realizing Natural Gradient Learning for Multilayer Perceptrons
Neural Comput., June 1, 2000; 12(6): 1399-1409.

A. Navia-Vázquez and A. R. Figueiras-Vidal
Efficient Block Training of Multilayer Perceptrons
Neural Comput., June 1, 2000; 12(6): 1429-1447.

T. Heskes
On "Natural" Learning and Pruning in Multilayered Perceptrons
Neural Comput., April 1, 2000; 12(4): 881-901.

S.-i. Amari
Natural Gradient Learning for Over- and Under-Complete Bases in ICA
Neural Comput., November 15, 1999; 11(8): 1875-1883.

P. van de Laar and T. Heskes
Pruning Using Parameter and Neuronal Metrics
Neural Comput., May 15, 1999; 11(4): 977-993.

J. Basak and S.-i. Amari
Blind Separation of a Mixture of Uniformly Distributed Source Signals: A Novel Approach
Neural Comput., May 15, 1999; 11(4): 1011-1034.

S. Makeig, M. Westerfield, T.-P. Jung, J. Covington, J. Townsend, T. J. Sejnowski, and E. Courchesne
Functionally Independent Components of the Late Positive Event-Related Potential during Visual Spatial Attention
J. Neurosci., April 1, 1999; 19(7): 2665-2680.