Neural Comp. Sign up for ETOCS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ma, J.
Right arrow Articles by Jordan, M. I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ma, J.
Right arrow Articles by Jordan, M. I.
(Neural Computation. 2000;12:2881-2907.)
© 2000 The MIT Press


Letter

Asymptotic Convergence Rate of the EM Algorithm for Gaussian Mixtures

Jinwen Ma

Department of Computer Science & Engineering, The Chinese University of Hong Kong, Shatin Hong Kong and Institute of Mathematics, Shantou University, Shantou, Guangdong, 515063, People's Republic of China

Lei Xu

Department of Computer Science & Engineering, The Chinese University of Hong Kong, Shatin Hong Kong, People's Republic of China

Michael I. Jordan

Department of Computer Science and Department of Statistics, University of California at Berkeley, Berkeley, CA 94720, U.S.A.

It is well known that the convergence rate of the expectation-maximization (EM) algorithm can be faster than those of convention first-order iterative algorithms when the overlap in the given mixture is small. But this argument has not been mathematically proved yet. This article studies this problem asymptotically in the setting of gaussian mixtures under the theoretical framework of Xu and Jordan (1996). It has been proved that the asymptotic convergence rate of the EM algorithm for gaussian mixtures locally around the true solution {Theta}* is o(e0.5-{epsilon}({Theta}*)), where {epsilon}>0 is an arbitrarily small number, o(x) means that it is a higher-order infinitesimal as x->0, and e({Theta}*) is a measure of the average overlap of gaussians in the mixture. In other words, the large sample local convergence rate for the EM algorithm tends to be asymptotically superlinear when e({Theta}*) tends to zero.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
J COGNITIVE NEUROSCIENCE NEURAL COMPUTATION MIT PRESS JOURNALS
Copyright © 2000 by The MIT Press.