Neural Comp. NEW Faster Access
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Schmidhuber, J.
Right arrow Articles by Gomez, F.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Schmidhuber, J.
Right arrow Articles by Gomez, F.
(Neural Computation. 2007;19:757-779.)
© 2007 The MIT Press


Letter

Training Recurrent Networks by Evolino

Jürgen Schmidhuber

juergen{at}idsia.ch IDSIA, 6928 Manno (Lugano), Switzerland, and TU Munich, 85748 Garching, München, Germany

Daan Wierstra

daan{at}idsia.ch

Matteo Gagliolo

matteo{at}idsia.ch

Faustino Gomez

tino{at}idsia.ch IDSIA, 6928 Manno (Lugano), Switzerland

In recent years, gradient-based LSTM recurrent neural networks (RNNs) solved many previously RNN-unlearnable tasks. Sometimes, however, gradient information is of little use for training RNNs, due to numerous local minima. For such cases, we present a novel method: EVOlution of systems with LINear Outputs (Evolino). Evolino evolves weights to the nonlinear, hidden nodes of RNNs while computing optimal linear mappings from hidden state to output, using methods such as pseudo-inverse-based linear regression. If we instead use quadratic programming to maximize the margin, we obtain the first evolutionary recurrent support vector machines. We show that Evolino-based LSTM can solve tasks that Echo State nets (Jaeger, 2004a) cannot and achieves higher accuracy in certain continuous function generation tasks than conventional gradient descent RNNs, including gradient-based LSTM.







HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
J COGNITIVE NEUROSCIENCE NEURAL COMPUTATION MIT PRESS JOURNALS
Copyright © 2007 by The MIT Press.