Early stopping
In machine learning, early stopping is a form of regularization used when a machine learning model (such as a neural network) is trained by on-line gradient descent. In early stopping, the training set is split into a new, smaller training set and a validation set. Gradient descent is applied to the new training set. After each sweep (epoch) through the new training set, the network is evaluated on the validation set, and the network with the best validation performance seen so far is retained. Training is halted once performance on the validation set stops improving, and the retained network is then used for actual testing.
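The following is a minimal sketch of this loop in Python. It is not from the article: the synthetic linear-regression data, the learning rate, and the patience threshold (how many sweeps to wait without improvement before stopping) are all illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression problem, split into a new (smaller) training set
# and a validation set.
X = rng.normal(size=(200, 10))
y = X @ rng.normal(size=10) + rng.normal(scale=0.5, size=200)
X_train, y_train = X[:150], y[:150]
X_val, y_val = X[150:], y[150:]

w = np.zeros(10)             # model weights
learning_rate = 0.01
patience = 5                 # sweeps to wait without improvement (assumed)
best_val_loss = np.inf
best_w = w.copy()
bad_sweeps = 0

for sweep in range(200):
    # One sweep through the new training set, updating on each example
    # (on-line gradient descent).
    for xi, yi in zip(X_train, y_train):
        gradient = 2.0 * (xi @ w - yi) * xi   # gradient of the squared error
        w -= learning_rate * gradient

    # Evaluate on the validation set after each sweep.
    val_loss = np.mean((X_val @ w - y_val) ** 2)
    if val_loss < best_val_loss:
        best_val_loss = val_loss
        best_w = w.copy()    # remember the best network seen so far
        bad_sweeps = 0
    else:
        bad_sweeps += 1
        if bad_sweeps >= patience:
            break            # stop early: validation loss has stopped improving

w = best_w                   # the best-performing weights are used for testing
print(f"stopped after {sweep + 1} sweeps, validation MSE = {best_val_loss:.4f}")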
This technique is a simple but effective way to deal with the problem of overfitting. Overfitting is a phenomenon in which a learning system, such as a neural network, becomes very good at fitting one data set at the expense of performing poorly on other data sets. Early stopping limits the weights the network can reach during training and thus imposes a form of regularization, effectively lowering the VC dimension.
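As a hypothetical illustration of overfitting (not from the article), the sketch below fits polynomials of two different degrees to noisy samples of a sine curve. The high-degree fit matches its training points almost exactly but typically generalizes worse to held-out points from the same curve.

import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0.0, 1.0, 20)
y = np.sin(2.0 * np.pi * x) + rng.normal(scale=0.1, size=20)
x_train, y_train = x[::2], y[::2]   # one data set (10 points)
x_val, y_val = x[1::2], y[1::2]     # another data set (10 points)

for degree in (3, 9):
    # Fit a polynomial of the given degree to the training points.
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    val_mse = np.mean((np.polyval(coeffs, x_val) - y_val) ** 2)
    print(f"degree {degree}: train MSE {train_mse:.5f}, validation MSE {val_mse:.5f}")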
Early stopping is a very common practice in neural network training and often produces networks that generalize well. However, while it often improves generalization, it does not do so in a mathematically well-defined way.
See also: Cross-validation, in particular the use of a validation set.