Skip to main navigation Skip to search Skip to main content

Objective functions of online weight noise injection training algorithms for MLPs

Research output: Journal Publications and ReviewsRGC 22 - Publication in policy or professional journal

Abstract

Injecting weight noise during training has been a simple strategy to improve the fault tolerance of multilayer perceptrons (MLPs) for almost two decades, and several online training algorithms have been proposed in this regard. However, there are some misconceptions about the objective functions being minimized by these algorithms. Some existing results misinterpret that the prediction error of a trained MLP affected by weight noise is equivalent to the objective function of a weight noise injection algorithm. In this brief, we would like to clarify these misconceptions. Two weight noise injection scenarios will be considered: one is based on additive weight noise injection and the other is based on multiplicative weight noise injection. To avoid the misconceptions, we use their mean updating equations to analyze the objective functions. For injecting additive weight noise during training, we show that the true objective function is identical to the prediction error of a faulty MLP whose weights are affected by additive weight noise. It consists of the conventional mean square error and a smoothing regularizer. For injecting multiplicative weight noise during training, we show that the objective function is different from the prediction error of a faulty MLP whose weights are affected by multiplicative weight noise. With our results, some existing misconceptions regarding MLP training with weight noise injection can now be resolved. © 2010 IEEE.
Original languageEnglish
Article number5674088
Pages (from-to)317-323
JournalIEEE Transactions on Neural Networks
Volume22
Issue number2
DOIs
Publication statusPublished - Feb 2011

Research Keywords

  • Fault tolerance
  • prediction error
  • weight noise injection

Fingerprint

Dive into the research topics of 'Objective functions of online weight noise injection training algorithms for MLPs'. Together they form a unique fingerprint.

Cite this