1Cademy - Effective Observation Window of RMSProp

Learn Before

RMSprop (Deep Learning Optimization Algorithm)

Concept

Effective Observation Window of RMSProp

In the RMSProp optimization algorithm, the effective observation window for the exponentially weighted average of squared gradients is defined by the quantity $\frac{1}{1 - \gamma}$ , where $\gamma$ is the weighting term (or decay factor). This means the state variable aggregates information over approximately the past $\frac{1}{1 - \gamma}$ observations. A larger $\gamma$ produces a longer memory and a smoother average, while a smaller $\gamma$ makes the algorithm more responsive to recent gradients. For example, setting $\gamma = 0.9$ yields an effective window of $\frac{1}{1 - 0.9} = 10$ observations.

Updated 2026-06-17

Contributors are:

Who are from:

References

Dive into Deep Learning
Dive into Deep Learning

Learn Before

Related