Formula

Xavier Initialization with a Uniform Distribution

Xavier initialization can also be adapted for sampling weights from a uniform distribution instead of a Gaussian one. A uniform distribution U(a,a)U(-a, a) has a variance of a23\frac{a^2}{3}. By setting this equal to the Xavier variance condition σ2=2nextrmin+nextrmout\sigma^2 = \frac{2}{n_ extrm{in} + n_ extrm{out}} and solving for aa, we obtain a=6nextrmin+nextrmouta = \sqrt{\frac{6}{n_ extrm{in} + n_ extrm{out}}}. Therefore, the uniform version of Xavier initialization samples weights according to the distribution U\left(-\sqrt{\frac{6}{n_ extrm{in} + n_ extrm{out}}}, \sqrt{\frac{6}{n_ extrm{in} + n_ extrm{out}}} ight).

0

1

Updated 2026-05-06

Tags

Data Science

D2L

Dive into Deep Learning @ D2L