Learn Before
Concept

Working Process of Centered Deep Boltzmann machine

Considering Boltzmann machine as a set of units xx with a weight matrix UU and biases bb, the energy function is given by: E(x)=xTUxbTxE(x) = -x^{T}Ux-b^{T}x we can implement structures of Boltzmann machines with different numbers of layers, by partitioning xx into visible and hidden units and zeroing out elements of UU for units that do not interact. E(x;U,b)=(xμ)TU(xμ)(xμ)TbE^{'}(x;U,b) = -(x-\mu)^{T}U(x-\mu)-(x-\mu)^{T}b Typically μ\mu is a hyperparameter fixed at the beginning of training. It is usually chosen to make sure that xμ0x-\mu \approx 0 when the model is initialized.

0

1

Updated 2021-07-29

References


Tags

Data Science