Concept

Variational AutoEncoder Encoder

If we have a mean μ\mu and a symmetric covariance matrix CC in 2D, they are defined as:

μ=[μ1μ2]\mu = \begin{bmatrix} \mu_1 \\ \mu_2 \end{bmatrix}

C=[σ12rσ1σ2rσ1σ2σ22]C = \begin{bmatrix} \sigma_{1}^{2} & r \sigma_1 \sigma_2 \\ r \sigma_1 \sigma_2 & \sigma_{2}^{2} \end{bmatrix}

In a Variational AutoEncoder, it is assumed that there is no correlation between these dimensions. This means that the encoder needs to map each input to mean and variance vectors. As a result, the image will be encoded into two vectors:

  1. mu (μ\mu) - the mean point distribution.
  2. log_var - the logarithm of the variance of each dimension.

In order to encode an image into a specific point, we can use this formula:

z=μ+exp(log_var/2)z = \mu + \exp(log\_var / 2)

0

1

Updated 2026-05-08

Tags

Data Science