Learn Before
Concept

Probabilistic PCA

Probabilistic PCA is a dimensionality reduction technique that analyzes data via a lower dimensional latent space. The PCA probability model is a slightly modified factor analysis model that uses W\textbf {{W}} W T\textbf{{W}}^{ ~T} + σ2I\sigma^{2}\textbf{{I}} as the covariance of x\textbf{x} where σ2\sigma^{2} is now a scalar:

xN(x;b,WW T+σ2I)\textbf{x} \sim {N} (\textbf{x}; \textbf{{b}},\textbf {{W}}\textbf{{W}}^{ ~T} + \sigma^{2}\textbf{{I}})

which can be equivalently expressed as:

x=Wh+b+σz\textbf{x} = \textbf{Wh} + \textbf{b} + \sigma \textbf{z}

where z\textbf{z} \sim N( z ; 0, I)\textbf{N( z ; 0, {I})} is noise, x\textbf{x} is a data vector, h\textbf{h} is a latent varibale, and W\textbf{W} is a set of principal axes relates the latent variables to the data represented as a matrix.

0

0

Updated 2021-07-08

References


Tags

Data Science