Probabilistic PCA

Probabilistic PCA is a dimensionality reduction technique that analyzes data via a lower-dimensional latent space. The probabilistic PCA model is a slightly modified factor analysis model: instead of a separate noise variance for each dimension, it uses a single scalar $\sigma^{2}$, so that the covariance of $\mathbf{x}$ is $\mathbf{W}\mathbf{W}^{\top} + \sigma^{2}\mathbf{I}$:

$$\mathbf{x} \sim \mathcal{N}(\mathbf{x}; \mathbf{b}, \mathbf{W}\mathbf{W}^{\top} + \sigma^{2}\mathbf{I})$$

which can be equivalently expressed as:

$$\mathbf{x} = \mathbf{W}\mathbf{h} + \mathbf{b} + \sigma\mathbf{z}$$

where $\mathbf{z} \sim \mathcal{N}(\mathbf{z}; \mathbf{0}, \mathbf{I})$ is Gaussian noise, $\mathbf{x}$ is a data vector, $\mathbf{h}$ is a vector of latent variables with a standard normal prior $\mathcal{N}(\mathbf{h}; \mathbf{0}, \mathbf{I})$, and $\mathbf{W}$ is a matrix of principal axes that relates the latent variables to the data.
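As a rough numerical check of this equivalence, the sketch below samples from the generative form and compares the empirical mean and covariance with $\mathbf{b}$ and $\mathbf{W}\mathbf{W}^{\top} + \sigma^{2}\mathbf{I}$. The function name `sample_ppca`, the dimensions, and the parameter values are illustrative assumptions rather than part of the original description, and the latent variables are assumed to follow a standard normal prior.

```python
import numpy as np

def sample_ppca(W, b, sigma, n_samples, rng):
    """Draw samples x = W h + b + sigma * z with h, z ~ N(0, I).

    Hypothetical helper for illustration; W has shape (d, k).
    """
    d, k = W.shape
    h = rng.standard_normal((n_samples, k))  # latent variables, one row per sample
    z = rng.standard_normal((n_samples, d))  # isotropic Gaussian noise
    return h @ W.T + b + sigma * z

rng = np.random.default_rng(0)
d, k = 5, 2                      # assumed data and latent dimensions
W = rng.standard_normal((d, k))  # arbitrary example loading matrix
b = rng.standard_normal(d)       # arbitrary example mean vector
sigma = 0.5                      # arbitrary example noise scale

X = sample_ppca(W, b, sigma, n_samples=200_000, rng=rng)

# Covariance implied by the model versus covariance measured from samples.
model_cov = W @ W.T + sigma**2 * np.eye(d)
empirical_cov = np.cov(X, rowvar=False)

max_abs_err = np.max(np.abs(empirical_cov - model_cov))
mean_err = np.max(np.abs(X.mean(axis=0) - b))
```

With enough samples, both `max_abs_err` and `mean_err` should shrink toward zero, which is what the two formulations being equivalent would predict.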

Updated 2021-07-08

Tags

Data Science