Principal Component Analysis (PCA): a dimensionality reduction algorithm
- Principal Component Analysis (PCA) is a dimensionality reduction algorithm in which a large number of variables is described by a smaller number of variables without major loss of information. For example, an object can be described by a large number of its properties. PCA tries to summarize these properties. This does not mean that it discards the redundant properties; rather, it constructs new properties of the object from the existing ones. In other words, PCA tries to find a low-dimensional linear subspace onto which the data can be projected. It does so in such a way that as much of the variance (or scatter) of the samples as possible is retained: the first principal component has the largest possible variance, the second component has the second-largest variance, and so on.
- Subtract the mean of the data set.
- Compute the eigenvalue decomposition (EVD) of the covariance matrix ${R_{XX}}$ and arrange the eigenvalues (with their eigenvectors) in descending order.
- Collect the first $d'$ ($d' < d$) eigenvectors of ${R_{XX}}$ as the rows of a matrix ${W}$, where $d'$ is the new dimension of the projected data ${Y}$.
- The matrix $W$ represents a new basis for our data, so project the data onto this basis by the multiplication $Y = WX$; a minimal sketch of these steps follows this list.
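Below is a minimal NumPy sketch of the steps listed above; the function name \texttt{pca} and the variable \texttt{d\_prime} are illustrative choices rather than part of the text.
\begin{verbatim}
import numpy as np

def pca(X, d_prime):
    # Step 1: subtract the mean of each feature (row) from the data set.
    X = X - X.mean(axis=1, keepdims=True)
    # Step 2: estimate the covariance matrix R_XX and compute its EVD,
    # rearranging eigenvalues/eigenvectors in descending order.
    N = X.shape[1]
    R_XX = (X @ X.T) / (N - 1)
    eigvals, eigvecs = np.linalg.eigh(R_XX)   # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Step 3: collect the first d_prime eigenvectors as the rows of W.
    W = eigvecs[:, :d_prime].T
    # Step 4: project the data onto the new basis, Y = W X.
    Y = W @ X
    return Y, W, eigvals

# Example: reduce 5-dimensional samples to 2 dimensions.
X = np.random.randn(5, 200)
Y, W, eigvals = pca(X, d_prime=2)
print(Y.shape)   # (2, 200)
\end{verbatim}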
Consider a $d$-dimensional data set $X=\{x^{(1)},x^{(2)},x^{(3)},\dots, x^{(N)} \}$, where $N$ is the number of samples or observations. Each sample is a $d$-dimensional feature vector, so the data set can be represented as a $d \times N$ matrix $X$. We define a new matrix $W$ (of dimension $d\times d$) which transforms $X$ to $Y$.
\begin{equation} \label{eqn_PCA} Y = WX. \end{equation} This equation describes a coordinate transformation (a change of basis). Geometrically, $W$ stretches and rotates $X$. Let $w_1,w_2,\dots,w_d$ be the row vectors of $W$ and $x_1,x_2,\dots,x_N$ be the column vectors of $X$. The product of $W$ and $X$ is then \begin{equation} \label{PCA_eqn2} WX = \begin{bmatrix} {w_{1}x_{1}} & {w_{1}x_{2}} & {w_{1}x_{3}} & \dots & {w_{1}x_{N}} \\ {w_{2}x_{1}} & {w_{2}x_{2}} & {w_{2}x_{3}} & \dots & {w_{2}x_{N}} \\ \vdots & \vdots & \vdots & \ddots&\vdots \\ {w_{d}x_{1}} & {w_{d}x_{2}} & {w_{d}x_{3}} & \dots & {w_{d}x_{N}} \end{bmatrix} = {Y}. \end{equation} The data $X$ is projected onto the rows of $W$. Hence, the rows of $W$ form a new basis for the columns of $X$; they are called the principal component directions. In turn, the rows of $Y$ are the principal components. We look for a transformation that retains as much of the variance of the data as possible. The aim is to separate signal from noise, under the assumption that the noise has smaller variance than the signal. This is achieved by de-correlating the original data (in the literature this is sometimes called \textit{whitening} the data): we find the directions in which the variance is maximized and then use these directions as a new orthonormal basis for the data.
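As a small illustration of the change of basis above, the following sketch (assuming NumPy; the orthonormal matrix $W$ is generated arbitrarily via a QR decomposition) checks that entry $(i,j)$ of $Y = WX$ is indeed the projection $w_{i}x_{j}$ of sample $x_{j}$ onto the row $w_{i}$.
\begin{verbatim}
import numpy as np

d, N = 3, 5
X = np.random.randn(d, N)                      # columns x_1, ..., x_N are the samples
W = np.linalg.qr(np.random.randn(d, d))[0].T   # rows w_1, ..., w_d: an orthonormal basis

Y = W @ X                                      # change of basis
# Entry (i, j) of Y is the dot product w_i x_j, i.e. the projection of
# sample x_j onto the direction w_i.
i, j = 1, 3
print(np.isclose(Y[i, j], W[i, :] @ X[:, j]))  # True
\end{verbatim}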
An estimate of the covariance matrix of the $d \times N$ data $X$, assuming $X$ to be zero-mean, is \[ {R_{XX}} = \frac{1}{N-1}{XX^{T}} = \frac{1}{N-1} \begin{bmatrix} {x_{1}x_{1}^{T}} & {x_{1}x_{2}^{T}} & {x_{1}x_{3}^{T}} & \dots & {x_{1}x_{d}^{T}} \\ {x_{2}x_{1}^{T}} & {x_{2}x_{2}^{T}} & {x_{2}x_{3}^{T}} & \dots & {x_{2}x_{d}^{T}} \\ \vdots & \vdots & \vdots & \ddots&\vdots \\ {x_{d}x_{1}^{T}} & {x_{d}x_{2}^{T}} & {x_{d}x_{3}^{T}} & \dots & {x_{d}x_{d}^{T}} \\ \end{bmatrix}, \] where $x_{i}$ here denotes the $i$-th row of $X$, i.e., the values of the $i$-th feature across all $N$ samples. The diagonal entries of $R_{XX}$ represent the variances of the elements $x_{i}$, and the off-diagonal entries represent the cross-covariances between $x_{i}$ and $x_{j}$. Matrices with this structure (which are square and symmetric by their nature) are called \textit{variance--covariance} matrices. Since our main goal is dimensionality reduction, i.e., reducing the redundancy of the data, we want the cross-covariances between different elements to be as small as possible. Another way of saying this is that the covariance matrix of the transformed data $Y$ should be diagonal. Therefore, we need to find a $W$ that diagonalizes $R_{YY}$, the covariance matrix of the transformed data. PCA achieves this through the eigenvalue decomposition, which diagonalizes a square symmetric matrix. Using $Y = WX$, the covariance of $Y$ can be written as \[ R_{YY} = \frac{1}{N-1}{YY^{T}} = \frac{1}{N-1}({WX})({WX})^{T} = {W}\left(\frac{1}{N-1}{XX^{T}}\right){W}^{T} = {W}{R_{XX}}{W}^{T}. \] Note that $R_{XX}$ is a $d \times d$ symmetric square matrix, so it can be orthogonally diagonalized (a symmetric matrix is orthogonally diagonalizable by a matrix of orthonormal eigenvectors): \[ R_{XX} = {U \Lambda}{U}^{T}, \] where $U$ is a square matrix whose columns are the eigenvectors of $R_{XX}$, and ${\Lambda}$ is a diagonal matrix with the eigenvalues of $R_{XX}$ as its entries. We choose the rows of $W$ to be the eigenvectors of $R_{XX}$, i.e., ${W} = {U}^{T}$, so that \[ {R_{YY}} = {W}{R_{XX}}{W}^{T} = {U}^{T}{U\Lambda}{U}^{T}{U} = {\Lambda}, \] where we used that the inverse of an orthogonal matrix is its transpose, i.e., $U^{-1} = U^{T}$ and hence $U^{T}U = I$, where $I$ is the identity matrix. As a result, the eigenvectors of ${R_{XX}}$ are the proper choice for the rows of $W$, since diagonalization of ${R_{YY}}$ is the goal of PCA.
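A short numerical check of this derivation is sketched below, assuming NumPy; the mixing matrix \texttt{A} is arbitrary and only serves to create correlated features. With $W = U^{T}$ taken from the EVD of $R_{XX}$, the covariance of $Y = WX$ comes out (numerically) diagonal and equal to $\Lambda$.
\begin{verbatim}
import numpy as np

d, N = 4, 5000
A = np.random.randn(d, d)                 # arbitrary mixing -> correlated features
X = A @ np.random.randn(d, N)
X = X - X.mean(axis=1, keepdims=True)     # zero-mean data

R_XX = (X @ X.T) / (N - 1)                # covariance estimate
print(np.allclose(R_XX, np.cov(X)))       # True: same estimator as np.cov

eigvals, U = np.linalg.eigh(R_XX)         # R_XX = U Lambda U^T
W = U.T                                   # rows of W = eigenvectors of R_XX

R_YY = W @ R_XX @ W.T                     # covariance of Y = W X
print(np.allclose(R_YY, np.diag(eigvals)))  # True: R_YY is diagonal (= Lambda)
print(np.allclose(U.T @ U, np.eye(d)))      # True: U is orthogonal
\end{verbatim}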