The seven keys to marketing automation success

I had just finished running my first all-day client training in my new role as a Marketing Automation Consultant. As the participants one-by-one said their thanks and left, heads full of new and…

Smartphone

独家优惠奖金 100% 高达 1 BTC + 180 免费旋转




Guide to Principal Component Analysis

PCA is a buzz word which always pops up at many stages of Data Analysis for various uses. But what is really behind this Principal Component Analysis? What are the uses of it? Let’s see one by one.

First, let us understand what is done behind the screen called PCA. I have attached a Jupyter Notebook with examples at the bottom of the article.

Fig-1

Yes, you are correct, it is a 2D graph.

Fig-2

Yes, it is a 1D graph, because, for every x, the y value is the same.

Let’s move to a tricky question — if you can guess this one you have successfully pass the first step in PCA.

(Fig-3) What might be the Dimensionality of this Graph?

If you say 2D, nope that is wrong. It is 1D — What? But how?

Fig-6 (Rotated Axis/data)

Yes, if we rotate the axis a bit the data points will align as same as our previous case (Fig-2). This is done in following sub-steps,

Fig-5, PC Datacamp

One of the major uses of PCA is Dimensionality reduction. Assume you have dozens of features on your dataset. But unfortunately, we can only,

So somehow we need to reduce the noise of data and filter out the meaningful features for our task. This where the PCA comes in handy.

Intrinsic dimension is the number of features needed to approximate a dataset. This is the key idea behind the dimension reduction. If we need to reduce the dimension we should know which features we should select and which to neglect.

Usually, PCA in Sklearn will automatically find the number of Intrinsic Dimensions itself. It will consider PCA features with significant variance. We can also explicitly specify the number of components to be considered.

Sometimes python notebooks are not loading in GitHub properly. In such cases download the Notebook and open it using your local Jupyter environment.

Add a comment

Related posts:

Home loan for construction?

My Dad retired from job last year and building a house. Unfortunately the cost is exceeding his budget by Rs. 7 lacs approx. I am a salaried employee with a decent package to make myself eligible for…

Observe your microservices with Cellery

Cellery Observability Dashboard provides the ability for developers, devops and architects to observe the microservices deployed on the Cellery Mesh. It shows the run-time dependencies and metrics of…

ACC basketball season preview

Just five ACC teams reached the men's NCAA Tournament last season. Will the conference bounce back in 2022-23?