Accuracy is something which gives an intuition of model performance. i.e. ratio of number of correct predictions with respect to total sample present. But what in case of unbalanced data.Imagine a case where we need to make a model to predict click through rate over a display rate. The click trough rate used to be […]

Logistic Regression is a method to create Machine Learning model for two class problems. It came out of linear regression but used to generate binary output (0 and 1) for making classifications. For example In Linear Regression we use simple linear equation as follows :- Yh = b0 + b1X1 Where X combines linearly with […]

If we have limited amount of Data, We can diversify it using data augmentation. It is like instead of collecting new data elements we just transform which is already there to increase the sample size along with diversity. We will consider unstructured data i.e. Image data for augmentation process. Editorial Team

Pandas is one of the powerful library used in python for data science and analysis. It has n-number of functions, methods and attributes, which are comparatively easy in syntax and flexible in nature. So a data scientist or any one who wants certain insights from any huge set of data prefers it and let their […]

In every situation we went through numerous option same as here. Whenever we decided to work on ML, we do stuck into a thought process of selecting an algorithm which suits our need. Its also bit complicated to know all the minor details of each one of them. Then how can we decide X is […]

Variations in the dataset is actually the information from the dataset and this is what the PCA uses. In simple terms PCA or Principal component analysis is a process to emphasise variations in a data set and generate strong pattern out of it. We can figure out the whole concepts in 3 points as followsâ€” […]

Variance – It is the measure of squared difference from the Mean. To calculate it we follow certain steps mentioned below: Calculate average of numbers For each numbers subtract the mean and square the result Calculate the average of those squared differences i.e. Variance Editorial Team

In case of Principal Component Analysis we project our data points on a vector in a direction of maximum variance to decrease the number of existing components. In this case we consider the direction eigenvector generated using covariance matrix as the direction of maximum variance. In this article we look into the proof of why […]