
Simple Linear Regression Questions and Answers

Q: What is linear regression?

A: Linear regression finds the best-fit straight line through the data. It assumes a linear relationship between the independent variables (features) and the dependent variable (target), and it picks the line that minimizes the squared prediction error.
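
To make the idea of a best-fit line concrete, here is a minimal sketch of a least-squares fit for a single feature, written with plain Python. The function name `fit_line` and the hours/scores data are made up for illustration.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = sum of cross-deviations / sum of squared x-deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    # the fitted line always passes through (mean_x, mean_y)
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: hours studied (feature) and exam score (target)
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]

slope, intercept = fit_line(hours, scores)
print(slope, intercept)  # 2.0 50.0
```

In practice a library such as scikit-learn would normally do this for you; the hand-written version is only meant to show what "best fit" means.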


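Q: What are the assumptions of linear regression?

A: a) Linearity: the relationship between the independent and dependent variables is linear.

    b) No multicollinearity: the features should not be strongly correlated with one another. Multicollinearity is the relationship between one feature and another feature.

    c) Homoscedasticity: the errors have the same variance across all values of the independent variables.

   d) Heteroscedasticity is the opposite of homoscedasticity: the errors do not have equal variance. It is a violation of assumption c), not an assumption in its own right.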
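
Two of these points can be checked with very little code. The sketch below is a rough illustration, not a formal statistical test: `pearson` measures how strongly two features move together (a value near 1 or -1 hints at multicollinearity), and `spread_ratio` is a crude, made-up helper that compares the error variance in the second half of the residuals with the first half. The feature names and numbers are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation between two equally long lists of numbers."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    dev_a = math.sqrt(sum((x - mean_a) ** 2 for x in a))
    dev_b = math.sqrt(sum((y - mean_b) ** 2 for y in b))
    return cov / (dev_a * dev_b)


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def spread_ratio(residuals):
    """Variance of the second half of the residuals divided by the first half.

    Assumes the residuals are already ordered by predicted value.
    A ratio far from 1 suggests unequal error variance.
    """
    half = len(residuals) // 2
    return variance(residuals[half:]) / variance(residuals[:half])


# Hypothetical features: house size and number of rooms
size_sqft = [800, 1000, 1200, 1500, 1800]
rooms = [2, 3, 3, 4, 5]
r = pearson(size_sqft, rooms)
print(round(r, 2))  # close to 1, so the two features carry similar information

# Hypothetical residuals: the first set is steady, the second fans out
steady = [1, -1, 1, -1, 1, -1]
fanning = [0.1, -0.1, 0.1, -2, 2, -2]
print(spread_ratio(steady), spread_ratio(fanning))
```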


Q: Which evaluation metrics are used for regression problems, and which one should we use?

A: Regression problems have several evaluation metrics, such as MAE (mean absolute error), MSE (mean squared error), and RMSE (root mean squared error). Any of them can be used to score a model. Many people start with MAE because it is cheap to compute, is in the same units as the target, and is less sensitive to outliers. MSE and RMSE penalize large errors more heavily, so prefer them when large errors matter most.
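
As a small illustration, the three metrics can be computed by hand as follows. The function names and the sample values are made up for this sketch.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the target."""
    return math.sqrt(mse(y_true, y_pred))


# Hypothetical actual and predicted values; the last prediction is off by 4
y_true = [3, 5, 7, 9]
y_pred = [2, 5, 8, 13]

print(mae(y_true, y_pred))   # 1.5
print(mse(y_true, y_pred))   # 4.5
print(rmse(y_true, y_pred))  # about 2.12
```

Note how the single large error raises MSE and RMSE much more than MAE.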

Q: Is outlier treatment required for linear regression?
A) Yes. The fit minimizes squared error, so a few extreme points can pull the line toward them. Removing or capping outliers usually improves the score.
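
One common way to find outliers is the IQR rule, which is an assumption here since the post does not name a method. The sketch below drops rows whose target value lies more than 1.5 times the interquartile range outside the quartiles. The helper names and data are hypothetical.

```python
import statistics


def iqr_bounds(values, k=1.5):
    """Return (low, high) fences: Q1 - k*IQR and Q3 + k*IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def drop_outliers(xs, ys, k=1.5):
    """Keep only the (x, y) pairs whose y lies inside the IQR fences."""
    low, high = iqr_bounds(ys, k)
    kept = [(x, y) for x, y in zip(xs, ys) if low <= y <= high]
    return [x for x, _ in kept], [y for _, y in kept]


# Hypothetical data: the last target value is an obvious outlier
xs = [1, 2, 3, 4, 5, 6, 7]
ys = [10, 12, 11, 13, 12, 11, 95]

clean_xs, clean_ys = drop_outliers(xs, ys)
print(clean_xs, clean_ys)  # the pair (7, 95) is removed
```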

Q) If a feature does not follow a Gaussian distribution, what should we do?

A) Apply one of these transformations:
   1) Log transformation
   2) Reciprocal transformation
   3) Exponential transformation
   4) Square-root transformation
Choose whichever technique brings the feature closest to a Gaussian distribution.
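
As a rough illustration of the first option, the sketch below measures skewness before and after a log transformation. It assumes every value is strictly positive (the log is undefined otherwise); the `skewness` helper and the sample data are made up for this example.

```python
import math


def skewness(values):
    """Sample skewness: third central moment divided by variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical right-skewed feature with one very large value
raw = [1, 2, 3, 5, 8, 13, 100]
logged = [math.log(v) for v in raw]  # assumes every value is > 0

print(skewness(raw))     # strongly positive
print(skewness(logged))  # still positive, but much closer to 0
```

A skewness near 0 is only one sign of a roughly symmetric distribution, so it is worth looking at a histogram as well.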

Q) Is missing value treatment required for linear regression?

A) Yes. Linear regression cannot be fitted on rows with missing values, and the same is true of most models. Fill missing values with the mean, median, or mode of the feature.
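
A minimal sketch of the three filling strategies, using Python's `statistics` module. Missing entries are represented as `None` here, which is an assumption of this example; the `impute` function and the data are hypothetical.

```python
import statistics


def impute(values, strategy="mean"):
    """Replace None entries with the mean, median, or mode of the observed values."""
    observed = [v for v in values if v is not None]
    if strategy == "mean":
        fill = statistics.mean(observed)
    elif strategy == "median":
        fill = statistics.median(observed)
    elif strategy == "mode":
        fill = statistics.mode(observed)
    else:
        raise ValueError(f"unknown strategy: {strategy}")
    return [fill if v is None else v for v in values]


# Hypothetical feature with two missing entries
ages = [20, None, 30, 70, None, 30]

print(impute(ages, "mean"))    # gaps filled with 37.5
print(impute(ages, "median"))  # gaps filled with 30.0
print(impute(ages, "mode"))    # gaps filled with 30
```

The median is usually the safer choice when the feature contains outliers, since the mean is pulled toward them (37.5 versus 30 above).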

Q) What are the advantages and disadvantages of a linear model?
A)
Advantages:
  1. Overfitting can be controlled with dimensionality reduction, cross-validation, and regularization.
  2. Linear regression performs very well when the relationship between the features and the target is linear.
Disadvantages:
  1. It sometimes requires a lot of feature engineering.
  2. Correlated independent features (multicollinearity) can hurt performance.
  3. It is sensitive to noise and outliers, and it can overfit when there are many features.
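
To show what regularization does, here is a small sketch of ridge (L2) regression for a single feature. It assumes the common setup where only the slope is penalized and the intercept is not; the `ridge_line` function, the `lam` penalty parameter, and the data are made up for illustration.

```python
def ridge_line(xs, ys, lam):
    """Single-feature ridge regression with an unpenalized intercept.

    Minimizes sum((y - intercept - slope * x) ** 2) + lam * slope ** 2.
    With lam = 0 this is ordinary least squares.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    # the penalty is added to the denominator, which shrinks the slope toward 0
    slope = sxy / (sxx + lam)
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]

print(ridge_line(hours, scores, lam=0))   # (2.0, 50.0), plain least squares
print(ridge_line(hours, scores, lam=10))  # (1.0, 53.0), slope shrunk by the penalty
```

A larger `lam` gives a flatter line, which trades some fit on the training data for less sensitivity to noise.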

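Q) Does linear regression require feature scaling?

A) Usually yes. Plain least squares gives the same predictions with or without scaling, but scaling is needed when the model is trained with gradient descent or uses regularization, and it makes the coefficients comparable across features.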
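
One common form of feature scaling is standardization (z-score scaling), sketched below. The post does not say which scaling method it has in mind, so this choice is an assumption; the `standardize` function and the feature values are hypothetical.

```python
import math


def standardize(values):
    """Rescale values to mean 0 and standard deviation 1 (z-scores)."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / std for v in values]


# Hypothetical features on very different scales
income = [30000, 45000, 60000]
age = [25, 35, 45]

print(standardize(income))
print(standardize(age))
```

After scaling, both features land on the same range even though the raw incomes are about a thousand times larger than the ages.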

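Q) Can linear regression be used for time-series analysis?
A) Yes, you can use it, but the results are often not accurate, because consecutive observations in a time series are correlated with each other and linear regression assumes independent errors.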

