Skip to main content

Modeling Linear Regression

Premium

You are modeling marketing return on investment (ROI). You have each month’s revenue on the Y axis and spend on the X axis.

img_one_regression_question

You decide to use a simple linear regression model to evaluate whether spending more would generate more revenue. You find your linear intercept (b) is $1.5MM and gradient (a) is 2.1. Your residual standard error is 79.1 and your adjusted R-squared is 0.72 with a p-value of 1.09e-9.

a. How much of your data’s variance has your model explained and can the result be called significant?

b. Our problem requires more accuracy in modeling the data. How can we alter the linear equation to better fit the data? What regression model would you pick and why?

c. Your new model explains 98% of the data variance. How would you determine if your model is overfitting? How would you evaluate the model overall fit and parameters fit?

The p-value is very small. What does that tell you?

What would increase the complexity of the model?

Make sure to check that the features aren’t collinear.

How do the residuals look?

For part (b): Let's say our model failed because of multicollinearity. What model would you chose to get more accuracy?

For part (c): Can you name more than five checks you would use to determine if the model is overfitting?