You are Here: Home >< Maths

Announcements Posted on
TSR's new app is coming! Sign up here to try it first >> 17-10-2016
1. Using minitab 16, for a regression model, how would I determine the ‘best’ regression equation by using an all-possible subset regression approach with suitable selection criteria (and how does one choose this "suitable selection criteria")?
2. (Original post by Bruce Harrisface)
Using minitab 16, for a regression model, how would I determine the ‘best’ regression equation by using an all-possible subset regression approach with suitable selection criteria (and how does one choose this "suitable selection criteria"?
Maybe this will help. The criterion that one uses for selecting between models depends upon the use that you're going to make of the final model. I don't use minitab, so I don't know which criteria it offers, but they will likely be to do with regression fit, explained variation or prediction accuracy.

BTW, all-subsets selection (and step-wise selection) are, in general, very poor methods of arriving at a final regression model. They were very popular a few years ago (hence they have been built into current software), but they have been shown to have very poor selection performance. Replaced these days with shrinkage techniques like the Lasso.
3. I've done the best subsets approach so now I have this table

How do I get the regression equation from this?
4. Sorry, wrong image
How would I get the equation from this table?
5. (Original post by Bruce Harrisface)
Sorry, wrong image
How would I get the equation from this table?
The selection criteria criteria you are given there are:

(i) R-squared: this measures the amount of explained variation in the regression. Higher is better.
(ii) Adjusted R-squared: same as (i) but with a penalty for the number of regression covariates. Tends to be favoured over (i) as it prefers "compact" regression equations.
(iii) Mallows Cp: this is equivalent to the Akaike Information Criterion (AIC), smaller is better subject to the expected value of Cp should be approximately equal to the number of covariates plus one (the constant).
(iv) S: the standard error of the regression. Again, smaller is better.

Here you have a whole bunch of choices with adjusted R-squared above 90%. Any of these would be good candidates. If you want to pick one then pick the fifth with adjusted R-squared of 91.3, though note that Mallows Cp might be a bit low, so you should check all of the regression diagnostics before you use it for inference or prediction.

Register

Thanks for posting! You just need to create an account in order to submit the post
1. this can't be left blank
2. this can't be left blank
3. this can't be left blank

6 characters or longer with both numbers and letters is safer

4. this can't be left empty
1. Oops, you need to agree to our Ts&Cs to register

Updated: March 29, 2016
TSR Support Team

We have a brilliant team of more than 60 Support Team members looking after discussions on The Student Room, helping to make it a fun, safe and useful place to hang out.

This forum is supported by:
Today on TSR

How does exam reform affect you?

From GCSE to A level, it's all changing

Poll
Useful resources

Maths Forum posting guidelines

Not sure where to post? Read here first

How to use LaTex

Writing equations the easy way

Study habits of A* students

Top tips from students who have already aced their exams

Chat with other maths applicants