Econometrics Beat: Dave Giles' Blog: Can You Actually TEST for Multicollinearity?

Monday, June 24, 2013

Can You Actually TEST for Multicollinearity?

When you're undertaking a piece of applied econometrics, something that's always on your mind is the need to test the specification of your model, and to test the validity of the various underlying assumptions that you're making. At least - I hope it's always on your mind!

This is an important aspect of any modelling exercise, whether you're working with a linear regression model, or with some nonlinear model such Logit, Probit, Poisson regression, etc. Most people are pretty good when it comes to such testing in the context of the linear regression model. They seem to be more lax once they move away from that framework. That makes me grumpy, but that's not what this particular post is about.

It's actually about a rather silly question that you sometimes encounter, namely: "Have you tested to see if multicollinearity is a problem for your results?"

I'll explain why this isn't really a sensible question, and why the answer to the question in the title for this post is a resounding "No!"

First of all, let's stop and think for a moment what is actually going on when we perform a statistical test f some hypothesis. The null and alternative hypotheses are statements, or conjectures, about some feature of the underlying population. That is, they're associated with the data-generating process that supposedly gave rise to the sample data that we actually observe. For example, the hypothesis that we're testing may be a statement to the effect that one of the parameters in the population takes a particular value.

Hypothesis testing is an example of statistical inference. We use specific sample information to try and learn (infer) something about the unobserved characteristics of the population at large.

To state the obvious, in the context of a regression model, it's meaningful to have null and alternative hypotheses of the form, H₀: β₂ = 0 and H₁: β₂ > 0. Under standard conditions we might then test the validity of H₀ by using the t-statistic, t₂ = (b₂ - β₂) / s.e.(b₂). We'd reject H₀ in favour of H₁ if t₂ > c_α, where c_α is the 100(1 - α)'^th percentile of Student's t-distribution with (n - k) degrees of freedom.

On the other hand, it's nonsensical to write down null and alternative hypotheses of the form, H₀: b₂ = 0 and H₁: b₂ >0. These "hypotheses" are actually statements about the random variable, b₂, which is a function of the observed sample data. They are not statements about a fixed, unobserved, population parameter! They don;t form any basis for making inferences.

Now, let's return to the "problem" of multicollinearity.

What do we mean by this term, anyway? This turns out to be the key question!

Multicollinearity is a phenomenon associated with our particular sample of data when we're trying to estimate a regression model. Essentially, it's a situation where there is insufficient information in the sample of data to enable us to enable us to draw "reliable" inferences about the individual parameters of the underlying (population) model.

I'll be elaborating more on the "informational content" aspect of this phenomenon in a follow-up post. Yes, there are various sample measures that we can compute and report, to help us gauge how severe this data "problem" may be. But they're not statistical tests, in any sense of the word

Because multicollinearity is a characteristic of the sample, and not a characteristic of the population, you should immediately be suspicious when someone starts talking about "testing for multicollinearity". Right?

Apparently not everyone gets it!

There's an old paper by Farrar and Glauber (1967) which, on the face of it might seem to take a different stance. In fact, if you were around when this paper was published (or if you've bothered to actually read it carefully), you'll know that this paper makes two contributions. First, it provides a very sensible discussion of what multicollinearity is all about. Second, the authors take some well known results from the statistics literature (notably, by Wishart, 1928; Wilks, 1932; and Bartlett, 1950) and use them to give "tests" of the hypothesis that the regressor matrix, X, is orthogonal.

How can this be? Well, there's a simple explanation if you read the Farrar and Glauber paper carefully, and note what assumptions are made when they "borrow" the old statistics results. Specifically, there's an explicit (and necessary) assumption that in the population the X matrix is random, and that it follows a multivariate normal distribution.

This assumption is, of course totally at odds with what is usually assumed in the linear regression model! The "tests" that Farrar and Glauber gave us aren't really tests of multicollinearity in the sample. Unfortunately, this point wasn't fully appreciated by everyone.

There are some sound suggestions in this paper, including looking at the sample multiple correlations between each regressor, and all of the other regressors. These, and other sample measures such as variance inflation factors, are useful from a diagnostic viewpoint, but they don't constitute tests of "zero multicollinearity".

So, why am I even mentioning the Farrar and Glauber paper now?

Well, I was intrigued to come across some Stata code (Shehata, 2012) that allows one to implement the Farrar and Glauber "tests". I'm not sure that this is really very helpful. Indeed, this seems to me to be a great example of applying someone's results without understanding (bothering to read?) the assumptions on which they're based!

Be careful out there - and be highly suspicious of strangers bearing gifts!

References

Bartlett, M. S., 1950. Tests of significance in factor analysis. British Journal of Psychology, Statistical Section, 3, 77-85.

Farrar, D. E. and R. R. Glauber, 1967. Multicollinearity in regression analysis: The problem revisited. Review of Economics and Statistics, 49, 92-107.

Shehata, E. A. E., 2012. FGTEST: Stata module to compute Farrar-Glauber Multicollinearity Chi2, F, t tests.

Wilks, S. S., 1932. Certain generalizations in the analysis of variance. Biometrika, 24, 477-494.

Wishart, J., 1928. The generalized product moment distribution in samples from a multivariate normal population. Biometrika, 20A, 32-52.

21 comments:

AnonymousJune 25, 2013 at 1:39 AM
Hi Dave, as I recall, quite a few econometrics texts note that the assumption of fixed, rather than stochastic, variables/regressors is often made for convenience (of derivation and exposition) rather than because it has some substantive basis. Doesn't that affect your statement above about the Farrar and Glauber paper?
ReplyDelete
Replies
AnonymousJune 25, 2013 at 11:00 AM
See Goldberger, A Course in Econometrics, 1991, pp. 248-50, for a pretty scathing critique of "testing" for multicollinearity (which he analogizes to "testing for small sample size," aka "micronumerosity")
ReplyDelete
Replies
FhnuzoagJune 27, 2013 at 4:53 AM
It seems to me that the main issues here are loose language, and loose concepts. I say this as someone who has actually asked stuff similar to "Have you tested to see if multicollinearity is a problem for your results?" from time to time. It's true that generally I don't see what the usefulness of statistical hypothesis testing for multicollinearity would be, but I think that there are implied questions here from such a statement that are informative and worth looking at. For instance:

Q1: Is [phenomenon] easily explained by multicollinearity?

Such phenomenon include things like an algorithm taking a very long time to converge, or being very sensitive to initial parameters, or giving gigantic error estimates. Sometimes people throw their arms up at such things without asking why, or they go to explanations like 'this must mean there's a big noise component' without exhausting all the possibilities.

Q2: Is the model designed such that multicollinearity arises frequently? Is the study designed such that this can happen?

This can happen with models involving transformations with covariates. Sometimes, even with arbitarily large samples, your model neccessarily has a multicollinearity problem! Surveys can also have this issue.

Q3: Does your method survive multicollinearity well?

For example, working with principle components is one way to tackle the multi-collinearity issue.
ReplyDelete
Replies
econjeffJuly 7, 2013 at 5:47 AM
Here's a test: does Stata drop one of your independent variables when it estimates your model? If yes, then they are collinear. If not, then they are not.
ReplyDelete
Replies
AnonymousSeptember 11, 2013 at 11:06 AM
I am doing sGMM, do I have to ensure that there are no multicollinearity among the explanatory variables? even though I am using instruments in this case?
ReplyDelete
Replies
itfeature.comSeptember 17, 2013 at 9:53 AM
Thanks for nice tutorial on relationship between explanatory variables.
ReplyDelete
Replies
UnknownJune 23, 2014 at 4:09 AM
Hello Professor, do we worry about multicollinearity issue in ARDL estimation?
ReplyDelete
Replies
MaiJuly 17, 2015 at 6:31 AM
Dear Professor, I am using sGMM to investigate the effect of executive compensation (independent variable) on bank risk (dependent variable). In the model, I use also some other control variables. My problem is that the coefficients of control variables are not consistent when I replace the compensation variable. For example if I used Salary as the compensation variable I found significant effect of bank capital, however when Bonus is used as the compensation variable, effect of bank capital is not significant. Is this caused by multicollinearity? Thank you in advance.
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.

Pages

Monday, June 24, 2013

Can You Actually TEST for Multicollinearity?

21 comments: