Econometrics Beat: Dave Giles' Blog: Zero-One Matrices

Thursday, January 2, 2014

Zero-One Matrices

When we're learning the basics of least squares regression analysis, one of the topics that we invariably encounter is the consequences of model mis-specification. In particular, we're taught that omitting relevant regress from the model renders the OLS estimator biased and inconsistent, although its precision is improved. On the other hand, including extraneous regressors simply reduces the efficiency of the OLS estimator of the coefficient vector. That estimator is still unbiased (and consistent) in this case.

These results are just special cases of those associated with imposing false restrictions on the parameter space, or failing to impose valid restrictions. So, once these more general results have been covered there's really no need to treat the "omitted regressors" and "extraneous regressors" situations as a separate matter.

However, usually they are dealt with as a distinct topic. What I find interesting, and what I want to focus on here, is the way in which the unbiasedness of OLS can be demonstrated in the context of irrelevant regressors. There's an easy way to get this result, and there's a more tedious proof. Let's begin by looking at the easy way.

Here's the set-up for our problem. The correct data-generating process (DGP) is:

y = X₁β₁ + ε ; ε ~ [0 , σ²I_n] (1)

but the model that is estimated is:

y = X₁β₁ + X₂β₂ + u (2)

where X₁ and X₂ are both non-random and of full rank, k₁ and k₂ respectively.

So, our OLS estimator of the full coefficient vector in (2) is b = (X'X)^-1X'y, where X = (X₁ , X₂).

Given that (1) is the true DGP, we can write

b = (X'X)^-1X'(X₁β₁ + ε). (3)

When I'm teaching this stuff, what I do next is to introduce the following zero-one "selection matrix":

S' = ( I , 0')

and note that we can write X₁ = XS. Immediately, it follows that

b = Sβ₁ + ε , and E[b] = Sβ₁

Because b' = (b₁' , b₂'), we have the result that E[b₁] = β₁and E[b₂] = 0 (= β₂).

So, both sub-vectors of b are unbiased estimators for the corresponding coefficient sub-vectors.

Alright, that was easy enough. What's the difficult way to do this?

In equation (3), the (X'X)^-1 matrix can be written as a partitioned inverse, and we can proceed, laboriously, as follows:

b₁ = [X₁'X₁ - X₁'X₂(X₂'X₂)^-1X₂'X₁]^-1X₁'y -[X₁'X₁ - X₁'X₂(X₂'X₂)^-1X₂X₁]^-1X₁'X₂(X₂'X₂)^-1X₂'y ,
so

E[b₁] = QX₁'X₁β₁ - QX₁'X₂(X₂'X₂)^-1X₂'X₁β₁ ,

where
Q = [X₁'X₁ - X₁'X₂(X₂'X₂)^-1X₂'X₁]^-1.
So,

E[b₁] = Q[X₁'X₁ - X₁'X₂(X₂'X₂)^-1X₂'X₁] β₁ = QQ^-1β₁ = β₁ .

You can then go through the same agony to prove that E[b₂] = 0, if you really want to!

Various types of zero-one matrices can be used in all sorts of ways to make life easier in econometrics, and the use of the selection matrix here is a good example. A comprehensive discussion of the use of these matrices is given by Turkington (2001).

References

Turkington, D. A., 2001. Matrix Calculus and Zero-One Matrices: Statistical and Econometric Applications. Cambridge University Press, Cambridge.

4 comments:

AnonymousJanuary 3, 2014 at 5:13 AM
The selection matrix works for showing unbiasedness, but if you want to show that adding irrelevant regressors lowers precision, you have to use Frisch-Waugh...(which uses the partitioned inverse, although you can hide that from your students by solving equations).
ReplyDelete
Replies
AnonymousJanuary 3, 2014 at 11:32 AM
Nice post, Prof. Giles! Just to let you know, there is an easy way to display nice maths in blogger using the tex typesetting system, it is very well described on this blog post:
http://holdenweb.blogspot.co.uk/2011/11/blogging-mathematics.html
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.

Pages

Thursday, January 2, 2014

Zero-One Matrices

4 comments: