Comments on Econometrics Beat: Dave Giles' Blog: An Overly Confident (Future) Nobel Laureate

Thanks Mark!

2017-08-04T04:51:59.500-07:00

Thanks Mark!

The best discussion on Andrew Gelman's blog is...

2017-08-03T20:50:57.450-07:00

The best discussion on Andrew Gelman's blog is in connection with this entry:

http://andrewgelman.com/2017/03/04/interpret-confidence-intervals/

Some good contributions there, esp. by Carlos Ungil and Daniel Lakeland.

«if you were a Bayesian, then the whole idea of a ...

2017-08-03T08:04:00.171-07:00

«if you were a Bayesian, then the whole idea of a confidence interval will be meaningless, regardless of the sample size - because you'd have no interest in "repeated sampling", or the associated idea of the "sampling distribution".»

I think that is a bizarre misdescription of Bayesian approaches, as if they were "well, one sample is plenty and then we bet the farm, because priors!".

«I like to get students doing MC simulations nice ...

2017-08-03T07:58:55.564-07:00

«I like to get students doing MC simulations nice and early.»

That is a really, really good point. For example, I found that I and others only understood ("somewhat") the dreaded p-value once it was computed from an MC simulation, because the classic definition is in effect a double negative, not a constructive one.
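A minimal sketch of what "computing the p-value from an MC simulation" could look like, for a two-sided test of a mean. The function names, the normal-data assumption, and the default settings are illustrative choices, not anything taken from the post: the idea is simply to generate many fake samples under the null and count how often their t-statistic is at least as extreme as the observed one.

```python
import random
import statistics


def t_stat(xs, mu0):
    """Usual one-sample t-statistic for H0: mean == mu0."""
    n = len(xs)
    return (statistics.mean(xs) - mu0) / (statistics.stdev(xs) / n ** 0.5)


def mc_p_value(sample, mu0=0.0, n_sims=5000, seed=0):
    """Two-sided Monte Carlo p-value, assuming the data are i.i.d. normal.

    Under that assumption the t-statistic has the same null distribution
    whatever the scale, so the fake samples can be drawn from N(0, 1).
    """
    rng = random.Random(seed)
    n = len(sample)
    observed = abs(t_stat(sample, mu0))
    extreme = 0
    for _ in range(n_sims):
        fake = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if abs(t_stat(fake, 0.0)) >= observed:
            extreme += 1
    # Share of null-generated samples at least as extreme as the real one.
    return extreme / n_sims
```

Read this way, the p-value is constructive: it is just the fraction of simulated "null worlds" that look at least as surprising as the data in hand.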

«The true value of the coefficient in this regress...

2017-08-03T07:42:53.771-07:00

«The true value of the coefficient in this regression model is a parameter. It's a constant, whose value we just don't happen to know. On the other hand, the (point) estimate of 1.1 is just one particular "realized" value of a random variable - generated using this one particular sample of data. An estimator is a formula - like the OLS formula in our example. Except in rather silly cases, this formula involves using the (random) sample data. So, an estimator is a function of the sample data - in other words, what we call a statistic. When we apply this formula using a particular sample of data, we generate a number - a point estimate. Because an estimator is a function of the random data, it's random itself. Being a random variable, an estimator has a distribution function.»

To me this looks like the extremely loose and obfuscating terminology that gets so many people into trouble. For example, can a "formula" be a "random variable" and have a "distribution function"? That's simply ridiculous. The way I learned it from some very clear-thinking de Finettian subjectivists (although it is not in itself a subjectivist point of view) is:

* There is an algebra of arithmetic numbers and an algebra of stochastic numbers, and they are fundamentally different.

* A "statistic" is a measure over a set of numbers, whether they be arithmetic or stochastic. The same formula for a measure can denote two different functions, one over arithmetic numbers, one over stochastic numbers.

* Arithmetic numbers arise from populations, stochastic numbers from samples (under the hypothesis that the sampling process is ergodic, but I am not sure that is what a de Finettian subjectivist would say).

* A measure on a sample is at the same time an arithmetic number with respect to the sample, and a stochastic number if *interpreted* as an estimate of the same measure on the population, while a measure on a population is always and only an arithmetic number.

* Bonus point: it is fantastically important (especially in studies of the political economy) to always ask what population a sample has been drawn from, and whether the sampling process was indeed ergodic. And if you consider those two questions deeply enough, you end up a de Finettian subjectivist, I guess :-).

I do hope that I was not that loose conceptually or in terminology in the above, and that it reflects the insights I got from those clear thinking people.
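One way to make the distinction between a single realized estimate and the estimator's behaviour over repeated samples concrete is a small simulation. This is only a sketch under assumptions of my own choosing (a hypothetical true slope of 1.0, fixed regressors, standard normal errors, and the helper names `ols_slope` and `simulate_slopes`): the same OLS formula returns one plain number for one sample, and a whole spread of numbers when the sampling is repeated.

```python
import random
import statistics


def ols_slope(xs, ys):
    """OLS slope for a simple regression of y on x (with an intercept)."""
    xbar, ybar = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    sxx = sum((x - xbar) ** 2 for x in xs)
    return sxy / sxx


def simulate_slopes(true_slope=1.0, n=50, n_reps=2000, seed=0):
    """Apply the same formula to many samples drawn from one assumed model."""
    rng = random.Random(seed)
    xs = [i / n for i in range(n)]  # regressors held fixed across samples
    slopes = []
    for _ in range(n_reps):
        ys = [true_slope * x + rng.gauss(0.0, 1.0) for x in xs]
        slopes.append(ols_slope(xs, ys))
    return slopes


slopes = simulate_slopes()
print("one realized estimate:", slopes[0])
print("mean and sd over repeated samples:",
      statistics.mean(slopes), statistics.stdev(slopes))
```

Each entry of `slopes` is an ordinary number; the "distribution" only appears when the collection of them is considered as a whole.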

I will definitely be looking into this - thanks ag...

2017-08-03T06:47:27.450-07:00

I will definitely be looking into this - thanks again for alerting me (and other readers).

Ah... hadn't noticed that! In 2011 I wasn'...

2017-08-03T06:45:02.702-07:00

Ah... hadn't noticed that! In 2011 I wasn't aware of "bet-proofness" either - I only learned about it from the M-N 2016 paper. But the concept has been around for decades, apparently. It's curious that it isn't more widely known.

Mark - thanks for pointing this out! I'll chec...

2017-08-03T06:21:33.091-07:00

Mark - thanks for pointing this out! I'll check it out. (Note that my blog post was from 2011 - I promoted it recently because it was the 100th anniversary of Friedman's birth.)

Actually, there IS a way to interpret realized CIs...

2017-08-03T06:13:23.347-07:00

Actually, there IS a way to interpret realized CIs. The concept is “bet-proofness”. We had quite a good discussion about it over at Andrew Gelman's blog several months ago. I learned about the concept from a recent paper by Mueller-Norets (Econometrica 2016).

Mueller-Norets (2016, published version, p. 2185):

“Following Buehler (1959) and Robinson (1977), we consider a formalization of “reasonableness” of a confidence set by a betting scheme: Suppose an inspector does not know the true value of θ either, but sees the data and the confidence set of level 1−α. For any realization, the inspector can choose to object to the confidence set by claiming that she does not believe that the true value of θ is contained in the set. Suppose a correct objection yields her a payoff of unity, while she loses α/(1−α) for a mistaken objection, so that the odds correspond to the level of the confidence interval. Is it possible for the inspector to be right on average with her objections no matter what the true parameter is, that is, can she generate positive expected payoffs uniformly over the parameter space? … The possibility of uniformly positive expected winnings may thus usefully serve as a formal indicator for the “reasonableness” of confidence sets.”

“The analysis of set estimators via betting schemes, and the closely related notion of a relevant or recognizable subset, goes back to Fisher (1956), Buehler (1959), Wallace (1959), Cornfield (1969), Pierce (1973), and Robinson (1977). The main result of this literature is that a set is “reasonable” or bet-proof (uniformly positive expected winnings are impossible) if and only if it is a superset of a Bayesian credible set with respect to some prior. In the standard problem of inference about an unrestricted mean of a normal variate with known variance, which arises as the limiting problem in well behaved parametric models, the usual [realized confidence] interval can hence be shown to be bet-proof.”
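The odds in the quoted betting scheme can be checked with a line of arithmetic. The sketch below is only an illustration, with a hypothetical function name and a hypothetical input: `conditional_coverage` stands for the chance that the set covers θ on those occasions when the inspector chooses to object, which is a number assumed here rather than derived from any model.

```python
def expected_objection_payoff(conditional_coverage, alpha=0.05):
    """Inspector's expected payoff per objection under the quoted odds.

    conditional_coverage is a hypothetical input: the probability that the
    set contains theta on the occasions when the inspector objects.
    """
    win = 1.0                     # payoff for a correct objection
    loss = alpha / (1.0 - alpha)  # amount lost on a mistaken objection
    return (1.0 - conditional_coverage) * win - conditional_coverage * loss


for c in (0.90, 0.95, 0.99):
    print(c, expected_objection_payoff(c))
```

With these odds the payoff is zero when the coverage on the objected cases equals 1−α, and it turns positive only when that coverage falls below 1−α. Bet-proofness, as quoted, rules out an objection rule that achieves a positive payoff for every value of θ at once, which this one-number sketch does not attempt to model.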

Full reference:

Credibility of Confidence Sets in Nonstandard Econometric Problems
Ulrich K. Mueller and Andriy Norets (2016)
https://www.princeton.edu/~umueller/cred.pdf
http://onlinelibrary.wiley.com/doi/10.3982/ECTA14023/abstract

Interesting stuff!

--Mark

Excellent explanation... but sorry, but you're...

2017-08-02T08:59:57.770-07:00

Excellent explanation, but sorry, you're really just parsing words here. If 95% of the intervals would cover the true value, then IMO it's not illogical at all to say that there's a 95% chance that any particular interval selected contains the true value. Yes, I get that the specific one we estimated either does or does not, but on average 95% is the best estimate we have of whether it does or does not.

I am grateful to all of you (Dave Giles and Richar...

2017-08-01T21:45:00.668-07:00

I am grateful to all of you (Dave Giles and Richard Morey et al) for explaining this so clearly. Thank you!

Anonymous: What you can say about the single inter...

2015-11-22T21:30:20.985-08:00

Anonymous: What you can say about the single interval is, barring other available information that would make such a statement absurd, that you believe the true value is in it. People run into the most trouble trying to wrap such statements in probabilities, not understanding that after the interval is calculated there aren't any known probabilities for that interval (without additional work). But consider the confidence in the procedure. If you perform a procedure that is correct 95% of the time, it's perfectly rational to then just act as if the procedure gave you the correct answer, even if you have no idea of the exact probability that you're correct this particular time. I always find it fascinating that people have no problem acting as if their decision following a typical test is correct, even though they may be wrong at a much higher rate than for a CI, but can't do the same with a CI. The difference is that you're not stating sig./non-sig. but instead saying that mu is here.

Anonymous: You can't say anything about a sing...

2015-08-08T01:24:45.612-07:00

Anonymous: You can't say anything about a single interval. As Neyman (1952) said, "[all the CI] does assert is that the probability of success in estimation using [any] formula[] is equal to [95%]." You can read more on this topic in our upcoming paper, "The Fallacy of Placing Confidence in Confidence Intervals" (http://learnbayes.org/papers/confidenceIntervalsFallacy/index.html).

Thanks professor. So what can we say about a singl...

2014-03-09T16:08:50.235-07:00

Thanks, professor. So what can we say about a single interval? In your example, how would you interpret the confidence interval of [0.9, 1.3]?

Ok, I just wanted to convince myself this is the o...

2013-08-14T10:42:22.452-07:00

Ok, I just wanted to convince myself this is the only reason.
All doubts clarified. Thanks for that.

The interval is random; the parameter is constant....

2013-08-14T10:33:27.561-07:00

The interval is random; the parameter is constant.

Brilliant! Now, I just can't link the two thin...

2013-08-14T10:25:15.584-07:00

Brilliant! Now, I just can't link the two things:
(1) the probability of a single interval covering the unknown parameter is 95%.
(2) the probability of the unknown parameter being within a single interval is either zero or 1.

Yes, but only in the sense that if we repeated the...

2013-08-14T10:15:18.139-07:00

Yes, but only in the sense that if we repeated the exercise again and again, with randomly drawn samples of the same size, then 95% of all of the intervals that we constructed would cover the parameter. In practice, we're not (usually) able to do this.
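Although we usually can't repeat the exercise with real data, we can on a computer. The following is a minimal sketch under assumptions chosen purely for illustration (normal data, known variance, a true mean of 1.0, samples of size 25, and a hypothetical function name): it draws many samples, builds the usual interval from each, and reports the share of intervals that cover the fixed parameter.

```python
import random
from statistics import NormalDist, mean


def coverage_rate(true_mean=1.0, sigma=1.0, n=25, n_reps=10000,
                  level=0.95, seed=0):
    """Share of known-variance normal intervals that cover true_mean."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(0.5 + level / 2.0)  # about 1.96 for 95%
    half_width = z * sigma / n ** 0.5
    hits = 0
    for _ in range(n_reps):
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        xbar = mean(sample)
        # The interval moves from sample to sample; true_mean never does.
        if xbar - half_width <= true_mean <= xbar + half_width:
            hits += 1
    return hits / n_reps


print(coverage_rate())
```

The printed share should land close to 0.95, and that long-run proportion is all the "95%" refers to. Any one of the simulated intervals either contains 1.0 or it doesn't.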

Many thanks for your reply. But in this case, the...

2013-08-14T10:11:03.312-07:00

Many thanks for your reply.
But in this case, the probability of the single interval covering the unknown parameter would be 95%, wouldn't it?

Yes we can. And of course given that the confidenc...

2013-08-14T10:03:50.924-07:00

Yes we can. And of course given that the confidence interval is constructed using the sampling distribution of the point estimator, the notion of "probability" in this context (whether we like it or not), is based on "repeated sampling". We'll never know if our single interval covers the unknown parameter or not.

Very nice post. About Mr. F's question: I am ...

2013-08-14T09:43:13.062-07:00

Very nice post.
About Mr. F's question: I am ok with the fact that θ10 is a constant, and as such the probabilities are zero or 1. But the intervals are random variables, and we can ask about the probability of one of those intervals covering θ10 or not. Can't we?

Rasmus: Thanks for the comment. No, the story (int...

2012-01-07T14:29:25.537-08:00

Rasmus: Thanks for the comment. No, the story (interpretation of the confidence interval) doesn't change in the asymptotic case.

Of course, if you were a Bayesian, then the whole idea of a confidence interval will be meaningless, regardless of the sample size - because you'd have no interest in "repeated sampling", or the associated idea of the "sampling distribution".

Glad you are enjoying the blog.

Great post and a great blog in general. Does the ...

2012-01-07T13:26:25.916-08:00

Great post and a great blog in general.
Does the story (i.e. the interpretation of the confidence intervals) change if we are considering an estimator where only the asymptotic distribution is known? E.g., T^0.5 (b - beta) is asymptotically normal.

Jeremy: Thanks. I agree about the medics. There ar...

2011-08-31T10:00:59.062-07:00

Jeremy: Thanks. I agree about the medics. There are some gems in the med. journals! You're also right about the key role the sampling distribution plays in understanding what follows. I like to get students doing MC simulations nice and early.

Great post and great story! In addition to judges,...

2011-08-30T19:03:24.628-07:00

Great post and great story! In addition to judges, doctors also have a great deal of trouble properly interpreting the confidence intervals presented in medical literature.

I think students' difficulties (my own, anyway) stem from an inadequate understanding of the sampling distribution before diving into moments and OLS regression. Peter Kennedy's Guide to Econometrics opens with an excellent, intuitive treatment of sampling distributions that helped me immensely.