Comments on The 20% Statistician: The difference between a confidence interval and a capture percentage

2021-04-07T16:26:40.911+02:00

This comment has been removed by a blog administrator.

That's a great post but I think it misses the ...

2016-07-19T04:26:11.676+02:00

That's a great post but I think it misses the point that not only are statements like the one being critiqued incorrect but they are caring about the wrong thing. They're like early astronomy where the earth is the centre of the universe. The researcher is thinking in terms of their mean and CI as the centre of the universe. Accepting what the CI really means and what a proper statement about it is allows one to be correct 95% of the time. So, after the calculations the critical method is that making your CI the centre of discussion you've reduced your long run accuracy of statements dramatically and further reduced the useful relevance of your study.

Not sure if you (Mike Atiken) are still reading th...

2016-04-23T13:45:07.973+02:00

Not sure if you (Mike Atiken) are still reading this, but the code breaks for me from this point on.

RCIU_sim<-RCIU_sim[RCIU_simsamplemean]

(There isn't an object RCIU_simsamplemean, or an object mean_simRCIL or an object mean_sim2)

There is no special application - CI are always wh...

2016-04-01T06:40:19.518+02:00

There is no special application - CI are always what they are, as explained above. It sounds like they are misused - but I can't explain that.

Okay, do you mind explaining the application of CI...

2016-03-31T23:04:25.809+02:00

Okay, do you mind explaining the application of CI's in this context (please)?

Hi, no, that is incorrect. It is often taught inco...

2016-03-31T17:29:48.237+02:00

Hi, no, that is incorrect. It is often taught incorrectly. Confidence intervals are counterintuitive things.

Hi Daniel, quick question. All of the discussion a...

2016-03-31T10:35:47.851+02:00

Hi Daniel, quick question. All of the discussion around CI's has focused on population data. I'm wondering about the implications for individual data (such as neuropsych assessment).

As a neuropsych I was trained that the 95% CI provides a range about which we can be 95% confident contains the individuals 'true' score. But is this actually the case??? Is a more accurate interpretation that if we tested the patient over and over again (not accounting for practice effects) that their score would fall within the 95% CI,95% of the time...??

A bit late, but I came across a direct reference t...

2016-03-22T00:10:08.507+01:00

A bit late, but I came across a direct reference today:

* Kalbfleisch (1975, 1989) Probability and Statistical Inference II Example 16.3.1

Well, I seem to recall that prediction intervals c...

2016-03-06T19:36:27.224+01:00

Well, I seem to recall that prediction intervals can be used for future sample means (of independent samples), too, treating the future sample mean as an observable. The usual multiplier becomes sqrt(1/n_new + 1/n_old) rather than sqrt(1 + 1/n_old). So, loosely, a 95% CI is really about an 84% PI.

Seymour Geisser wrote a book on this.

Sure. Normal. Add colMeans(data) and see it...

2016-03-05T22:55:49.410+01:00

Sure. Normal. Add

colMeans(data)

and see it's 83.4 - ON AVERAGE

I do not know whether there is an issue with the s...

2016-03-05T22:34:43.425+01:00

I do not know whether there is an issue with the simulation, but with a large sample size (n =1000) and repeating the simulation 100 times, I've found the capture percentage is higher than 84% (I've found 92%)

@AntoViral (do not know how to sign, I have'nt a URL)

This is the code, adapted from yours:

library(ggplot2)

### creation of an empty dataframe

data <- data.frame()

N <- 100

for (i in 1:N) {

### original
## n=20 #set sample size
## nSims<-100000 #set number of simulations

### modified by me

set.seed(i) ### set seed for reproducibility

n=1000 #set sample size
nSims<-1000 #set number of simulations

x<-rnorm(n = n, mean = 100, sd = 15) #create sample from normal distribution

#95%CI
CIU<-mean(x)+qt(0.975, df = n-1)*sd(x)*sqrt(1/n)
CIL<-mean(x)-qt(0.975, df = n-1)*sd(x)*sqrt(1/n)

#plot data
#png(file="CI_mean.png",width=2000,height=2000, res = 300)
ggplot(as.data.frame(x), aes(x)) +
geom_rect(aes(xmin=CIL, xmax=CIU, ymin=0, ymax=Inf), fill="#E69F00") +
geom_histogram(colour="black", fill="grey", aes(y=..density..), binwidth=2) +
xlab("IQ") + ylab("number of people") + ggtitle("Data") + theme_bw(base_size=20) +
theme(panel.grid.major.x = element_blank(), axis.text.y = element_blank(), panel.grid.minor.x = element_blank()) +
geom_vline(xintercept=100, colour="black", linetype="dashed", size=1) +
coord_cartesian(xlim=c(50,150)) + scale_x_continuous(breaks=c(50,60,70,80,90,100,110,120,130,140,150)) +
annotate("text", x = mean(x), y = 0.02, label = paste("Mean = ",round(mean(x)),"\n","SD = ",round(sd(x)),sep=""), size=6.5)
#dev.off()

#Simulate Confidence Intervals
CIU_sim<-numeric(nSims)
CIL_sim<-numeric(nSims)
mean_sim<-numeric(nSims)

for(i in 1:nSims){ #for each simulated experiment
x<-rnorm(n = n, mean = 100, sd = 15) #create sample from normal distribution
CIU_sim[i]<-mean(x)+qt(0.975, df = n-1)*sd(x)*sqrt(1/n)
CIL_sim[i]<-mean(x)-qt(0.975, df = n-1)*sd(x)*sqrt(1/n)
mean_sim[i]<-mean(x) #store means of each sample
}

#Save only those simulations where the true value was inside the 95% CI
CIU_sim<-CIU_sim[CIU_sim<100]
CIL_sim<-CIL_sim[CIL_sim>100]

# cat((100*(1-(length(CIU_sim)/nSims+length(CIL_sim)/nSims))),"% of the 95% confidence intervals contained the true mean")

#Calculate how many times the observed mean fell within the 95% CI of the original study
mean_sim<-mean_sim[mean_sim>CIL&mean_sim<CIU]
# cat("The capture percentage for the plotted study, or the % of values within the observed confidence interval from",CIL,"to",CIU,"is:",100*length(mean_sim)/nSims,"%")

conf <- (100*(1-(length(CIU_sim)/nSims+length(CIL_sim)/nSims)))
capt <- 100*length(mean_sim)/nSims

### collect the data in a dataframe

data <- rbind(data, c(conf, capt))
names(data) <- c("95% CI", "Capture %")

}

### check the result

head(data)

cap <- ifelse(data[,2]<94.9, 1, 0)

plot(data,pch=19)
mtext(paste0("95% confidence intervals have a ", sum(cap), "% capture percentage"))

Predictions intervals typically are for a single n...

2016-03-04T14:51:23.572+01:00

Predictions intervals typically are for a single new observation. Are much wider than confidence intervals.

How is this different from a prediction interval v...

2016-03-03T19:47:57.231+01:00

How is this different from a prediction interval versus a confidence interval (as is often discussed in regression)? Rob Hyndman has a post on the this (http://robjhyndman.com/hyndsight/intervals/)

Apologies - didn't mean to post previous anony...

2016-03-03T11:08:12.594+01:00

Apologies - didn't mean to post previous anonymously!

Nice post - although you might want to include the...

2016-03-03T11:06:50.197+01:00

Nice post - although you might want to include the 95% replication capture intervals (what you should do for this type of inference) as a comparator for the the 95% CI.

Hacked into your script below:

if(!require(ggplot2)){install.packages('ggplot2')}
library(ggplot2)

n=30 #set sample size
nSims<-1000 #set number of simulations

x<-rnorm(n = n, mean = 100, sd = 15) #create sample from normal distribution
samplemean <-mean(x)
#95%CI
CIU<-samplemean+qt(0.975, df = n-1)*sd(x)*sqrt(1/n)
CIL<-samplemean-qt(0.975, df = n-1)*sd(x)*sqrt(1/n)
RCIU<-samplemean+qt(0.975, df = n-1)*sd(x)*sqrt(2/n)
RCIL<-samplemean-qt(0.975, df = n-1)*sd(x)*sqrt(2/n)

#plot data
#png(file="CI_mean.png",width=2000,height=2000, res = 300)
ggplot(as.data.frame(x), aes(x)) +
geom_rect(aes(xmin=CIL, xmax=CIU, ymin=0, ymax=Inf), fill="#E69F00") +
geom_histogram(colour="black", fill="grey", aes(y=..density..), binwidth=2) +
xlab("IQ") + ylab("number of people") + ggtitle("Data") + theme_bw(base_size=20) +
theme(panel.grid.major.x = element_blank(), axis.text.y = element_blank(), panel.grid.minor.x = element_blank()) +
geom_vline(xintercept=100, colour="black", linetype="dashed", size=1) +
coord_cartesian(xlim=c(50,150)) + scale_x_continuous(breaks=c(50,60,70,80,90,100,110,120,130,140,150)) +
annotate("text", x = mean(x), y = 0.02, label = paste("Mean = ",round(mean(x)),"\n","SD = ",round(sd(x)),sep=""), size=6.5)
#dev.off()

#Simulate Confidence Intervals
CIU_sim<-numeric(nSims)
CIL_sim<-numeric(nSims)
RCIU_sim<-numeric(nSims)
RCIL_sim<-numeric(nSims)
mean_sim<-numeric(nSims)
capture = 0
Tcrit = qt(0.975, df = n-1)
for(i in 1:nSims){ #for each simulated experiment
x<-rnorm(n = n, mean = 100, sd = 15) #create sample from normal distribution
sim_mean = mean(x)
CIW = Tcrit*sd(x)*sqrt(1/n)
CIU_sim[i]<-sim_mean+CIW
CIL_sim[i]<-sim_mean-CIW
RCIU_sim[i]<-sim_mean+CIW*sqrt(2)
RCIL_sim[i]<-sim_mean-CIW*sqrt(2)
mean_sim[i]<-sim_mean #store means of each sample
for (j in 1:i){
if(mean_sim[i]<=RCIU_sim[j]&&mean_sim[i]>=RCIL_sim[j]){
capture=capture+1
}
if(mean_sim[j]<=RCIU_sim[i]&&mean_sim[j]>=RCIL_sim[i]){
capture=capture+1
}
}
}

#How many simulations does the true value lie outside the 95% CI
CIU_sim<-CIU_sim[CIU_sim<100]
CIL_sim<-CIL_sim[CIL_sim>100]

#How many simulations does our original observed value lie outside the 95% RCI
RCIU_sim<-RCIU_sim[RCIU_simsamplemean]

cat((100*(1-(length(CIU_sim)/nSims+length(CIL_sim)/nSims))),"% of the 95% confidence intervals contained the true mean")
cat((100*(1-(length(RCIU_sim)/nSims+length(RCIL_sim)/nSims))),"% of the 95% replication capture intervals contained the observed mean")

#Calculate how many times the simulated mean fell within the 95% CI of the original study
mean_sim1<-mean_sim[mean_sim>CIL&mean_simRCIL&mean_sim<RCIU]
cat("The RCI capture percentage for the plotted study, or the % of means from other simulations within the observed 95% replication capture interval from",RCIL,"to",RCIU,"is:",100*length(mean_sim2)/nSims,"%")

#What proportion ofmany times did one simulation capture another within RCI
cat(100*(capture-nSims)/(i*(j-1)),"% of pairwise replication captures were successful from simulated 95% RCIs")

Note how Nate Silver gets this wrong in regard to ...

2016-03-02T21:40:15.576+01:00

Note how Nate Silver gets this wrong in regard to polling, despite his linking to a correct definition. (Some commentators attempted explanations.)
http://errorstatistics.com/2016/02/12/rubbing-off-uncertainty-confidence-and-nate-silver/

Nice explanation of capture percentage, clearly di...

2016-03-02T09:45:31.372+01:00

Nice explanation of capture percentage, clearly differentiating it from coverage percentage. AND, thanks for the link to Magnusson's mesmerizing demo.