Comments on The 20% Statistician: Why you don't need to adjust your alpha level for all tests you'll do in your lifetime.

I did not. Please provide the quote you are referr...

2018-02-10T09:39:33.577+01:00

I did not. Please provide the quote you are referring to. Then we can discuss what I meant.

In your online class you just said the opposite of...

2018-02-10T04:39:57.718+01:00

In your online class you just said the opposite of this post.

I can understand Fisher's dismay, but it remai...

2016-11-13T13:28:26.357+01:00

I can understand Fisher's dismay, but it remains true :)

Nice post! We should definitely pay more attention...

2016-11-13T11:55:12.671+01:00

Nice post! We should definitely pay more attention to the logical structure of the inferences we make, in particular whether multiple pieces of evidence are combined in a disjunctive (OR operator) or conjunctive (AND operator) manner. I also think it is sometimes sensible to do neither and simply "average" multiple pieces of evidence (without correction) when we interpret our results.

On a different topic, Fisher would have probably hated to read this sentence at the end: "There is only one reason to calculate p-values, and that is to control Type 1 error rates using a Neyman-Pearson approach"

See (Gigerenzer, 2004) http://library.mpib-berlin.mpg.de/ft/gg/GG_Mindless_2004.pdf
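To make the disjunctive versus conjunctive distinction above concrete, here is a minimal sketch of how the Type 1 error rate behaves under each rule. It assumes the tests are independent and all use the same alpha; the function names are made up for illustration.

```python
def error_rate_any(alpha: float, k: int) -> float:
    """Disjunctive (OR) claim: probability that at least one of k
    independent tests is significant when every null is true."""
    return 1 - (1 - alpha) ** k


def error_rate_all(alpha: float, k: int) -> float:
    """Conjunctive (AND) claim: probability that all k independent
    tests are significant when every null is true."""
    return alpha ** k


if __name__ == "__main__":
    for k in (1, 2, 5):
        print(k, round(error_rate_any(0.05, k), 4), round(error_rate_all(0.05, k), 8))
```

Under these assumptions the OR rule inflates the error rate as tests are added, while the AND rule shrinks it, which is why only the former calls for a stricter alpha per test.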

Here's another puzzler for folks interested in...

2016-10-27T19:21:50.377+02:00

Here's another puzzler for folks interested in the issue of error control over families of tests: Should researchers be correcting for multiple tests, even when they themselves did not run the tests, but all of the tests were run on the same data? Link is HERE.

If you have some data, you can better use a meta-a...

2016-05-07T06:57:54.087+02:00

If you have some data, you would be better off using a meta-analysis. A chi-square would be a dichotomous test (significant: yes or no), whereas a meta-analysis is continuous. Alternatively, you might be interested in the literature on controlling the false discovery rate (instead of the Type 1 error rate); see Benjamini & Hochberg, 1995.
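For readers unfamiliar with false discovery rate control, the step-up procedure from Benjamini & Hochberg (1995) can be sketched as follows. This is an illustration in plain Python, not code from the post; the function name and the example p-values are hypothetical.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans marking which hypotheses are rejected
    by the Benjamini-Hochberg step-up procedure at FDR level q."""
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * q.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            max_rank = rank

    # Reject every hypothesis whose rank is at or below that k.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Hypothetical p-values, for illustration only.
    print(benjamini_hochberg([0.02, 0.03, 0.035, 0.9]))
```

Note the step-up behaviour: in the example, the two smallest p-values miss their own thresholds but are still rejected because the third one meets its threshold.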

HI Daniel, Thinking about multiple tests: What a...

2016-05-07T00:53:37.899+02:00

Hi Daniel,

Thinking about multiple tests: what about calculating the number of significant findings that you'd expect to observe due to chance (given the number of tests), and then running a chi-squared test to determine whether the number of significant results you obtained is itself significantly different from what you'd expect due to chance?

Intuitively I feel like this makes sense... what do you think?
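A rough sketch of the idea in this comment, assuming all tests are independent and share the same alpha, and using a one-degree-of-freedom goodness-of-fit chi-square on the counts of significant and non-significant results. The function name and the counts in the example are hypothetical, and the chi-square approximation is poor when the expected number of significant results is small.

```python
import math


def chi_square_vs_chance(n_tests, n_significant, alpha=0.05):
    """Compare the observed count of significant tests with the count
    expected by chance alone (n_tests * alpha) when every null is true."""
    expected_sig = n_tests * alpha
    expected_nonsig = n_tests * (1 - alpha)
    observed_nonsig = n_tests - n_significant

    # Goodness-of-fit statistic over the two categories.
    stat = ((n_significant - expected_sig) ** 2 / expected_sig
            + (observed_nonsig - expected_nonsig) ** 2 / expected_nonsig)

    # Upper-tail probability of a chi-square with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return expected_sig, stat, p_value


if __name__ == "__main__":
    # Hypothetical example: 12 significant results out of 100 tests.
    print(chi_square_vs_chance(100, 12))
```

The result is a single yes-or-no verdict about the whole set of tests; it says nothing about which individual findings are real.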

You cannot combine p-values (a Bayesian t-test doe...

2016-02-20T11:05:52.271+01:00

You cannot combine p-values (a Bayesian t-test does not give a p-value). You could do both tests, interpret the p-value in terms of an NP approach (in the long run, I would rarely be wrong if I act as if there is an effect), and then interpret the evidence at hand (the current data provide strong/weak evidence for the alternative hypothesis).

No, it would not. You perform separate tests for e...

2016-02-20T11:04:20.655+01:00

No, it would not. You perform separate tests for each individual study. If you want to evaluate all the studies, you need to do a meta-analysis. This has a new theoretical prediction (is there an effect if I combine all these studies?). If this were really one big investigation, it would not make sense to publish these papers separately, right? And if it makes sense to publish them separately, then you don't need to control the error rate across all studies.

if i got it correct (maybe i didn't) when you ...

2016-02-15T21:45:55.141+01:00

If I got it correct (maybe I didn't), when you say 'Combining both approaches is probably a win-win, where long run error rates are controlled, after which the evidential value in individual studies is interpreted (and, because why not, parameters are estimated).', does it mean one could perform, say, a Bayesian t-test and a Welch t-test on a pairwise comparison and report both Bayesian and frequentist p-values to come up with a decision? And would it even be OK to combine those p-values?

Hi Etienne, I'm thinking of registered reports...

2016-02-14T20:18:35.343+01:00

Hi Etienne, I'm thinking of registered reports. There, you could pre-register a set of 2 studies, and they will be published regardless. Let's say the second p-value is 0.8. If you indeed had high power (e.g., 95%) for a minimum effect, you could decide that the effect is small, or null. That should be good to know, right?

>>>>For example, it is perfectly fine ...

2016-02-14T20:14:47.026+01:00

>>>>For example, it is perfectly fine to pre-register a set of two experiments, the second a close replication of the first, where you will choose to reject the null-hypothesis if the p-value is smaller than 0.2236 in both experiments. The probability that you will reject the null hypothesis twice in a row if the null hypothesis is true is α * α, or 0.2236 * 0.2236 = 0.05.

Interesting logic. In practice, however, what would happen if after your first experiment, the *one* and only target statistical test yields p < .30? Do you still run the second experiment?

I guess you have to, given that you publicly pre-registered the study. But if the first experiment was highly powered (e.g., 95%) to detect a plausible effect size (e.g., d = .20), doesn't it seem odd to still run the second experiment?
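The arithmetic in the quoted passage can be checked with a short sketch. It assumes the experiments are independent and that the claim is made only when every experiment rejects the null; the function name is made up for illustration.

```python
def per_study_alpha(overall_alpha: float, n_studies: int) -> float:
    """Per-study threshold such that rejecting the null in all
    n_studies independent studies happens with probability
    overall_alpha when the null is true."""
    return overall_alpha ** (1 / n_studies)


if __name__ == "__main__":
    a = per_study_alpha(0.05, 2)   # square root of 0.05
    print(round(a, 4))             # per-study threshold
    print(round(a * a, 4))         # joint Type 1 error rate across both studies
```

For two studies this is simply the square root of 0.05, which matches the 0.2236 threshold in the quote.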