Comments on The 20% Statistician: So you banned p-values, how’s that working out for you?

p value should be there, just to validate the meth...

2019-05-04T05:28:02.383+02:00

The p value should be there, just to validate methodological correctness and to give research work uniformity, or to strengthen the justification of the findings within the terms of the individual study, but not to support the hypothesis as a universal fact. Of course, we can encourage reporting power and effect size, because there are many studies where power is compromised. What I liked about Trafimow’s article is that it shakes up the dishonest attempts of researchers to get their papers published in journals based on p values with unrealistic elements like exceptionally low n (as small as 3), skewed distributions, non-homogeneity, etc. BASP might have grown tired of such papers. That is why they wrote "we encourage the use of larger sample sizes than is typical in much psychology research, because as the sample size increases, descriptive statistics become increasingly stable and sampling error is less of a problem" (from Trafimow & Marks, 2015, doi:10.1080/01973533.2015.1012991). Honest and judicious use of p or CI is always welcome.
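
As a rough illustration of the quoted point about sample size, here is a minimal sketch of how the standard error of a sample mean shrinks as n grows. It assumes independent observations with a known standard deviation of 1; the function name and the chosen sample sizes are illustrative only, not taken from the editorial.

```python
import math


def standard_error_of_mean(sd: float, n: int) -> float:
    """Standard error of the mean for n independent observations (assumed known sd)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return sd / math.sqrt(n)


# Illustrative sample sizes, including the very small n = 3 mentioned in the comment.
for n in (3, 30, 300):
    print(f"n = {n:>3}: SE of the mean = {standard_error_of_mean(1.0, n):.3f}")
```

Under these assumptions, going from n = 3 to n = 300 cuts the standard error by a factor of ten, which is one way to read "descriptive statistics become increasingly stable".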

Hi, the Type 1 error rate has increased because pe...

2017-10-20T17:51:40.715+02:00

Hi, the Type 1 error rate has increased because people stopped controlling their error rates at 5% when reporting multiple tests. So it must logically be higher.
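
The arithmetic behind this can be sketched as follows, under assumptions the comment does not state: all null hypotheses are true, the tests are independent, and each is run at alpha = .05 with no correction. The function name is hypothetical.

```python
def familywise_error_rate(alpha: float, n_tests: int) -> float:
    """Probability of at least one false positive across n_tests independent
    tests of true null hypotheses, each run at level alpha without correction."""
    return 1.0 - (1.0 - alpha) ** n_tests


# Illustrative numbers of uncorrected tests per paper.
for n_tests in (1, 2, 5, 10):
    rate = familywise_error_rate(0.05, n_tests)
    print(f"{n_tests:>2} tests: familywise error rate = {rate:.3f}")
```

With correlated tests the inflation would be smaller than this formula suggests, but the rate still cannot fall below the single-test alpha.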

Hi Daniel, could you please tell me how you have c...

2017-10-20T17:48:23.793+02:00

Hi Daniel, could you please tell me how you have come to think that the type I error rate increased? You seem to believe (but correct me if I am wrong) that a p-value tells you whether a type I error has been made or not. But that is simply not true. If my decision criterion is: reject when p is between .95 and 1.00, for instance, the type I error rate is the same as when I reject when p < .05. In both cases it is .05. So, given the first criterion, rejecting when p = .99 provides perfect control of type I errors (but of course not of type II errors). So, unless one magically determines which null hypotheses are actually true, there is no way of determining whether or not a type I error has been made. A rejection of a true null is a type I error regardless of the value of p used to make the decision. (The idea that p-values tell you something about the probability of a type I error is called the local type I error fallacy.)
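
The claim that both decision rules reject a true null about 5% of the time can be checked with a small simulation. This is only a sketch under stated assumptions: a two-sided z-test with a true null, so the test statistic is standard normal and the p-value is uniform on [0, 1]. The function name, seed, and number of simulations are arbitrary choices.

```python
import random
from statistics import NormalDist


def simulate_null_p_values(n_sims: int, seed: int = 1) -> list[float]:
    """Two-sided z-test p-values when the null hypothesis is true."""
    rng = random.Random(seed)
    std_normal = NormalDist()
    p_values = []
    for _ in range(n_sims):
        z = rng.gauss(0.0, 1.0)  # test statistic under a true null
        p_values.append(2.0 * (1.0 - std_normal.cdf(abs(z))))
    return p_values


p_values = simulate_null_p_values(100_000)
# Rule A: reject when p < .05. Rule B: reject when p is between .95 and 1.00.
rate_low = sum(p < 0.05 for p in p_values) / len(p_values)
rate_high = sum(p >= 0.95 for p in p_values) / len(p_values)
print(f"reject when p < .05:   {rate_low:.3f}")
print(f"reject when p >= .95:  {rate_high:.3f}")
```

Both proportions should land close to .05 when the null is true. The simulation says nothing about power, which is where the two rules differ.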

The fact that researchers are struggling isn't...

2016-06-02T06:31:53.203+02:00

The fact that researchers are struggling isn't the grounds on which to bash the editors, but the very real fact that they truly misunderstand the statistics of significance tests is. I came to learn that through Trafimow's papers.

ha, thanks! I'm sure someone will remove it ve...

2016-02-12T17:16:04.861+01:00

ha, thanks! I'm sure someone will remove it very soon ;)

Hi Chris, I think the editors are to blame for not...

2016-02-12T17:15:28.099+01:00

Hi Chris, I think the editors are to blame for not taking the responsibility to check the articles they publish better than they have. Also, the surprisingly large number of citations to articles that are not good, and that suggest NHST is crap, annoys me: http://daniellakens.blogspot.nl/2015/11/the-relation-between-p-values-and.html Obviously the authors and reviewers can and should improve, but I'm criticizing the editorial strategy here.

It seems that there is a lot of editor bashing her...

2016-02-12T16:56:12.034+01:00

It seems that there is a lot of editor bashing here. I think that is inappropriate. The important lesson to be learned from this ban is that, without NHST, most researchers--and readers of empirical research--are incapable of evaluating empirical data. The fact that researchers are struggling should not be used to mock Trafimow and Marks. If anything, their ban on NHST has helped make salient just how much of our critical thinking we have outsourced to misunderstood statistical procedures.

You are right - I don't think they banned them...

2016-02-11T23:06:39.327+01:00

You are right - I don't think they banned them, but they are missing from many papers (maybe the majority). The editors/reviewers should have asked for them, but I don't really think they are intentionally banned. I was slightly exaggerating there. ;)

Interesting post! You write: "They also banne...

2016-02-11T22:50:31.217+01:00

Interesting post! You write: "They also banned reporting sample sizes for between subject conditions" but I don't remember seeing a ban for this anywhere, and checked with the editor & he says they never banned reporting sample sizes for between subject conditions -- only p-values and traditional confidence intervals. Did I miss something? Thanks for all your work -- cheers

I agree. (With the second point, I have no opinion...

2016-02-11T15:06:33.972+01:00

I agree. (With the second point, I have no opinions on Shakespearian English).

If you read the editorial, you'll see that the...

2016-02-11T15:05:39.740+01:00

If you read the editorial, you'll see that they are not exactly encouraging authors to do Bayes Factors either. I personally really like Bayes Factors for hypothesis testing, but I would perhaps not risk submitting them to BASP after reading that editorial.

https://en.wikipedia.org/wiki/Basic_and_Applied_So...

2016-02-11T13:22:48.862+01:00

https://en.wikipedia.org/wiki/Basic_and_Applied_Social_Psychology

While I think it's not strictly incorrect, per...

2016-02-11T12:52:47.953+01:00

While I think it's not strictly incorrect, personally I feel it ought to be 'thou shalt' not 'thou shall' ;)

I agree banning p-values without giving any alternative for hypothesis-testing was a silly move.

It's almost as if the problem is with the ince...

2016-02-11T12:48:26.344+01:00

It's almost as if the problem is with the incentives of the publishing system, rather than with the specific ways in which those problems manifest themselves.

I suspect that if p-values were declared illegal worldwide tomorrow, we would quickly see a consensus around d=.02 or r=.10 or pesq2=.02 or even BF=6 as the new shorthand for "Look what a clever scientist I am, can I have some more money now please?".

On the other hand, change takes time. Many of these articles will have been in the pipeline when the journal announced its new policies. Every journey starts with a small step, etc. The problem is to determine when to examine one's progress on that journey and decide whether to carry on, or go home and have a cup of tea on your familiar comfy sofa.

It's surprising that even now with JASP being ...

2016-02-11T11:08:36.222+01:00

It's surprising that, even now with JASP being so easy to use, nobody reported a BF. That's even less improvement than my low expectations allowed for.

I did not encounter any paper using Bayesian stati...

2016-02-11T08:58:26.252+01:00

I did not encounter any paper using Bayesian statistics in 2015. Note that it would have made sense in the 8 study paper I mention in the blog post where they don't find support for their hypotheses, but even there, nothing.

So how many papers in BASP did Bayesian statistics...

2016-02-11T08:41:01.446+01:00

So how many papers in BASP did Bayesian statistics one year prior to the p-value ban versus one year after? Is there a qualitative difference?