2017
DOI: 10.1038/s41562-017-0189-z

Redefine statistical significance

Abstract: We propose to change the default P-value threshold for statistical significance from 0.05 to 0.005 for claims of new discoveries. The lack of reproducibility of scientific studies has caused growing concern over the credibility of claims of new discoveries based on 'statistically significant' findings. There has been much progress toward documenting and addressing several causes of this lack of reproducibility (for example, multiple testing, P-hacking, publication bias and under-powered studies). However, we …

Citations: cited by 2,099 publications (1,650 citation statements)
References: 15 publications (11 reference statements)
“…As seen in suggested that the p-value threshold for statistically significant findings should be lowered from 0.05 to 0.005 for new discoveries [30]. In a replication context it would be relevant to apply this stricter threshold to meta-analytic results.…”
Section: -25
Type: mentioning
confidence: 99%
“…[8][9][10][11][12] If journals start requiring a lower threshold for significance, the power of all experiments will be further reduced, exacerbating the above problems. By way of example, suppose we are conducting a two group experiment with independent samples in each group.…”
Section: Introduction
Type: mentioning
confidence: 99%
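The power argument in the statement above can be made concrete with a rough calculation. The sketch below is not taken from the cited paper: it assumes a two-sided z-test (normal approximation) for two independent groups of equal size, and the function name `two_sample_power`, the standardized effect size of 0.5, and the 64 participants per group are all illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal, mean 0 and sd 1


def two_sample_power(effect_size, n_per_group, alpha):
    """Approximate power of a two-sided, two-sample z-test.

    effect_size is the standardized mean difference (hypothetical input),
    n_per_group is the sample size in each of the two groups.
    """
    # Critical value for a two-sided test at level alpha.
    z_crit = _STD_NORMAL.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the alternative.
    shift = effect_size * sqrt(n_per_group / 2)
    # Probability of landing in either rejection region.
    return _STD_NORMAL.cdf(shift - z_crit) + _STD_NORMAL.cdf(-shift - z_crit)


if __name__ == "__main__":
    for alpha in (0.05, 0.005):
        power = two_sample_power(0.5, 64, alpha)
        print(f"alpha={alpha}: approximate power {power:.2f}")
```

Under these assumptions the same design has power of about 0.81 at a threshold of 0.05 and about 0.51 at 0.005, which is the kind of reduction the statement describes when sample size is held fixed.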
“…From the perspective of frequentists, P-values only provide information pertaining to whether a null hypothesis about the extremity of an observed distribution can be rejected; they do not say anything about whether and how strongly evidence found from a specific study supports a hypothesis. Furthermore, as the current debates indicated, conventional P-value thresholds widely used in the field, particularly p < .05, could support only very weak evidence, or could not support the presence of positive evidence at all (Benjamin et al., 2018). Instead, BFs show us the strength of evidence directly; BF thresholds used in the field can also be considered better thresholds for making practical decisions about accepting a specific hypothesis based on evidence (Kass & Raftery, 1995).…”
Section: Running Head: Utilizing Bayesian Statistics 18
Type: mentioning
confidence: 99%
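One common way to relate a P-value to a Bayes factor is the upper bound −1/(e · p · ln p), valid for p < 1/e. The statement above does not say which calibration it has in mind, so treating this bound as the relevant one is an assumption, and the function name `bayes_factor_bound` is hypothetical. A minimal sketch:

```python
import math


def bayes_factor_bound(p_value):
    """Upper bound on the Bayes factor in favour of the alternative.

    Uses the calibration -1 / (e * p * ln p), which only applies
    for 0 < p < 1/e. Choosing this calibration is an assumption.
    """
    if not 0.0 < p_value < 1.0 / math.e:
        raise ValueError("p_value must lie strictly between 0 and 1/e")
    return -1.0 / (math.e * p_value * math.log(p_value))


if __name__ == "__main__":
    for p in (0.05, 0.005):
        print(f"p={p}: Bayes factor at most {bayes_factor_bound(p):.1f}")
```

Under this bound, p = 0.05 corresponds to a Bayes factor of at most about 2.5, while p = 0.005 corresponds to at most about 13.9. On the Kass and Raftery (1995) scale, a Bayes factor between 1 and 3 is "not worth more than a bare mention" and one between 3 and 20 counts as "positive" evidence, which is consistent with the statement's description of p < .05 as weak support.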
“…Although recent debates about the frequentist perspective in the field of quantitative methods have intensified concerns regarding how to collect and test data properly (Benjamin et al., 2018), the majority of studies in the fields related to moral education have tended to use such a perspective. We have been used to employing frequentist methodology, such as P-values, in empirical studies of moral education.…”
Section: Introduction
Type: mentioning
confidence: 99%