2015
DOI: 10.1002/smj.2459

Scientific apophenia in strategic management research: Significance tests & mistaken inference

Abstract: Research summary: This article uses distributional matching and posterior predictive checks to estimate the extent of false and inflated findings in empirical research on strategic management. Based on a sample of 300 papers in top outlets for research on strategic management, we estimate that if each study were repeated, 24–40 percent of significant coefficients would become insignificant at the five percent level. Our best guess is that for about half of these, the true coefficient is very close to 0. The re…
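The abstract's headline estimate — that a sizable share of significant coefficients would turn insignificant on exact replication — can be illustrated with a toy simulation. This is a hypothetical sketch, not the authors' distributional-matching method: the 50/50 mixture of null and true effects, the N(2, 1) effect distribution, and the unit-normal sampling noise are all illustrative assumptions.

```python
# Illustrative sketch (assumed numbers, not the article's method): how often a
# "significant" estimate fails to reach p < 0.05 in an independent replication.
import numpy as np

rng = np.random.default_rng(0)
n_studies = 100_000
# Assumption: half of tested effects are truly null, half are drawn from N(2, 1).
true_effect = np.where(rng.random(n_studies) < 0.5,
                       0.0,
                       rng.normal(2.0, 1.0, n_studies))
z1 = true_effect + rng.normal(size=n_studies)   # original study's z-statistic
z2 = true_effect + rng.normal(size=n_studies)   # independent replication
sig1 = np.abs(z1) > 1.96                        # significant at the 5% level
flip = np.mean(np.abs(z2[sig1]) <= 1.96)        # significant -> insignificant
print(f"Share of significant results that fail to replicate: {flip:.0%}")
```

Under these particular assumptions the flip rate lands in the same rough region the abstract reports, driven by two forces: significant nulls almost always fail to replicate, and selecting on |z| > 1.96 inflates the original estimates of real effects.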

Cited by 119 publications (109 citation statements)
References 19 publications (39 reference statements)
“…They found that average effect sizes were considerably smaller than originally reported. Closer to home, Goldfarb and King (2016) assessed a sample of 300 published studies. They estimated that 24-40% of the studies could not be replicated.…”
Section: Research Rigor Revisited
confidence: 99%
“…Despite its many flaws, null hypothesis significance testing (NHST) continues to be the choice of researchers in management and organization studies (Bettis, Ethiraj, Gambardella, Helfat, & Mitchell, 2016; Meyer et al., 2017). In NHST, the tenability of a null hypothesis (i.e., no effect or relation) is primarily judged based on the observed p value associated with the test of the hypothesis, and values smaller than 0.05 are often judged as providing sufficient evidence to reject it (Bettis et al., 2016; Goldfarb & King, 2016). Of the many problems associated with this interpretation of p values, the most pernicious is that it motivates researchers to engage in a practice called “p-hacking” and to report “crippled” p values (see below) (Aguinis, Werner, Abbott, Angert, Park, & Kohlhausen, 2010; Banks, Rogelberg et al., 2016).…”
Section: Reporting of p Values
confidence: 99%
“…For example, consider a researcher who interprets p = 0.0499 as sufficient evidence for rejecting the null hypothesis, and p = 0.0510 as evidence that the null hypothesis should be retained, and believes that journals are more likely to look favorably on rejected null hypotheses. This researcher will be highly motivated to “p-hack,” that is, find some way, such as adding control variables or eliminating outliers, to reduce the p value below the 0.05 threshold (Aguinis et al., 2010; Goldfarb & King, 2016; Starbuck, 2016; Waldman & Lilienfeld, 2016). Similarly, this researcher will be motivated to report p values using cutoffs (e.g., p < 0.05), rather than report the actual p value (0.0510).…”
Section: Reporting of p Values
confidence: 99%
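The specification-search incentive described in the quote above can be made concrete with a toy simulation (a hypothetical sketch, not drawn from the article or the citing papers): an analyst who tries ten unrelated null outcome variables and "reports" the first that clears p < 0.05 faces a realized false-positive rate far above the nominal 5 percent.

```python
# Hypothetical sketch: inflation of the false-positive rate under a
# specification search, when every tested relationship is truly null.
import numpy as np

rng = np.random.default_rng(1)
n_trials, n_outcomes, n = 5_000, 10, 50
crit = 1.96  # two-sided 5% critical value (normal approximation to the t-test)

hits = 0
for _ in range(n_trials):
    x = rng.normal(size=n)
    # Try up to 10 null outcomes; "report" the first significant correlation.
    for _ in range(n_outcomes):
        y = rng.normal(size=n)
        r = np.corrcoef(x, y)[0, 1]
        z = r * np.sqrt(n - 2) / np.sqrt(1 - r**2)  # t-statistic, t ~ z at n = 50
        if abs(z) > crit:
            hits += 1
            break
rate = hits / n_trials
print(f"Nominal 5% test, realized false-positive rate after search: {rate:.0%}")
```

With ten independent tries at a roughly 5 percent per-test error rate, the chance of at least one "finding" is about 1 − 0.95^10 ≈ 40 percent, which is what the simulation recovers.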
“…Based on 300 articles in prominent strategic management journals, Goldfarb and King (2015) estimated conservatively that about 25-40% of the published claims of statistical significance are actually false. Such audits strongly suggest that researchers or editors do not publish studies that report null-findings (Kepes et al, 2012).…”
Section: Three Important Types of Little Lies
confidence: 99%