查看文章

Manipulating the alpha level cannot cure significance testing–comments on" Redefine statistical significance"

作者

David Trafimow, Valentin Amrhein, Corson N Areshenkoff, Carlos Barrera-Causil, Eric J Beh, Yusuf Bilgiç, Roser Bono, Michael T Bradley, William M Briggs, Héctor A Cepeda-Freyre, Sergio E Chaigneau, Daniel R Ciocca, Juan Carlos Correa, Denis Cousineau, Michiel R de Boer, Subhra Sankar Dhar, Igor Dolgov, Juana Gómez-Benito, Marian Grendar, James Grice, Martin E Guerrero-Gimenez, Andrés Gutiérrez, Tania B Huedo-Medina, Klaus Jaffe, Armina Janyan, Ali Karimnezhad, Fränzi Korner-Nievergelt, Koji Kosugi, Martin Lachmair, Rubén Ledesma, Roberto Limongi, Marco Tullio Liuzza, Rosaria Lombardo, Michael Marks, Gunther Meinlschmidt, Ladislas Nalborczyk, Hung T Nguyen, Raydonal Ospina, Jose D Perezgonzalez, Roland Pfister, Juan José Rahona, David A Rodríguez-Medina, Xavier Romão, Susana Ruiz-Fernández, Isabel Suarez, Marion Tegethoff, Mauricio Tejo, Rens van de Schoot, Ivan Vankov, Santiago Velasco-Forero, Tonghui Wang, Yuki Yamada, Felipe C Zoppino, Fernando Marmolejo-Ramos

发表日期

2017/11/14

期刊

PeerJ Preprints

卷号

页码范围

e3411v1

出版商

PeerJ Inc.

简介

We argue that depending on p-values to reject null hypotheses, including a recent call for changing the canonical alpha level for statistical significance from .05 to .005, is deleterious for the finding of new discoveries and the progress of science. Given that blanket and variable criterion levels both are problematic, it is sensible to dispense with significance testing altogether. There are alternatives that address study design and determining sample sizes much more directly than significance testing does; but none of the statistical tools should replace significance testing as the new magic method giving clear-cut mechanical answers. Inference should not be based on single studies at all, but on cumulative evidence from multiple independent studies. When evaluating the strength of the evidence, we should consider, for example, auxiliary assumptions, the strength of the experimental design, or implications for applications. To boil all this down to a binary decision based on a p-value threshold of .05, .01, .005, or anything else, is not acceptable.

引用总数

被引用次数：10

2017201820193 4 3

学术搜索中的文章

Manipulating the alpha level cannot cure significance testing–comments on" Redefine statistical significance"

D Trafimow, V Amrhein, CN Areshenkoff… - PeerJ Preprints, 2017

被引用次数：10 相关文章所有 11 个版本