Accurate tests of statistical significance for rWG and average deviation interrater agreement indexes.

WP Dunlap, MJ Burke… - Journal of Applied …, 2003 - psycnet.apa.org
Journal of Applied Psychology, 2003psycnet.apa.org
The authors demonstrated that the most common statistical significance test used with r WG-
type interrater agreement indexes in applied psychology, based on the chi-square
distribution, is flawed and inaccurate. The chi-square test is shown to be extremely
conservative even for modest, standard significance levels (eg,. 05). The authors present an
alternative statistical significance test, based on Monte Carlo procedures, that produces the
equivalent of an approximate randomization test for the null hypothesis that the actual …
Abstract
The authors demonstrated that the most common statistical significance test used with r WG-type interrater agreement indexes in applied psychology, based on the chi-square distribution, is flawed and inaccurate. The chi-square test is shown to be extremely conservative even for modest, standard significance levels (eg,. 05). The authors present an alternative statistical significance test, based on Monte Carlo procedures, that produces the equivalent of an approximate randomization test for the null hypothesis that the actual distribution of responding is rectangular and demonstrate its superiority to the chi-square test. Finally, the authors provide tables of critical values and offer downloadable software to implement the approximate randomization test for r WG type and for average deviation (AD)-type interrater agreement indexes. The implications of these results for studying a broad range of interrater agreement problems in applied psychology are discussed.(PsycINFO Database Record (c) 2016 APA, all rights reserved)
American Psychological Association
以上显示的是最相近的搜索结果。 查看全部搜索结果