[Book][B] Developing and validating test items

TM Haladyna - 2013 - books.google.com
Since test items are the building blocks of any test, learning how to develop and validate test
items has always been critical to the teaching-learning process. As they grow in importance …

The influence of training and experience on rater performance in scoring spoken language

L Davis - Language Testing, 2016 - journals.sagepub.com
Two factors were investigated that are thought to contribute to consistency in rater scoring
judgments: rater training and experience in scoring. Also considered were the relative …

Listeners and raters: Similarities and differences in evaluation of accented speech

X Yan, A Ginther - Assessment in second language pronunciation, 2017 - taylorfrancis.com
This chapter reviews research findings on listener background characteristics that influence
evaluations of L2 accented speech, and discusses how these findings may affect both …

[Book][B] Assessing English for professional purposes

U Knoch, S Macqueen - 2019 - books.google.com
Winner of the ILTA/SAGE Best Book Award 2020. Assessing English for Professional
Purposes provides a state-of-the-art account of the various kinds of language assessments …

Task and rater effects in L2 speaking and writing: A synthesis of generalizability studies

Y In'nami, R Koizumi - Language Testing, 2016 - journals.sagepub.com
We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and
Mollaun's (2006) call for research into the contextual features that are considered related to …

Building a better rubric: Mixed methods rubric revision

G Janssen, V Meier, J Trace - Assessing Writing, 2015 - Elsevier
Because rubrics are the foundation of a rater's scoring process, principled rubric use
requires systematic review as rubrics are adopted and adapted (Crusan, 2010, p. 72) into …

A comparison of newly-trained and experienced raters on a standardized writing assessment

Y Attali - Language Testing, 2016 - journals.sagepub.com
A short training program for evaluating responses to an essay writing task consisted of
scoring 20 training essays with immediate feedback about the correct score. The same …

Adapting CEF-descriptors for rating purposes: Validation by a combined rater training and scale revision approach

C Harsch, G Martin - Assessing Writing, 2012 - Elsevier
We explore how a local rating scale can be based on the Common European Framework
CEF-proficiency scales. As part of the scale validation (Alderson, 1991; Lumley, 2002), we …

Severity differences among self-assessors, peer-assessors, and teacher assessors rating EFL essays

R Esfandiari, CM Myford - Assessing Writing, 2013 - Elsevier
We compared three assessor types (self-assessors, peer-assessors, and teacher assessors)
to determine whether they differed in the levels of severity they exercised when rating …

Investigating rater severity/leniency in interpreter performance testing: A multifaceted Rasch measurement approach

C Han - Interpreting, 2015 - jbe-platform.com
Rater-mediated performance assessment (RMPA) is a critical component of interpreter
certification testing systems worldwide. Given the acknowledged rater variability in RMPA …