A review of different scaling approaches under full invariance, partial invariance, and noninvariance for cross-sectional country comparisons in large-scale …

A Robitzsch, O Lüdtke - Psychological Test and …, 2020 - psychologie-aktuell.com
One of the primary goals of international large-scale assessments (ILSAs) in education is the
comparison of country means in student achievement. The present article introduces a …

Evaluating the psychometric properties of ChatGPT-generated questions

S Bhandari, Y Liu, Y Kwak, ZA Pardos - Computers and Education: Artificial …, 2024 - Elsevier
Not much is known about how LLM-generated questions compare to gold-standard,
traditional formative assessments concerning their difficulty and discrimination parameters …

Some thoughts on analytical choices in the scaling model for test scores in international large-scale assessment studies

A Robitzsch, O Lüdtke - Measurement Instruments for the Social Sciences, 2022 - Springer
International large-scale assessments (LSAs), such as the Programme for International
Student Assessment (PISA), provide essential information about the distribution of student …

Linking scores with patient-reported health outcome instruments: A validation study and comparison of three linking methods

BD Schalet, S Lim, D Cella, SW Choi - Psychometrika, 2021 - Springer
The psychometric process used to establish a relationship between the scores of two (or
more) instruments is generically referred to as linking. When two instruments with the same …

Robust and nonrobust linking of two groups for the Rasch model with balanced and unbalanced random DIF: A comparative simulation study and the simultaneous …

A Robitzsch - Symmetry, 2021 - mdpi.com
In this article, the Rasch model is used for assessing a mean difference between two groups
for a test of dichotomous items. It is assumed that random differential item functioning (DIF) …

On the choice of the item response model for scaling PISA data: Model selection based on information criteria and quantifying model uncertainty

A Robitzsch - Entropy, 2022 - mdpi.com
In educational large-scale assessment studies such as PISA, item response theory (IRT)
models are used to summarize students' performance on cognitive test items across …

Sample size requirements for applying diagnostic classification models

S Sen, AS Cohen - Frontiers in Psychology, 2021 - frontiersin.org
Results of a comprehensive simulation study are reported investigating the effects of sample
size, test length, number of attributes and base rate of mastery on item parameter recovery …

Lp loss functions in invariance alignment and Haberman linking with few or many groups

A Robitzsch - Stats, 2020 - mdpi.com
The comparison of group means in latent variable models plays a vital role in empirical
research in the social sciences. The present article discusses an extension of invariance …

On the treatment of missing item responses in educational large-scale assessment data: An illustrative simulation study and a case study using PISA 2018 …

A Robitzsch - European Journal of Investigation in Health …, 2021 - mdpi.com
Missing item responses are prevalent in educational large-scale assessment studies such
as the Programme for International Student Assessment (PISA). The current operational …

Alternatives to weighted item fit statistics for establishing measurement invariance in many groups

S Joo, M Valdivia, DS Valdivia… - Journal of Educational …, 2024 - journals.sagepub.com
Evaluating scale comparability in international large-scale assessments depends on
measurement invariance (MI). The root mean square deviation (RMSD) is a standard …