Evaluation toolkit for robustness testing of automatic essay scoring systems

A Kabra, M Bhatia, YK Singla, J Jessy Li… - Proceedings of the 5th …, 2022 - dl.acm.org
Automatic scoring engines have been used for scoring approximately fifteen million test-
takers in just the last three years. This number is increasing further due to COVID-19 and the …

[PDF][PDF] Calling Out Bluff: Evaluation Toolkit For Robustness Testing Of Automatic Essay Scoring Systems

A KABRA, M BHATIA, Y KUMAR, JJ LI, D Jin… - arXiv preprint arXiv …, 2020 - academia.edu
Automatic scoring engines have been used for scoring approximately fifteen million test
takers in just the last three years. This number is increasing further due to COVID-19 and the …

Automatic essay scoring systems are both overstable and oversensitive: explaining why and proposing defenses

Y Kumar, S Parekh, S Singh, JJ Li, RR Shah… - Dialogue & …, 2023 - journals.uic.edu
Abstract Deep-learning based Automatic Essay Scoring (AES) systems are being actively
used in various high-stake applications in education and testing. However, little research …

[PDF][PDF] Trustworthy Automated Essay Scoring without Explicit Construct Validity.

P West-Smith, S Butler, E Mayfield - AAAI Spring Symposia, 2018 - help.turnitin.com
Automated essay scoring (AES) is a broadly used application of machine learning, with a
long history of realworld use that impacts high-stakes decision-making for students …

[PDF][PDF] Calling out bluff: Attacking the robustness of automatic scoring systems with simple adversarial testing

Y Kumar, M Bhatia, A Kabra, JJ Li, D Jin… - arXiv preprint arXiv …, 2020 - ask.qcloudimg.com
A significant progress has been made in deep-learning based Automatic Essay Scoring
(AES) systems in the past two decades. The performance commonly measured by the …

AES systems are both overstable and oversensitive: Explaining why and proposing defenses

YK Singla, S Parekh, S Singh, JJ Li, RR Shah… - arXiv preprint arXiv …, 2021 - arxiv.org
Deep-learning based Automatic Essay Scoring (AES) systems are being actively used by
states and language testing agencies alike to evaluate millions of candidates for life …

My teacher thinks the world is flat! interpreting automatic essay scoring mechanism

S Parekh, YK Singla, C Chen, JJ Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Significant progress has been made in deep-learning based Automatic Essay Scoring (AES)
systems in the past two decades. However, little research has been put to understand and …

A study on performance sensitivity to data sparsity for automated essay scoring

Y Ran, B He, J Xu - … Conference, KSEM 2018, Changchun, China, August …, 2018 - Springer
Automated essay scoring (AES) attempts to rate essays automatically using machine
learning and natural language processing techniques, hoping to dramatically reduce the …

Unveiling the Tapestry of Automated Essay Scoring: A Comprehensive Investigation of Accuracy, Fairness, and Generalizability

K Yang, M Raković, Y Li, Q Guan, D Gašević… - Proceedings of the …, 2024 - ojs.aaai.org
Automatic Essay Scoring (AES) is a well-established educational pursuit that employs
machine learning to evaluate student-authored essays. While much effort has been made in …

Learning automated essay scoring models using item-response-theory-based scores to decrease effects of rater biases

M Uto, M Okano - IEEE Transactions on Learning …, 2021 - ieeexplore.ieee.org
In automated essay scoring (AES), scores are automatically assigned to essays as an
alternative to grading by humans. Traditional AES typically relies on handcrafted features …