Methodological considerations for predicting at-risk students

C Koutcheme, S Sarsa, A Hellas, L Haaranen… - Proceedings of the 24th …, 2022 - dl.acm.org
Proceedings of the 24th Australasian Computing Education Conference, 2022dl.acm.org
Educational researchers have long sought to increase student retention. One stream of
research focusing on this seeks to automatically identify students who are at risk of dropping
out. Studies tend to agree that earlier identification of at-risk students is better, providing
more room for targeted interventions. We looked at the interplay of data and predictive
power of machine learning models used to identify at-risk students. We critically examine the
often used approach where data collected from weeks 1, 2,..., n is used to predict whether a …
Educational researchers have long sought to increase student retention. One stream of research focusing on this seeks to automatically identify students who are at risk of dropping out. Studies tend to agree that earlier identification of at-risk students is better, providing more room for targeted interventions. We looked at the interplay of data and predictive power of machine learning models used to identify at-risk students. We critically examine the often used approach where data collected from weeks 1, 2,..., n is used to predict whether a student becomes inactive in the subsequent weeks w, w ≥ n + 1, pointing out issues with this approach that may inflate models’ predictive power. Specifically, our empirical analysis highlights that including students who have become inactive on week n or before, where n > 1, to the data used to identify students who are inactive on the following weeks is a significant cause of bias. Including students who dropped out during the first week makes the problem significantly easier, since they have no data in the subsequent weeks. Based on our results, we recommend including only active students until week n when building and evaluating models for predicting dropouts in subsequent weeks and evaluating and reporting the particularities of the respective course contexts.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果