作者
Indrė Žliobaitė, Albert Bifet, Jesse Read, Bernhard Pfahringer, Geoff Holmes
发表日期
2015/3
期刊
Machine Learning
卷号
98
页码范围
455-482
出版商
Springer US
简介
Predictive modeling on data streams plays an important role in modern data analysis, where data arrives continuously and needs to be mined in real time. In the stream setting the data distribution is often evolving over time, and models that update themselves during operation are becoming the state-of-the-art. This paper formalizes a learning and evaluation scheme of such predictive models. We theoretically analyze evaluation of classifiers on streaming data with temporal dependence. Our findings suggest that the commonly accepted data stream classification measures, such as classification accuracy and Kappa statistic, fail to diagnose cases of poor performance when temporal dependence is present, therefore they should not be used as sole performance indicators. Moreover, classification accuracy can be misleading if used as a proxy for evaluating change detectors with datasets that have temporal …
引用总数
201420152016201720182019202020212022202320241810152420241711118
学术搜索中的文章