作者
Anshu Zhang, Wenzhong Shi, Geoffrey I Webb
发表日期
2016/7
期刊
Data mining and knowledge discovery
卷号
30
页码范围
928-963
出版商
Springer US
简介
In association rule mining, the trade-off between avoiding harmful spurious rules and preserving authentic ones is an ever critical barrier to obtaining reliable and useful results. The statistically sound technique for evaluating statistical significance of association rules is superior in preventing spurious rules, yet can also cause severe loss of true rules in presence of data error. This study presents a new and improved method for statistical test on association rules with uncertain erroneous data. An original mathematical model was established to describe data error propagation through computational procedures of the statistical test. Based on the error model, a scheme combining analytic and simulative processes was designed to correct the statistical test for distortions caused by data error. Experiments on both synthetic and real-world data show that the method significantly recovers the loss in true rules …
引用总数
2016201720182019202020212022202334235232
学术搜索中的文章