作者
Nada Lavrač, Branko Kavšek, Peter Flach, Ljupčo Todorovski
发表日期
2004
期刊
Journal of Machine Learning Research
卷号
5
期号
Feb
页码范围
153-188
简介
This paper investigates how to adapt standard classification rule learning approaches to subgroup discovery. The goal of subgroup discovery is to find rules describing subsets of the population that are sufficiently large and statistically unusual. The paper presents a subgroup discovery algorithm, CN2-SD, developed by modifying parts of the CN2 classification rule learner: its covering algorithm, search heuristic, probabilistic classification of instances, and evaluation measures. Experimental evaluation of CN2-SD on 23 UCI data sets shows substantial reduction of the number of induced rules, increased rule coverage and rule significance, as well as slight improvements in terms of the area under ROC curve, when compared with the CN2 algorithm. Application of CN2-SD to a large traffic accident data set confirms these findings.
引用总数
200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320242192318212231263533304128374031251932252610
学术搜索中的文章
N Lavrac, B Kavsek, P Flach, L Todorovski - J. Mach. Learn. Res., 2004
N Lavrac, P Flach, B Kavsek, L Todorovski - Proceedings of the 2nd international workshop on …, 2002