作者
Jennifer Neville, David Jensen, Brian Gallagher
发表日期
2003/11/19
研讨会论文
Third IEEE International Conference on Data Mining
页码范围
609-612
出版商
IEEE
简介
We present the relational Bayesian classifier (RBC), a modification of the simple Bayesian classifier (SBC) for relational data. There exist several Bayesian classifiers that learn predictive models of relational data, but each uses a different estimation technique for modelling heterogeneous sets of attribute values. The effects of data characteristics on estimation have not been explored. We consider four simple estimation techniques and evaluate them on three real-world data sets. The estimator that assumes each multiset value is independently drawn from the same distribution (INDEPVAL) achieves the best empirical results. We examine bias and variance tradeoffs over a range of data sets and show that INDEPVAL's ability to model more multiset information results in lower bias estimates and contributes to its superior performance.
引用总数
20032004200520062007200820092010201120122013201420152016201720182019202020215211101013101117151088543134
学术搜索中的文章
J Neville, D Jensen, B Gallagher - Third IEEE International Conference on Data Mining, 2003