查看文章

mlr.press 中的 [PDF]

On the effectiveness of the skew divergence for statistical language analysis.

作者

Lillian Lee

发表日期

2001

研讨会论文

Proceedings of AISTATS

页码范围

65--72

简介

Estimating word co-occurrence probabilities is a problem underlying many applications in statistical natural language processing. Distance-weighted (or similarityweighted) averaging has been shown to be a promising approach to the analysis of novel co-occurrences. Many measures of distributional similarity have been proposed for use in the distance-weighted averaging framework; here, we empirically study their stability properties, finding that similarity-based estimation appears to make more efficient use of more reliable portions of the training data. We also investigate properties of the skew divergence, a weighted version of the KullbackLeibler (KL) divergence; our results indicate that the skew divergence yields better results than the KL divergence even when the KL divergence is applied to more sophisticated probability estimates.

引用总数

被引用次数：238

200220032004200520062007200820092010201120122013201420152016201720182019202020212022202320246 11 14 13 17 13 10 10 16 12 12 13 7 16 10 14 8 3 4 9 6 6 4

学术搜索中的文章

On the effectiveness of the skew divergence for statistical language analysis

L Lee - International Workshop on Artificial Intelligence and …, 2001

被引用次数：238 相关文章所有 11 个版本