作者
Shiven Sharma, Colin Bellinger, Bartosz Krawczyk, Osmar Zaiane, Nathalie Japkowicz
发表日期
2018/11/17
研讨会论文
2018 IEEE international conference on data mining (ICDM)
页码范围
447-456
出版商
IEEE
简介
The class imbalance problem is a pervasive issue in many real-world domains. Oversampling methods that inflate the rare class by generating synthetic data are amongst the most popular techniques for resolving class imbalance. However, they concentrate on the characteristics of the minority class and use them to guide the oversampling process. By completely overlooking the majority class, they lose a global view on the classification problem and, while alleviating the class imbalance, may negatively impact learnability by generating borderline or overlapping instances. This becomes even more critical when facing extreme class imbalance, where the minority class is strongly underrepresented and on its own does not contain enough information to conduct the oversampling process. We propose a novel method for synthetic oversampling that uses the rich information inherent in the majority class to synthesize …
引用总数
20192020202120222023202482221232111
学术搜索中的文章
S Sharma, C Bellinger, B Krawczyk, O Zaiane… - 2018 IEEE international conference on data mining …, 2018