作者
Sergio Ramírez‐Gallego, Salvador García, Héctor Mouriño‐Talín, David Martínez‐Rego, Verónica Bolón‐Canedo, Amparo Alonso‐Betanzos, José Manuel Benítez, Francisco Herrera
发表日期
2016/1
来源
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
卷号
6
期号
1
页码范围
5-21
出版商
Wiley Periodicals, Inc
简介
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledge discovery and data mining. The purpose of attribute discretization is to find concise data representations as categories which are adequate for the learning task retaining as much information in the original continuous attribute as possible. In this article, we present an updated overview of discretization techniques in conjunction with a complete taxonomy of the leading discretizers. Despite the great impact of discretization as data preprocessing technique, few elementary approaches have been developed in the literature for Big Data. The purpose of this article is twofold: a comprehensive taxonomy of discretization techniques to help the practitioners in the use of the algorithms is presented; the article aims is to demonstrate that standard discretization methods can be parallelized in Big Data platforms such as Apache …
引用总数
20162017201820192020202120222023202482123292719232314
学术搜索中的文章
S Ramírez‐Gallego, S García, H Mouriño‐Talín… - Wiley Interdisciplinary Reviews: Data Mining and …, 2016