作者
Majid Zaman, Sameer Kaul, Muheet Ahmed
发表日期
2020
期刊
International Journal of Advanced Computer Science and Applications
卷号
11
期号
5
出版商
Science and Information (SAI) Organization Limited
简介
The historical geographical data of Kashmir province is spread across two disparate files having attributes of Maximum Temperature, Minimum Temperature, Humidity measured at 12 AM, Humidity measured at 3 PM, rainfall besides auxiliary parameters like date, year etc. The parameters Maximum Temperature, Minimum Temperature, Humidity measured at 12 AM, Humidity measured at 3 PM are continuous in nature and here, in this study, we applied Information Gain and Gini Index on these attributes to convert continuous data into discrete values, their after we compare and evaluate the generated results. Of the four attributes, two have same results for Information Gain and Gini Index; one attribute has overlapping results while as only one attribute has conflicting results for Information Gain and Gini Index. Subsequently, continuous valued attributes are converted into discrete values using Gini index. Irrelevant attributes are not considered and auxiliary attributes are labeled accordingly. Consequently, the data set is ready for the application of machine learning (decision tree) algorithms.
引用总数
2020202120222023202414173
学术搜索中的文章