作者
Manpreet Singh, Parminder Kaur Wadhwa, PW Sandhu
发表日期
2007/4/30
期刊
International Journal of Computer Science and Network Security
卷号
7
期号
4
页码范围
92-98
简介
To overcome the problem of exponentially increasing protein data, drug discoverers need efficient machine learning techniques to predict the functions of proteins which are responsible for various diseases in human body. The existing decision tree induction methodology C4. 5 uses the entropy calculation for best attribute selection. The proposed method develops a new decision tree induction technique in which uncertainty measure is used for best attribute selection. This is based on the study of priority based packages of SDFs (Sequence Derived Features). The present research work results the creation of better decision tree in terms of depth than the existing C4. 5 technique. The tree with greater depth ensures more number of tests before functional class assignment and thus results in more accurate predictions than the existing prediction technique. For the same test data, the percentage accuracy of the new HPF (Human Protein Function) predictor is 72% and that of the existing prediction technique is 44%.
引用总数
20082009201020112012201320142015201620172018201920202021212552121221
学术搜索中的文章
M Singh, PK Wadhwa, PW Sandhu - International Journal of Computer Science and …, 2007