作者
Rizka Purwanto, Arindam Pal, Alan Blair, Sanjay Jha
发表日期
2020/6/29
研讨会论文
2020 IEEE Conference on Communications and Network Security (CNS)
页码范围
1-9
出版商
IEEE
简介
Phishing has grown significantly in the past few years and is predicted to further increase in the future. The dynamics of phishing introduce challenges in implementing a robust phishing detection system and selecting features which can represent phishing despite the change of attack. In this study, we propose PhishZip which is a novel phishing detection approach using a compression algorithm to perform website classification and demonstrate a systematic way to construct the word dictionaries for the compression models using word occurrence likelihood analysis. PhishZip outperforms the use of best-performing HTML-based features in past studies, with a true positive rate of 80.04%. We also propose the use of compression ratio as a novel machine learning feature which significantly improves machine learning based phishing detection over previous studies. Using compression ratios as additional features, the …
引用总数
学术搜索中的文章
R Purwanto, A Pal, A Blair, S Jha - 2020 IEEE Conference on Communications and …, 2020