Adversarial Sampling Attacks Against Phishing Detection

Hossein Shirazi; Bruhadeshwar Bezawada; Indrakshi Ray; Charles Anderson

doi:10.1007/978-3-030-22479-0_5

Conference Papers, Year: 2019

Hossein Shirazi, CSU - Colorado State University [Fort Collins]
Bruhadeshwar Bezawada, MEC - Mahindra Ecole Centrale [Hyderabad]
Indrakshi Ray, CSU - Colorado State University [Fort Collins]
Charles Anderson, CSU - Colorado State University [Fort Collins]

Abstract

Phishing websites trick users into believing that they are interacting with a legitimate website and thereby capture sensitive information, such as user names, passwords, credit card numbers, and other personal information. Machine learning appears to be a promising technique for distinguishing phishing websites from legitimate ones. However, machine learning approaches are susceptible to adversarial learning techniques, which attempt to degrade the accuracy of a trained classifier model. In this work, we investigate the robustness of machine-learning-based phishing detection in the face of adversarial learning techniques. We propose a simple but effective approach to simulate attacks by generating adversarial samples through direct feature manipulation. We assume that the attacker has limited knowledge of the features, the learning models, and the datasets used for training. We conducted experiments on four datasets that are publicly available on the Internet. Our experiments reveal that phishing detection mechanisms are vulnerable to adversarial learning techniques. Specifically, the identification rate for phishing websites dropped to 70% when a single feature was manipulated. When four features were manipulated, the identification rate dropped to zero percent. This result means that any phishing sample that a classifier model would otherwise have detected correctly can bypass the classifier by changing at most four feature values: a small effort for an attacker for such a big reward. We define the concept of vulnerability level for each dataset, which measures the number of features that can be manipulated and the cost of each manipulation. Such a metric will allow us to compare multiple defense models.
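The idea of evading a classifier by direct feature manipulation can be illustrated with a minimal sketch. This is not the authors' algorithm or data: the linear toy classifier, the binary features, the names `detect`, `evades`, and `identification_rate`, and all values below are hypothetical and chosen only to show how an identification rate can fall as the attacker is allowed to change more features.

```python
from itertools import combinations

def detect(sample, weights, threshold):
    """Toy linear classifier (an assumption): flag as phishing when the weighted score reaches the threshold."""
    return sum(w * x for w, x in zip(weights, sample)) >= threshold

def evades(sample, legit_values, k, weights, threshold):
    """True if overwriting at most k features with legitimate-looking values avoids detection."""
    for r in range(1, k + 1):
        for idxs in combinations(range(len(sample)), r):
            candidate = list(sample)
            for i in idxs:
                candidate[i] = legit_values[i]  # direct feature manipulation
            if not detect(candidate, weights, threshold):
                return True
    return False

def identification_rate(phishing, legit_values, k, weights, threshold):
    """Fraction of originally detected phishing samples that stay detected when up to k features change."""
    detected = [s for s in phishing if detect(s, weights, threshold)]
    if not detected:
        return 0.0
    survivors = [s for s in detected
                 if not evades(s, legit_values, k, weights, threshold)]
    return len(survivors) / len(detected)

# Hypothetical toy data: four binary features, 1 = suspicious, 0 = benign.
weights = [1, 1, 1, 1]
threshold = 2
legit_values = [0, 0, 0, 0]
phishing = [(1, 1, 0, 0), (1, 1, 1, 0), (1, 1, 1, 1), (0, 1, 1, 0)]

for k in range(4):
    rate = identification_rate(phishing, legit_values, k, weights, threshold)
    print(f"features manipulated <= {k}: identification rate = {rate:.2f}")
```

In this toy setting the rate falls from 1.00 with no manipulation to 0.00 once three features may be changed. The exhaustive search over feature subsets is only practical for small feature counts, and it ignores the per-feature manipulation cost that the abstract's vulnerability level takes into account.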

Domains

Computer Science [cs]

Main file

480962_1_En_5_Chapter.pdf (1.14 MB)

Origin: Files produced by the author(s)
Licence: CC BY 4.0 - Attribution

https://inria.hal.science/hal-02384598

Submitted on: Thursday, November 28, 2019, 2:25:53 PM

Last modification on: Tuesday, November 19, 2024, 3:02:03 PM

Long-term archiving on: Saturday, February 29, 2020, 4:06:21 PM

Dates and versions

hal-02384598 , version 1 (28-11-2019)

Licence

CC BY 4.0 - Attribution

Identifiers

HAL Id : hal-02384598 , version 1
DOI : 10.1007/978-3-030-22479-0_5

Cite

Hossein Shirazi, Bruhadeshwar Bezawada, Indrakshi Ray, Charles Anderson. Adversarial Sampling Attacks Against Phishing Detection. 33rd IFIP Annual Conference on Data and Applications Security and Privacy (DBSec), Jul 2019, Charleston, SC, United States. pp.83-101, ⟨10.1007/978-3-030-22479-0_5⟩. ⟨hal-02384598⟩
