Projects per year
Abstract
Network and cyberspace security remains challenging in our increasingly connected worlds, and risks include those resulting from malicious cyber activities such as phishing. To combat phishing, this article proposes a phishing website detection model based on tiny-Bert stacking. The core concept is to use tiny-Bert to extract features from website URL strings, and learn the semantic features and long-range dependent features in URLs. Then, we build a Stacking algorithm-based classifier which includes four basic learners among which, CatBoost, XGBoost, and LightGBM are the first-level learners, and GBDT is the second-level learner. This detection model can identify phishing websites without manual feature extraction, and the basic learners of Stacking can compensate each other for errors in the classification process, improve generalization, and achieve higher accuracy. The proposed model is evaluated using a data set based on real phishing websites. Compared to the state-of-the-art, the results show that the proposed model has an accuracy rate of up to 99.14%, a recall rate of up to 99.13%, and is more stable. © 2023 IEEE.
Original language | English |
---|---|
Pages (from-to) | 2236-2243 |
Journal | IEEE Internet of Things Journal |
Volume | 11 |
Issue number | 2 |
Online published | 4 Jul 2023 |
DOIs | |
Publication status | Published - 15 Jan 2024 |
Research Keywords
- Blocklists
- Feature extraction
- Phishing
- Phishing Detection
- Stacking
- tiny-Bert
- Uniform resource locators
- URL Detection
- Visualization
- Web pages
Fingerprint
Dive into the research topics of 'A Method for Detecting Phishing Websites Based on Tiny-Bert Stacking'. Together they form a unique fingerprint.Projects
- 1 Active
-
GRF: Massive Access over an OFDM Platform
CHAN, C. H. S. (Principal Investigator / Project Coordinator) & LI, P. (Co-Investigator)
1/01/22 → …
Project: Research