Skip to main navigation Skip to search Skip to main content

An Empirical Study on Email Classification Using Supervised Machine Learning in Real Environments

  • Wenjuan Li*
  • , Weizhi Meng
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Spam emails are considered as one of the biggest challenges for the Internet. Thus email classification, which aims to correctly classify legitimate and spam emails, becomes an important topic for both industry and academia. To achieve this goal, machine learning techniques, especially supervised machine learning algorithms, have been extensively applied to this field. In literature, several studies reveal that supervised machine learning (SML) suffers from some limitations such as performance fluctuation, hence many works start focusing on designing more complex algorithms. However, we identify that most existing research efforts are based on datasets, while more research should be conducted to investigate the performance of SML in real environments. In this paper, we thus perform an empirical study with three different environments and over 1,000 users regarding this issue. In the study, we find that SML classifiers like decision tree and SVMs are acceptable by users in real email classification. In addition, we discuss promising directions and provide new insights in this area.
Original languageEnglish
Title of host publication2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC)
PublisherIEEE
Pages7438-7443
ISBN (Electronic)978-1-4673-6432-4
DOIs
Publication statusPublished - 2015
EventIEEE International Conference on Communications (ICC) - London, Malaysia
Duration: 8 Jun 201512 Jun 2015

Publication series

NameIEEE International Conference on Communications
ISSN (Print)1550-3607

Conference

ConferenceIEEE International Conference on Communications (ICC)
PlaceMalaysia
CityLondon
Period8/06/1512/06/15

Bibliographical note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Research Keywords

  • Email Classification
  • Spam Detection
  • Supervised Machine Learning
  • Empirical Study
  • SPAM

Fingerprint

Dive into the research topics of 'An Empirical Study on Email Classification Using Supervised Machine Learning in Real Environments'. Together they form a unique fingerprint.

Cite this