Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.11889/8279
Title: | Identifying spam e-mail based-on statistical header features and sender behavior | Authors: | Qaroush, Aziz Khater, Ismail M. Washaha, Mahdi |
Keywords: | Spam e-mails - Prevention;Computer networks - Security measures;Identifying Spam E-mail;Spam Filtering;Classification;Machine learning | Issue Date: | 2012 | Publisher: | ACM International Conference Proceeding Series | Abstract: | Email Spam filtering still a sophisticated and challenging problem as long as spammers continue developing new methods and techniques that are being used in their campaigns to defeat and confuse email spam filtering process. Moreover, utilizing email header information imposing additional challenges in classifying emails because the header information can be easily spoofed by spammers. Also, in recent years, spam has become a major problem at social, economical, political, and organizational levels because it decreases the employee productivity and causes traffic congestions in networks. In this paper, we present a powerful and useful email header features by utilizing the header session messages based on publicly datasets. Then, we apply many machine learning-based classifiers on the extracted header features to show the power of the extracted header features in filtering spam and ham messages by evaluating and comparing classifiers performance. In experiment stage, we apply the following classifiers: Random Forest (RF), C4.5 Decision Tree (J48), Voting Feature Intervals (VFI), Random Tree (RT), REPTree (REPT), Bayesian Network (BN), and Naïve Bayes (NB). The experimental results show that the RF classifier has the best performance with an accuracy, precision, recall, F-measure of 99.27%, 99.40%, 99.50%, and 99.50% when all mentioned features are used included the trust feature. | URI: | http://hdl.handle.net/20.500.11889/8279 | DOI: | 10.1145/2381716.2381863 |
Appears in Collections: | Fulltext Publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Identifying spam e-mail based-on statistical header features and sender behavior.pdf | 1.01 MB | Adobe PDF | View/Open |
Page view(s)
8
checked on Jan 20, 2024
Download(s)
3
checked on Jan 20, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.