Home           Contact us           FAQs           
 
   Journal Page   |   Aims & Scope   |   Author Guideline   |   Editorial Board   |   Search
    Abstract
2014 (Vol. 7, Issue: 3)
Article Information:

A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification

Nadir Omer Fadl Elssied, Othman Ibrahim and Ahmed Hamza Osman
Corresponding Author:  Nadir Omer Fadl Elssied 

Key words:  Feature selection, machine learning, one-way ANOVA F-test, spam detection, SVM, , ,
Vol. 7 , (3): 625-638
Submitted Accepted Published
May 01, 2013 June 22, 2013 January 20, 2014
Abstract:

Spam is commonly defined as unwanted e-mails and it became a global threat against e-mail users. Although, Support Vector Machine (SVM) has been commonly used in e-mail spam classification, yet the problem of high data dimensionality of the feature space due to the massive number of e-mail dataset and features still exist. To improve the limitation of SVM, reduce the computational complexity (efficiency) and enhancing the classification accuracy (effectiveness). In this study, feature selection based on one-way ANOVA F-test statistics scheme was applied to determine the most important features contributing to e-mail spam classification. This feature selection based on one-way ANOVA F-test is used to reduce the high data dimensionality of the feature space before the classification process. The experiment of the proposed scheme was carried out using spam base well-known benchmarking dataset to evaluate the feasibility of the proposed method. The comparison is achieved for different datasets, categorization algorithm and success measures. In addition, experimental results on spam base English datasets showed that the enhanced SVM (FSSVM) significantly outperforms SVM and many other recent spam classification methods for English dataset in terms of computational complexity and dimension reduction.
Abstract PDF HTML
  Cite this Reference:
Nadir Omer Fadl Elssied, Othman Ibrahim and Ahmed Hamza Osman, 2014. A Novel Feature Selection Based on One-Way ANOVA F-Test for E-Mail Spam Classification.  Research Journal of Applied Sciences, Engineering and Technology, 7(3): 625-638.
    Advertise with us
 
ISSN (Online):  2040-7467
ISSN (Print):   2040-7459
Submit Manuscript
   Current Information
   Sales & Services
   Contact Information
  Executive Managing Editor
  Email: admin@maxwellsci.com
  Publishing Editor
  Email: support@maxwellsci.com
  Account Manager
  Email: faisalm@maxwellsci.com
  Journal Editor
  Email: admin@maxwellsci.com
  Press Department
  Email: press@maxwellsci.com
Home  |  Contact us  |  About us  |  Privacy Policy
Copyright © 2009. MAXWELL Science Publication, a division of MAXWELLl Scientific Organization. All rights reserved