Comparative Analysis of Machine Learning Algorithms for the Detection and Classification of Suspicious Emails
- Authors
-
-
Shamsuddeen J. AHMAD
Department of Computer Science, Kaduna polytechnic, Kaduna, Nigeria
Author
-
Saifullahi S. SADI
Department of Cyber Security, Nigerian Defence Academy, Kaduna, Nigeria
Author
-
Muhammad M. AHMAD
Department of Secure Computing, Kaduna State University, Zaria, Kaduna State, Nigeria
Author
-
Abdullahi D. UMAR
Department of Secure Computing, Kaduna State University, Zaria, Kaduna State, Nigeria
Author
-
Shamsuddeen USMAN
Department of Computer Science, Nuhu Bamalli Polytechnic, Zaria, Kaduna State, Nigeria
Author
-
- Keywords:
- Machine Learning, Random Forest, Support Vector Machine, Artificial Neural Network, Artificial Intelligence, Term Frequency-Inverse Document Frequency.
- Abstract
-
The exponential growth of corporate email communications poses significant challenges for digital forensic investigations because manual analysis is slow, resource-intensive, and error-prone. This study compares three machine learning algorithms: Random Forest, Support Vector Machine (SVM), and Artificial Neural Network (ANN) for the detection and classification of suspicious emails. A publicly available dataset from the GitHub repository that comprises 60,000 instances was extracted. The methodology involved preprocessing the dataset by encoding categorical features and converting email body content into numerical representations using TF-IDF vectorisation, and SMOTE was used to balance the dataset. The dataset was then split into 80% (48,000 instances) for training and 20% (12,000 instances) for testing, and each classifier was trained and evaluated using performance metrics including accuracy, precision, recall, F1-score, and AUC. The result indicates that ANN achieved the highest performance (accuracy: 99.86%, AUC: 1.00), with balanced precision and recall across “Evidence” and “Non-Evidence” classes. Random Forest also performed strongly (accuracy: 99.92%, AUC: 1.00) with high interpretability, while SVM (accuracy: 98.92%, AUC: 1.00) showed strong precision but lower recall for “Non-Evidence” emails. ANN’s superior performance is attributed to its ability to model complex patterns and handle class imbalance effectively. The findings indicate that ANN demonstrates the highest performance in classifying suspicious emails, showing superior accuracy, efficiency, and scalability.
- References
- Downloads
- Published
- 24-11-2025
- Section
- Articles
- License
-
Copyright (c) 2025 FUDMA Journal of Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
Similar Articles
- Caleb A. ABORISADE, Jide E.T. AKINSOLA, Ifeoluwa M. OLANIYI, Fathia O. ONIPEDE, Emmanuel A. OLAJUBU, Ganiyu A. ADEROUNMU, Machine Learning-Based Polycystic Ovary Syndrome Generative Modelling via Ensemble Learning and Neural Networks for Infertility Prediction , FUDMA Journal of Engineering and Technology: Vol. 1 No. 1 (2025): July 2025
- Olatunde A. AKANO, Wariz A. ISMAEL, Ayomikun A. AWOSEYI, Femi AYO, Ifeoluwa M. OLANIYI, Jide E.T. AKINSOLA, Short Messaging Service Spam Detection Model Using Natural Language Processing and Deep Learning Techniques , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Adekunle O. ADEWOLE, Ayodeji O. ARIYO, Development of an Edge-Enabled IoT Smart Energy Meter with Artificial Intelligence (AI)-Based Load Prediction for Device-Level Monitoring , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Abubakar L. IBRAHEEM, John K. ALHASSAN, Noel D. MOSES, Suleiman AHMAD, Development of Ensemble SVM–LSTM Model for Phishing Website Detection , FUDMA Journal of Engineering and Technology: Vol. 2 No. 1 (2026): June 2026
- Ukange N. SYBIL, Hadiza A. UMAR, Ogar M. OKO, Habeebah A. KAKUDI, Usman MAHMUD, Alex AARON, Leveraging Quantum Machine Learning for Early Ovarian Cancer Diagnosis , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Olawale J. OLALUYI, Johnson O. ADEOGO, Adeniyi O. AJIBOYE, Mayowa O. ORESELU, Olarewaju T. OGINNI, Application of Machine Learning for Enhancing Fake Logo Detection , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Ismaila MAHMUD, Mahmud MUSTAPHA, Sulaiman H. SULAIMAN, Ibrahim ABDULWAHAB, Ibrahim A. SHEHU, Aminu J. ALIYU, Yusuf S. ABU, Nuraddeen A. ILIYASU, Solar Irradiance Forecast using Feed Forward Neural Network: A Case Study of Zaria Town , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Oluwasanmi S. ADANIGBO, Opeyemi O. ASAOLU, Adedayo A. SOBOWALE, Temidayo AKINDAHUNSI, Akinbayode A. ASAOLU, Intrusion Detection in Mobile Adhoc Networks: A Review of Signature-Based, Anomaly-Based, and Hybrid Approaches , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Umar A. IBRAHIM, Abdulra’uf G. SHARIFAI, Hybrid CNN Feature Fusion with Optimization for Precision Potato Leaf Disease Classification , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
- Yusuf M. YAHAYA, Factors Contributing to the Erosion of Freehand Sketching Competence in Technology Education: A Case Study of Bayero University, Kano , FUDMA Journal of Engineering and Technology: Vol. 1 No. 2 (2025): December 2025
You may also start an advanced similarity search for this article.
