Short Messaging Service Spam Detection Model Using Natural Language Processing and Deep Learning Techniques

Olatunde A. AKANO; Wariz A. ISMAEL; Ayomikun A. AWOSEYI; Femi AYO; Ifeoluwa M. OLANIYI; Jide E.T. AKINSOLA

doi:10.33003/j1kx8695

Authors

Olatunde A. AKANO

Department of Computer Sciences, Abiola Ajimobi Technical University, Ibadan, Oyo State, Nigeria

Wariz A. ISMAEL

Department of Computer Sciences, Abiola Ajimobi Technical University, Ibadan, Oyo State, Nigeria

Ayomikun A. AWOSEYI

Department of Computer Sciences, Abiola Ajimobi Technical University, Ibadan, Oyo State, Nigeria

Femi AYO

Department of Mathematical Sciences, Olabisi Onabanjo University, Ago-Iwoye, Ogun State, Nigeria

Ifeoluwa M. OLANIYI

Department of Computer Sciences, Abiola Ajimobi Technical University, Ibadan, Oyo State, Nigeria

Jide E.T. AKINSOLA

Department of Computer Sciences, Abiola Ajimobi Technical University, Ibadan, Oyo State, Nigeria

DOI:

https://doi.org/10.33003/j1kx8695

Keywords:

Deep learning, machine learning, natural language processing, short message service, SMS spam.

Abstract

Unsolicited Short Message Service (SMS) messages, or SMS spam, pose a major challenge in mobile communication. These unwanted messages compromise user privacy and can lead to data breaches or financial risks. To address this growing concern, this study applies deep learning and Natural Language Processing (NLP) techniques to detect SMS spam, with the aim of improving the security and usability of mobile communication platforms. The system was developed using Python 3.10 in the Google Colaboratory environment. The SMS Spam Collection dataset, consisting of 5,574 labelled messages, was preprocessed through tokenization, stopword removal, and lemmatization, and then transformed using Term Frequency-Inverse Document Frequency (TF-IDF) vectorization. Three deep learning models were implemented for classification: Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) networks, and Recurrent Neural Networks (RNN). The models were trained and evaluated using accuracy, precision, recall, F1-score, Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE). Among the models tested, the CNN performed best, achieving an accuracy of 96.90 percent, a precision of 0.9692, a recall of 0.9690, and an F1-score of 0.9691. It also had the lowest error rates of the three models. The results support the effectiveness of CNNs for SMS spam detection, particularly when combined with rigorous text preprocessing. For future work, the study suggests applying federated learning to SMS spam detection modelling.
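To make the evaluation metrics named in the abstract concrete, the following is a minimal sketch of how they can be computed for a binary spam/ham classifier using only the Python standard library. It is not the authors' code. The function name `binary_metrics` and the label lists are hypothetical, the labels are illustrative values rather than data from the study, and the sketch assumes hard 0/1 predictions with precision and recall reported for the spam class only (the abstract does not state how these were averaged across classes).

```python
import math


def binary_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall, F1, MSE, RMSE and MAE.

    Assumes hard labels where 1 = spam and 0 = ham (an assumption of
    this sketch, not something stated in the abstract).
    """
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    pairs = list(zip(y_true, y_pred))
    n = len(pairs)

    # Confusion-matrix counts, treating spam (1) as the positive class.
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)

    accuracy = (tp + tn) / n
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )

    # Error metrics computed directly on the 0/1 labels.
    mse = sum((t - p) ** 2 for t, p in pairs) / n
    mae = sum(abs(t - p) for t, p in pairs) / n
    rmse = math.sqrt(mse)

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mse": mse,
        "rmse": rmse,
        "mae": mae,
    }


# Hypothetical labels for ten messages (illustrative only).
y_true = [1, 0, 0, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 0, 0, 1, 1, 0, 1, 0]

metrics = binary_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

One property worth noting: when predictions are hard 0/1 labels, every error contributes exactly 1 to both the squared and the absolute error, so MSE and MAE both equal 1 minus accuracy. The error metrics carry additional information only if they are computed on predicted probabilities, and the abstract does not say which form was used.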

Published

08-08-2025

Issue

Vol. 1 No. 2 (2025): December 2025

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

How to Cite

[1]

Olatunde A. AKANO, Wariz A. ISMAEL, Ayomikun A. AWOSEYI, Femi AYO, Ifeoluwa M. OLANIYI, and Jide E.T. AKINSOLA, “Short Messaging Service Spam Detection Model Using Natural Language Processing and Deep Learning Techniques”, FJET, vol. 1, no. 2, pp. 35–47, Aug. 2025, doi: 10.33003/j1kx8695.
