Research Article

On Term Weighting for Spam SMS Filtering

Volume: 3 Number: 3 December 30, 2020
EN

On Term Weighting for Spam SMS Filtering

Abstract

Due to rapid development of the technology, the usage of mobile telephones and short message services (SMS) have become widespread. Thus, the number of spam SMS messages has dramatically increased and the significance of identifying and filtering of suchlike messages raised. Moreover, since they have also risk to steal users’ personal information; the problem of identifying and filtering of Spam SMS messages stays popular in terms of also information and data security. In this study, the classification performances of five different term weighting methods on three different datasets containing SMS messages categorized as Spam and legitimate are compared by using two classifiers for corresponding problem. The results obtained showed that reasonable weighting of SMS contents plays an important role in identifying of spam SMS messages. On the other hand, it can be expressed that real classification potential of term weighting schemes reflected betterly the with feature vectors created by using fifty and higher number of terms on especially Turkish and English SMS message datasets. In addition, it has been observed that value ranges of the classification results of obtained from term weighting methods on Turkish SMS message dataset is wider for than ones obtained in English SMS message datasets.

Keywords

References

  1. H. Faris, I. Aljarah, and B. Al-Shboul, "A hybrid approach based on particle swarm optimization and random forests for e-mail spam filtering," in International Conference on Computational Collective Intelligence, 2016: Springer, pp. 498-508.
  2. R. Varghese and K. Dhanya, "Efficient feature set for spam Email filtering," in 2017 IEEE 7th International Advance Computing Conference (IACC), 2017: IEEE, pp. 732-737.
  3. M. Diale, T. Celik, and C. Van Der Walt, "Unsupervised feature learning for spam email filtering," Computers & Electrical Engineering, vol. 74, pp. 89-104, 2019.
  4. M. A. Shafi’I et al., "A review on mobile SMS spam filtering techniques," IEEE Access, vol. 5, pp. 15650-15666, 2017.
  5. K. O. Kawade and K. S. Oza, "Content-based SMS spam filtering using machine learning technique," International Journal of Computer Engineering and Applications, vol. 7, p. 4, 2018.
  6. T. H. Apandi and C. A. Sugianto, "Analisis Komparasi Machine Learning Pada Data Spam Sms," Jurnal TEDC, vol. 12, no. 1, pp. 58-62, 2019.
  7. S. J. Delany, M. Buckley, and D. Greene, "SMS spam filtering: Methods and data," Expert Systems with Applications, vol. 39, no. 10, pp. 9899-9908, 2012.
  8. J. M. Gómez Hidalgo, G. C. Bringas, E. P. Sánz, and F. C. García, "Content based SMS spam filtering," in Proceedings of the 2006 ACM Symposium on Document Engineering, 2006, pp. 107-114.

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Publication Date

December 30, 2020

Submission Date

May 11, 2020

Acceptance Date

November 14, 2020

Published in Issue

Year 2020 Volume: 3 Number: 3

APA
Dogan, T. (2020). On Term Weighting for Spam SMS Filtering. Sakarya University Journal of Computer and Information Sciences, 3(3), 239-249. https://doi.org/10.35377/saucis.03.03.735463
AMA
1.Dogan T. On Term Weighting for Spam SMS Filtering. SAUCIS. 2020;3(3):239-249. doi:10.35377/saucis.03.03.735463
Chicago
Dogan, Turgut. 2020. “On Term Weighting for Spam SMS Filtering”. Sakarya University Journal of Computer and Information Sciences 3 (3): 239-49. https://doi.org/10.35377/saucis.03.03.735463.
EndNote
Dogan T (December 1, 2020) On Term Weighting for Spam SMS Filtering. Sakarya University Journal of Computer and Information Sciences 3 3 239–249.
IEEE
[1]T. Dogan, “On Term Weighting for Spam SMS Filtering”, SAUCIS, vol. 3, no. 3, pp. 239–249, Dec. 2020, doi: 10.35377/saucis.03.03.735463.
ISNAD
Dogan, Turgut. “On Term Weighting for Spam SMS Filtering”. Sakarya University Journal of Computer and Information Sciences 3/3 (December 1, 2020): 239-249. https://doi.org/10.35377/saucis.03.03.735463.
JAMA
1.Dogan T. On Term Weighting for Spam SMS Filtering. SAUCIS. 2020;3:239–249.
MLA
Dogan, Turgut. “On Term Weighting for Spam SMS Filtering”. Sakarya University Journal of Computer and Information Sciences, vol. 3, no. 3, Dec. 2020, pp. 239-4, doi:10.35377/saucis.03.03.735463.
Vancouver
1.Turgut Dogan. On Term Weighting for Spam SMS Filtering. SAUCIS. 2020 Dec. 1;3(3):239-4. doi:10.35377/saucis.03.03.735463

Cited By

 

INDEXING & ABSTRACTING & ARCHIVING

 

31045 31044   ResimLink - Resim Yükle  31047 

31043 28939 28938 34240
 

 

29070    The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License