The Effects of Preprocessing on Turkish and English News Data
Abstract
Keywords
References
- [1] G. Salton, A. Wong, and C.-S. Yang, "A vector space model for automatic indexing". Communications of the ACM, 1975. 18(11): p. 613-620.
- [2] T. Joachims, "Text categorization with support vector machines: Learning with many relevant features". in European conference on machine learning. 1998. Springer.
- [3] Y. Yang, and J.O. Pedersen. "A comparative study on feature selection in text categorization." in ICML. 1997.
- [4] C. Lee, and G.G. Lee," Information gain and divergence-based feature selection for machine learning-based text categorization." Information processing & management, 2006. 42(1): p. 155-165.
- [5] S.R. Singh, H.A. Murthy, and T.A. Gonsalves, "Feature Selection for Text Classification Based on Gini Coefficient of Inequality. "Fsdm, 2010. 10: p. 76-85.
- [6] A. Rehman, K. Javed, and H.A. Babri, "Feature selection based on a normalized difference measure for text classification." Information Processing & Management, 2017. 53(2): p. 473-489.
- [7] A. Rehman, et al., "Selection of the most relevant terms based on a max-min ratio metric for text classification." Expert Systems with Applications, 2018. 114: p. 78-96.
- [8] Parlak, B. and A.K. Uysal, A novel filter feature selection method for text classification: Extensive Feature Selector. Journal of Information Science, 2021: p. 0165551521991037.
Details
Primary Language
English
Subjects
Computer Software , Software Engineering (Other)
Journal Section
Research Article
Authors
Bekir Parlak
*
0000-0001-8919-6481
Türkiye
Early Pub Date
April 28, 2023
Publication Date
April 30, 2023
Submission Date
November 21, 2022
Acceptance Date
March 30, 2023
Published in Issue
Year 2023 Volume: 6 Number: 1
Cited By
Graf Sinir Ağları ile İlişkisel Türkçe Metin Sınıflandırma
Journal of Polytechnic
https://doi.org/10.2339/politeknik.1423293NOVEL TERM WEIGHTING METHODS FOR TEXT CLASSIFICATION BASED ON ECONOMIC INEQUALITY METRICS
Eskişehir Technical University Journal of Science and Technology A - Applied Sciences and Engineering
https://doi.org/10.18038/estubtda.1784468A comparative study of linear and non-linear dimensionality reduction for opcode-frequency malware classification
Journal of Computer Virology and Hacking Techniques
https://doi.org/10.1007/s11416-026-00597-1
