Research Article

TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts

Volume: 7 Number: 3 December 31, 2024
EN

TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts

Abstract

In Turkish, correct spelling correction is crucial for effective communication and preserving the integrity of written text. The challenge lies in the complexity of Turkish morphology and spelling, which can lead to frequent and diverse spelling errors. This study presents a spelling checker adapted for Turkish by creating a new Turkish dataset. The proposed spelling checker model effectively captures both minor and major textual changes and can detect the error. Our findings show that the proposed spelling checker system provides high accuracy and reliability with 98.21% accuracy performance with the Symspell module in correcting Turkish texts. This study provides valuable information about the strengths and weaknesses of existing spelling checkers and contributes to the improvement of spelling correction tools for Turkish.

Keywords

Project Number

AR-22-087-0001

References

  1. Y. Chaabi and F. Ataa Allah, “Amazigh spell checker using Damerau-Levenshtein algorithm and N-gram,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 8, Part B, pp. 6116–6124, Sep. 2022, doi: 10.1016/j.jksuci.2021.07.015.
  2. V. J. Hodge and J. Austin, “A comparison of a novel neural spell checker and standard spell checking algorithms,” Pattern Recognition, vol. 35, no. 11, pp. 2571–2580, Nov. 2002, doi: 10.1016/S0031-3203(01)00174-1.
  3. R. Garfinkel, E. Fernandez, and R. Gopal, “Design of an interactive spell checker: Optimizing the list of offered words,” Decision Support Systems, vol. 35, no. 3, pp. 385–397, Jun. 2003, doi: 10.1016/S0167-9236(02)00115-X.
  4. M. Nejja and A. Yousfi, “The Context in Automatic Spell Correction,” Procedia Computer Science, vol. 73, pp. 109–114, Jan. 2015, doi: 10.1016/j.procs.2015.12.055.
  5. K. Sarıtaş, C. A. Öz, and T. Güngör, “A comprehensive analysis of static word embeddings for Turkish,” Expert Systems with Applications, vol. 252, p. 124123, Oct. 2024, doi: 10.1016/j.eswa.2024.124123.
  6. S. Demir and B. Topcu, “Graph-based Turkish text normalization and its impact on noisy text processing,” Engineering Science and Technology, an International Journal, vol. 35, p. 101192, Nov. 2022, doi: 10.1016/j.jestch.2022.101192.
  7. Y. B. Kaya and A. C. Tantuğ, “Effect of tokenization granularity for Turkish large language models,” Intelligent Systems with Applications, vol. 21, p. 200335, Mar. 2024, doi: 10.1016/j.iswa.2024.200335.
  8. Kukich K. Techniques for automatically correcting words in text. ACM computing surveys (CSUR). 1992 Dec 1;24(4):377-439.

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Early Pub Date

December 10, 2024

Publication Date

December 31, 2024

Submission Date

September 5, 2024

Acceptance Date

October 10, 2024

Published in Issue

Year 2024 Volume: 7 Number: 3

APA
Savci, P., & Daş, B. (2024). TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts. Sakarya University Journal of Computer and Information Sciences, 7(3), 404-415. https://doi.org/10.35377/saucis.7.87942.1544012
AMA
1.Savci P, Daş B. TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts. SAUCIS. 2024;7(3):404-415. doi:10.35377/saucis.7.87942.1544012
Chicago
Savci, Pinar, and Bihter Daş. 2024. “TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts”. Sakarya University Journal of Computer and Information Sciences 7 (3): 404-15. https://doi.org/10.35377/saucis.7.87942.1544012.
EndNote
Savci P, Daş B (December 1, 2024) TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts. Sakarya University Journal of Computer and Information Sciences 7 3 404–415.
IEEE
[1]P. Savci and B. Daş, “TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts”, SAUCIS, vol. 7, no. 3, pp. 404–415, Dec. 2024, doi: 10.35377/saucis.7.87942.1544012.
ISNAD
Savci, Pinar - Daş, Bihter. “TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts”. Sakarya University Journal of Computer and Information Sciences 7/3 (December 1, 2024): 404-415. https://doi.org/10.35377/saucis.7.87942.1544012.
JAMA
1.Savci P, Daş B. TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts. SAUCIS. 2024;7:404–415.
MLA
Savci, Pinar, and Bihter Daş. “TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts”. Sakarya University Journal of Computer and Information Sciences, vol. 7, no. 3, Dec. 2024, pp. 404-15, doi:10.35377/saucis.7.87942.1544012.
Vancouver
1.Pinar Savci, Bihter Daş. TurkishLex:Development of a Context-Aware Spell Checker for Detecting and Correcting Spelling Errors in Turkish Texts. SAUCIS. 2024 Dec. 1;7(3):404-15. doi:10.35377/saucis.7.87942.1544012

Cited By

 

INDEXING & ABSTRACTING & ARCHIVING

 

31045 31044   ResimLink - Resim Yükle  31047 

31043 28939 28938 34240
 

 

29070    The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License