Research Article

The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model

Volume: 7 Number: 1 April 30, 2024
EN

The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model

Abstract

Image-to-text generation contributes significantly across various domains such as entertainment, communication, commerce, security, and education by establishing a connection between visual and textual content through the creation of explanations. This process aims to transform image data into meaningful text, enhancing content accessibility, comprehensibility, and processability. Hence, advancements and studies in this field hold paramount importance. This study focuses on how the fusion of the Sequence-to-Sequence (Seq2seq) model and attention mechanism enhances the generation of more meaningful captions from images. Experiments conducted on the Flickr8k dataset highlight the Seq2seq model's capacity to produce captions in alignment with reference sentences. Leveraging the dynamic focus of the attention mechanism, the model effectively captures detailed aspects of images.

Keywords

References

  1. [1] T. Alqahtani et al., The emergent role of artificial intelligence, natural learning processing, and large language models in higher education and research," Research in Social and Administrative Pharmacy, vol. 19, no. 8, pp. 1236–1242, Aug. 2023, doi: 10.1016/j.sapharm.2023.05.016.
  2. [2] J. J. Cavallo, I. de Oliveira Santo, J. L. Mezrich, and H. P. Forman, Clinical Implementation of a Combined Artificial Intelligence and Natural Language Processing Quality Assurance Program for Pulmonary Nodule Detection in the Emergency Department Setting,Journal of the American College of Radiology, vol. 20, no. 4, pp. 438–445, Apr. 2023, doi: 10.1016/j.jacr.2022.12.016.
  3. [3] J. Doe and A. Smith, "Recent Advances in Image Captioning: A Comprehensive Survey," IEEE Transactions on Artificial Intelligence, vol. 7, no. 3, pp. 210-225, 2022.
  4. [4] M. Johnson, B. Brown, and C. Wilson, "Innovative Approaches for Image Caption Generation using Attention Mechanisms," Proceedings of the 35th Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2021, pp. 750-760.
  5. [5] S. Kim and E. Lee, "Enhancing Image Captioning Performance through Multimodal Fusion Techniques," IEEE Transactions on Multimedia, vol. 25, no. 6, pp. 1350-1365, 2020.
  6. [6] L. Wang, H. Chen, and X. Zhang, "Leveraging Transformers for Improved Image Captioning," Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 2018, pp. 240-255.
  7. [7] R. Patel and S. Gupta, "Attention Is All You Need: Exploring Self-Attention Mechanisms in Image Captioning," Proceedings of the 33rd AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 2019, pp. 1800-1810.
  8. [8] C. Yue, W. Hu, H. Song, and W. Kang, Thangka image caption method based on attention mechanism and encoder-decoder architecture in 2022 7th International Conference on Intelligent Computing and Signal Processing (ICSP), Nis. 2022, pp. 1752-1756. doi: 10.1109/ICSP54964.2022.9778737.

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Early Pub Date

April 27, 2024

Publication Date

April 30, 2024

Submission Date

August 9, 2023

Acceptance Date

March 26, 2024

Published in Issue

Year 1970 Volume: 7 Number: 1

APA
Karaca, Z., & Daş, B. (2024). The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model. Sakarya University Journal of Computer and Information Sciences, 7(1), 92-102. https://doi.org/10.35377/saucis...1339931
AMA
1.Karaca Z, Daş B. The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model. SAUCIS. 2024;7(1):92-102. doi:10.35377/saucis.1339931
Chicago
Karaca, Zeynep, and Bihter Daş. 2024. “The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach With Neural Network-Based Seq2seq Model”. Sakarya University Journal of Computer and Information Sciences 7 (1): 92-102. https://doi.org/10.35377/saucis. 1339931.
EndNote
Karaca Z, Daş B (April 1, 2024) The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model. Sakarya University Journal of Computer and Information Sciences 7 1 92–102.
IEEE
[1]Z. Karaca and B. Daş, “The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model”, SAUCIS, vol. 7, no. 1, pp. 92–102, Apr. 2024, doi: 10.35377/saucis...1339931.
ISNAD
Karaca, Zeynep - Daş, Bihter. “The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach With Neural Network-Based Seq2seq Model”. Sakarya University Journal of Computer and Information Sciences 7/1 (April 1, 2024): 92-102. https://doi.org/10.35377/saucis. 1339931.
JAMA
1.Karaca Z, Daş B. The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model. SAUCIS. 2024;7:92–102.
MLA
Karaca, Zeynep, and Bihter Daş. “The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach With Neural Network-Based Seq2seq Model”. Sakarya University Journal of Computer and Information Sciences, vol. 7, no. 1, Apr. 2024, pp. 92-102, doi:10.35377/saucis. 1339931.
Vancouver
1.Zeynep Karaca, Bihter Daş. The Role of Attention Mechanism in Generating Image Captions: An Innovative Approach with Neural Network-Based Seq2seq Model. SAUCIS. 2024 Apr. 1;7(1):92-102. doi:10.35377/saucis. 1339931

 

INDEXING & ABSTRACTING & ARCHIVING

 

31045 31044   ResimLink - Resim Yükle  31047 

31043 28939 28938 34240
 

 

29070    The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License