Research Article

Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity

Volume: 7 Number: 1 April 30, 2024
EN

Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity

Abstract

For depression severity assessment, we systematically analyze a modular deep learning pipeline that uses speech transcriptions as input for depression severity prediction. Through our pipeline, we investigate the role of popular deep learning architectures in creating representations for depression assessment. Evaluation of the proposed architectures is performed on the publicly available Extended Distress Analysis Interview Corpus dataset (E-DAIC). Through the results and discussions, we show that informative representations for depression assessment can be obtained without exploiting the temporal dynamics between descriptive text representations. More specifically, temporal pooling of latent representations outperforms the state of the art, which employs recurrent architectures, by 8.8% in terms of Concordance Correlation Coefficient (CCC).

Keywords

Ethical Statement

It is declared that during the preparation process of this study, scientific and ethical principles were followed, and all the studies benefited from are stated in the bibliography.Since the data used in this research is publicly available, an ethics committee approval is not required / not applicable.

References

  1. [1] Hamilton M. A rating scale for depression. Journal of Neurology, Neurosurgery & Psychiatry, 23(1):56–62, 1960.
  2. [2] Kurt Kroenke, Tara Strine, Robert Spitzer, Janet Williams, Joyce Berry, and Ali Mokdad. The phq-8 as a measure of current depression in the general population. Journal of affective disorders, 114:163–73, 09 2008.
  3. [3] Amit Gupta, Priya Mathur, Shruti Bijawat, and Abhishek Dadheech. A novel work on analyzing stress and depression level of indian population during covid-19. Recent Advances in Computer Science and Communications, 13, 11 2020.
  4. [4] World Health Organization. Depression and other common mental disorders: global health estimates. Technical report, 2017. License: CC BY-NC-SA 3.0 IGO.
  5. [5] Jörg Zinken, Katarzyna Zinken, J. Clare Wilson, Lisa Butler, and Timothy Skinner. Analysis of syntax and word use to predict successful participation in guided self-help for anxiety and depression. Psychiatry Research, 179(2):181–186, 2010.
  6. [6] Stephanie Rude, Eva-Maria Gortner, and James Pennebaker. Language use of depressed and depression-vulnerable college students. Cognition Emotion - COGNITION EMOTION, 18:1121–1133, 12 2004.
  7. [7] Michael P. Caligiuri and Joel Ellwanger. Motor and cognitive aspects of motor retardation in depression. Journal of Affective Disorders, 57(1):83–93, 2000.
  8. [8] Psychomotor symptoms of depression. American Journal of Psychiatry, 154(1):4–17, 1997. PMID: 8988952.

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Early Pub Date

April 27, 2024

Publication Date

April 30, 2024

Submission Date

October 26, 2023

Acceptance Date

March 22, 2024

Published in Issue

Year 2024 Volume: 7 Number: 1

APA
Kaynak, E. B., & Dibeklioğlu, H. (2024). Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity. Sakarya University Journal of Computer and Information Sciences, 7(1), 77-91. https://doi.org/10.35377/saucis...1381522
AMA
1.Kaynak EB, Dibeklioğlu H. Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity. SAUCIS. 2024;7(1):77-91. doi:10.35377/saucis.1381522
Chicago
Kaynak, Ergün Batuhan, and Hamdi Dibeklioğlu. 2024. “Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity”. Sakarya University Journal of Computer and Information Sciences 7 (1): 77-91. https://doi.org/10.35377/saucis. 1381522.
EndNote
Kaynak EB, Dibeklioğlu H (April 1, 2024) Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity. Sakarya University Journal of Computer and Information Sciences 7 1 77–91.
IEEE
[1]E. B. Kaynak and H. Dibeklioğlu, “Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity”, SAUCIS, vol. 7, no. 1, pp. 77–91, Apr. 2024, doi: 10.35377/saucis...1381522.
ISNAD
Kaynak, Ergün Batuhan - Dibeklioğlu, Hamdi. “Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity”. Sakarya University Journal of Computer and Information Sciences 7/1 (April 1, 2024): 77-91. https://doi.org/10.35377/saucis. 1381522.
JAMA
1.Kaynak EB, Dibeklioğlu H. Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity. SAUCIS. 2024;7:77–91.
MLA
Kaynak, Ergün Batuhan, and Hamdi Dibeklioğlu. “Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity”. Sakarya University Journal of Computer and Information Sciences, vol. 7, no. 1, Apr. 2024, pp. 77-91, doi:10.35377/saucis. 1381522.
Vancouver
1.Ergün Batuhan Kaynak, Hamdi Dibeklioğlu. Systematic Analysis of Speech Transcription Modeling for Reliable Assessment of Depression Severity. SAUCIS. 2024 Apr. 1;7(1):77-91. doi:10.35377/saucis. 1381522

Cited By

 

INDEXING & ABSTRACTING & ARCHIVING

 

31045 31044   ResimLink - Resim Yükle  31047 

31043 28939 28938 34240
 

 

29070    The papers in this journal are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License