Speech-based respiratory diagnostics: A study on COVID-19 detection with machine learning

  • Gaurav Datkhile
  • , Pramod H. Kachare
  • , Sandeep B. Sangle
  • , Ibrahim Al-Shourbaji
  • , Abdoh Jabbari
  • , Raimund Kirner
  • , Abdalla Alameen

Research output: Contribution to journalArticlepeer-review

Abstract

Respiratory sound analysis has emerged as a promising approach for detecting and diagnosing respiratory diseases, including COVID-19. This study investigates using OpenSMILE features for COVID-19 detection using vowel speech sounds /a/, /e/, and /o/ from the COSWARA dataset. OpenSMILE facilitates the extraction of audio and functional features, which are then classified using various machine learning algorithms. Multiple ML classifiers Random Forest (RF), Support Vector Machine, Decision Tree, and Artificial Neural Network are evaluated. To enhance classification performance, five distinct feature selection techniques were applied: ANOVA, chi-square, Information Gain, ReliefF, and Gini index. Among these, ANOVA-based selection yielded the most consistent results across classifiers and vowel sounds. Among the models evaluated, the RF classifier achieved the highest accuracies of 76.47% for vowel /a/ and 75.54% for vowels /a/ and /o/, respectively, when combined with ANOVA-selected features (155, 163, and 161 features). To statistically assess model and feature selection performances, the Friedman test was conducted across classifiers and feature selection techniques. Results confirmed the significance of Random Forest and ANOVA as robust combinations. This research contributes to developing accessible, scalable, and non-invasive diagnostic tools, enhancing the potential of telemedicine and remote healthcare systems for the early detection of respiratory diseases.

Original languageEnglish
Article numbere0332146
JournalPLoS ONE
Volume20
Issue number11 November
DOIs
StatePublished - Nov 2025

Fingerprint

Dive into the research topics of 'Speech-based respiratory diagnostics: A study on COVID-19 detection with machine learning'. Together they form a unique fingerprint.

Cite this