Arabic Speech Recognition: Advancement and Challenges

Ashifur Rahman, Md Mohsin Kabir, M. F. Mridha, Mohammed Alatiyyah, Haifa F. Alhasson, Shuaa S. Alharbi

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language's linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language's unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.

Original languageEnglish
Pages (from-to)39689-39716
Number of pages28
JournalIEEE Access
Volume12
DOIs
StatePublished - 2024

Keywords

  • Arabic speech recognition
  • Arabic speech-to-text
  • ASR technology
  • speech recognition
  • voice recognition

Fingerprint

Dive into the research topics of 'Arabic Speech Recognition: Advancement and Challenges'. Together they form a unique fingerprint.

Cite this