Arabic Speech Recognition: Advancement and Challenges

Ashifur Rahman; Md Mohsin Kabir; M. F. Mridha; Mohammed Alatiyyah; Haifa F. Alhasson; Shuaa S. Alharbi

doi:10.1109/ACCESS.2024.3376237

Arabic Speech Recognition: Advancement and Challenges

Ashifur Rahman, Md Mohsin Kabir, M. F. Mridha, Mohammed Alatiyyah, Haifa F. Alhasson, Shuaa S. Alharbi

Computer Sciences

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language's linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language's unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.

Original language	English
Pages (from-to)	39689-39716
Number of pages	28
Journal	IEEE Access
Volume	12
DOIs	https://doi.org/10.1109/ACCESS.2024.3376237
State	Published - 2024

Keywords

Arabic speech recognition
Arabic speech-to-text
ASR technology
speech recognition
voice recognition

Access to Document

10.1109/ACCESS.2024.3376237

Cite this

@article{ee68b03c337149c39623f8cdfa76de5f,

title = "Arabic Speech Recognition: Advancement and Challenges",

abstract = "Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language's linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language's unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.",

keywords = "Arabic speech recognition, Arabic speech-to-text, ASR technology, speech recognition, voice recognition",

author = "Ashifur Rahman and Kabir, \{Md Mohsin\} and Mridha, \{M. F.\} and Mohammed Alatiyyah and Alhasson, \{Haifa F.\} and Alharbi, \{Shuaa S.\}",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2024",

doi = "10.1109/ACCESS.2024.3376237",

language = "English",

volume = "12",

pages = "39689--39716",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Arabic Speech Recognition

T2 - Advancement and Challenges

AU - Rahman, Ashifur

AU - Kabir, Md Mohsin

AU - Mridha, M. F.

AU - Alatiyyah, Mohammed

AU - Alhasson, Haifa F.

AU - Alharbi, Shuaa S.

PY - 2024

Y1 - 2024

N2 - Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language's linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language's unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.

AB - Speech recognition is a captivating process that revolutionizes human-computer interactions, allowing us to interact and control machines through spoken commands. The foundation of speech recognition lies in understanding a given language's linguistic and textual characteristics. Although automatic speech recognition (ASR) systems flawlessly convert speech into text for various international languages, their implementation for Arabic remains inadequate. In this research, we diligently explore the current state of Arabic ASR systems and unveil the challenges encountered during their development. We categorize these challenges into two groups: those specific to the Arabic language and those more general. We propose strategies to overcome these obstacles and emphasize the need for ASR architectures tailored to the Arabic language's unique grammatical and phonetic structure. In addition, we provide a comprehensive and explicit description of various feature extraction methods, language models, and acoustic models utilized in the Arabic ASR system.

KW - Arabic speech recognition

KW - Arabic speech-to-text

KW - ASR technology

KW - speech recognition

KW - voice recognition

UR - http://www.scopus.com/inward/record.url?scp=85188016191&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2024.3376237

DO - 10.1109/ACCESS.2024.3376237

M3 - Article

AN - SCOPUS:85188016191

SN - 2169-3536

VL - 12

SP - 39689

EP - 39716

JO - IEEE Access

JF - IEEE Access

ER -

Arabic Speech Recognition: Advancement and Challenges

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this