Deep Learning in Sign Language Recognition: A Hybrid Approach for the Recognition of Static and Dynamic Signs

Ahmed Mateen Buttar; Usama Ahmad; Abdu H. Gumaei; Adel Assiri; Muhammad Azeem Akbar; Bader Fahad Alkhamees

doi:10.3390/math11173729

Computer Sciences

Research output: Contribution to journal › Article › peer-review

49 Scopus citations

Abstract

A speech impairment limits a person’s capacity for oral and auditory communication. A real-time sign language detector would greatly improve communication between the deaf and the general public. This work proposes a deep learning-based algorithm that detects a person’s gestures and identifies the words they represent. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty lies in obtaining a model that handles continuous signs independently of the signer. Differences in signers’ speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. In the first, we create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in most cases. In the second, we create a sign language detector for static signs using YOLOv6. This application helps sign language users and learners practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs, and the resulting system shows remarkable accuracy in real time.
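As an illustration of the continuous-sign pipeline mentioned in the abstract (an LSTM fed with MediaPipe holistic landmarks), the sketch below shows one plausible way to turn per-frame landmarks into fixed-length sequences. It is not taken from the paper: the choice to keep pose, face, and both hands, the zero-filling of undetected parts, the 30-frame window, and the names `flatten_frame` and `SequenceBuffer` are all assumptions made for this example. Landmarks are passed in as plain tuples, so the snippet runs without MediaPipe installed.

```python
from collections import deque

# Landmark counts produced by MediaPipe Holistic (pose 33, face 468, hand 21).
# Which values are kept per landmark is an assumption of this sketch:
# (x, y, z, visibility) for pose and (x, y, z) for face and hands.
POSE_POINTS, FACE_POINTS, HAND_POINTS = 33, 468, 21
FRAME_SIZE = POSE_POINTS * 4 + FACE_POINTS * 3 + 2 * HAND_POINTS * 3  # 1662


def flatten_part(landmarks, n_points, n_values):
    """Flatten one body part to a list of floats; zero-fill if undetected."""
    if landmarks is None:
        return [0.0] * (n_points * n_values)
    flat = [float(v) for point in landmarks for v in point]
    if len(flat) != n_points * n_values:
        raise ValueError("unexpected landmark shape")
    return flat


def flatten_frame(pose=None, face=None, left_hand=None, right_hand=None):
    """Concatenate all parts into one per-frame feature vector."""
    return (
        flatten_part(pose, POSE_POINTS, 4)
        + flatten_part(face, FACE_POINTS, 3)
        + flatten_part(left_hand, HAND_POINTS, 3)
        + flatten_part(right_hand, HAND_POINTS, 3)
    )


class SequenceBuffer:
    """Sliding window of frame vectors (hypothetical helper, window is assumed)."""

    def __init__(self, window=30):
        self.frames = deque(maxlen=window)

    def push(self, frame_vector):
        """Add a frame; return the full window once enough frames exist."""
        self.frames.append(frame_vector)
        if len(self.frames) == self.frames.maxlen:
            return list(self.frames)
        return None
```

Each full window returned by `push` has shape (window, 1662) and could be passed to a sequence classifier such as an LSTM; a fixed window is one simple way to cope with signers who move at different speeds, though the paper may handle this differently.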

Original language	English
Article number	3729
Journal	Mathematics
Volume	11
Issue number	17
DOIs	https://doi.org/10.3390/math11173729
State	Published - Sep 2023

Keywords

Long Short-Term Memory (LSTM)
MediaPipe holistic
You Only Look Once (YOLO)
confusion matrix
convolutional neural network (CNN)
deep learning

Access to Document

10.3390/math11173729

Cite this

@article{3508bb114acc4e21baa0cefc702e7cd7,

title = "Deep Learning in Sign Language Recognition: A Hybrid Approach for the Recognition of Static and Dynamic Signs",

abstract = "A speech impairment limits a person{\textquoteright}s capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person{\textquoteright}s gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers{\textquoteright} speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92\% accuracy for different continuous signs, and the YOLOv6 model achieves 96\% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.",

keywords = "Long Short-Term Memory (LSTM), MediaPipe holistic, You Only Look Once (YOLO), confusion matrix, convolutional neural network (CNN), deep learning",

author = "Buttar, {Ahmed Mateen} and Usama Ahmad and Gumaei, {Abdu H.} and Adel Assiri and Akbar, {Muhammad Azeem} and Alkhamees, {Bader Fahad}",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = sep,

doi = "10.3390/math11173729",

language = "English",

volume = "11",

journal = "Mathematics",

issn = "2227-7390",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "17",

}

TY - JOUR

T1 - Deep Learning in Sign Language Recognition

T2 - A Hybrid Approach for the Recognition of Static and Dynamic Signs

AU - Buttar, Ahmed Mateen

AU - Ahmad, Usama

AU - Gumaei, Abdu H.

AU - Assiri, Adel

AU - Akbar, Muhammad Azeem

AU - Alkhamees, Bader Fahad

PY - 2023/9

Y1 - 2023/9

N2 - A speech impairment limits a person’s capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person’s gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers’ speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.

AB - A speech impairment limits a person’s capacity for oral and auditory communication. A great improvement in communication between the deaf and the general public would be represented by a real-time sign language detector. This work proposes a deep learning-based algorithm that can identify words from a person’s gestures and detect them. There have been many studies on this topic, but the development of static and dynamic sign language recognition models is still a challenging area of research. The difficulty is in obtaining an appropriate model that addresses the challenges of continuous signs that are independent of the signer. Different signers’ speeds, durations, and many other factors make it challenging to create a model with high accuracy and continuity. For the accurate and effective recognition of signs, this study uses two different deep learning-based approaches. We create a real-time American Sign Language detector using the skeleton model, which reliably categorizes continuous signs in sign language in most cases using a deep learning approach. In the second deep learning approach, we create a sign language detector for static signs using YOLOv6. This application is very helpful for sign language users and learners to practice sign language in real time. After training both algorithms separately for static and continuous signs, we create a single algorithm using a hybrid approach. The proposed model, consisting of LSTM with MediaPipe holistic landmarks, achieves around 92% accuracy for different continuous signs, and the YOLOv6 model achieves 96% accuracy over different static signs. Throughout this study, we determine which approach is best for sequential movement detection and for the classification of different signs according to sign language and shows remarkable accuracy in real time.

KW - Long Short-Term Memory (LSTM)

KW - MediaPipe holistic

KW - You Only Look Once (YOLO)

KW - confusion matrix

KW - convolutional neural network (CNN)

KW - deep learning

UR - http://www.scopus.com/inward/record.url?scp=85176402392&partnerID=8YFLogxK

U2 - 10.3390/math11173729

DO - 10.3390/math11173729

M3 - Article

AN - SCOPUS:85176402392

SN - 2227-7390

VL - 11

JO - Mathematics

JF - Mathematics

IS - 17

M1 - 3729

ER -
