Enhanced Bald Eagle Search Optimizer with Transfer Learning-based Sign Language Recognition for Hearing-impaired Persons

Asiri, Mashael M.; Motwakel, Abdelwahed; Drar, Suhanda

doi:10.57197/JDR-2023-0039

INTRODUCTION

Communication among dumb and deaf people is done through visual and textual expressions. Gestural interaction is in the scope of secure and confidential interaction ( Rastgoo et al., 2020). Facial parts and hands are hugely powerful in expressing the opinions of humans in confidential interaction. Ordinary people must assume few syntactic meanings for expressions done by people of dumb and deaf communities ( Das et al., 2023). There exist different ways of expression or communication; however, the main mode of human interaction is speech, and if it can be slowed down, individuals need to utilize a tactile–kinesthetic mode of interaction in its place ( Mannan et al., 2022). As per the National Statistical Office survey report, the percentage of people with these disabilities in India has been 2.2% since December 2018. One of the highest adaptations for individuals with hearing and speech impairments is sign language. It is termed as a visual language ( Hameed et al., 2022). It has five fundamental variables: movement, hand shape, place, orientation, and elements like eyebrow movements and mouth shape. The research was conducted on voice generation with smart gloves that can offer a voice-to-sign language movement ( Elakkiya, 2021). Nevertheless, people who have no idea of sign language typically reject or undervalue public damage due to the lack of proper interactions between them ( Li et al., 2020). Therefore, this study devises a mechanism designed to remove the interaction gap and present all individuals with an equal and fair chance. It takes videos of the individual making hand gestures, passing, and processing them to the presented method that forecasts words ( Sharma and Kumar, 2021). The scheme then generated meaningful sentences of those words that can be transformed into the language chosen by the correspondent.

Long ago, numerous exciting studies existed for the detection of dynamic hand gestures for sign languages ( Pandey et al., 2023). The detection remains an inspiring problem despite efforts made in the domain in the past few years. The necessity for understanding multi-modal data like movement and hand gestures in the event of American Sign Language (ASL), whereas it creates a problem vaguer, is acute ( Aly and Aly, 2020). Additionally, a massive amount of words in sign language with comparable gestures having a few instances for all words makes the issue harder. Occasionally, the same signs from various viewpoints or various signers have various appearances ( Lee et al., 2021).

In this study, we propose an Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition (EBESO-TLSLR) technique for hearing-impaired persons. The presented EBESO-TLSLR technique aims to offer effective communication among hearing-impaired persons and normal people using deep learning (DL) models. In the EBESO-TLSLR technique, the SqueezeNet model is used for feature map generation. For recognition of sign language classes, the long short-term memory (LSTM) method can be used. Finally, the EBESO method is exploited for the optimal hyperparameter election of the LSTM approach. The simulation results of the EBESO-TLSLR method are validated on the sign language dataset.

RELATED STUDIES

Alnfiai (2023) presents an SSODL-ASLR method, expanded as shark smell optimization with DL-related automated sign language recognition, for speaking- and hearing-impaired people. The presented method focuses on the classification and recognition of sign language presented by deaf and dumb persons. In the initial phase, Mask RCNN approach can be used for sign language detection. Then, the SSO system with soft margin SVM method is employed for classifying sign languages. In Latif et al. (2021), the authors proposed an AI-related Arabic Sign Language (ArSL) Translator. This system can seize the image of the sign language effectuated by an individual who is deaf and offers real-time translation of hand gestures. New structures are devised to be mined from the imageries to be input to the four methods: regression, RF, RT, and bagging classifier.

In Latif et al. (2020), an ArSL detection structure with the design of the deep CNN (DCNN) was proposed. The goal is to aid persons with hearing difficulties to interact with normal persons. The presented structure identifies the sign of the Arabic alphabet grounded on real-time user inputs. In Bilgin and Mutludoğan (2019), the detection of sign language characters using a structure that has been trained via images of letters in ASL is intended. Capsule networks, which can be projected in the recent period, were utilized for testing and training processes and likened with LeNet which can be the first successful and presently with a method of DL. Hossain et al. (2020) devised a method to identify Bangla sign language (BSL) gestures with CNN. A large amount of openly available sign language data have been utilized to find BSL.

In Selvanambi et al. (2023), different methods to forecast what a user can be irritated to convey over hand gestures with ASL. A chapter emphasizes the presented approach or tool that is leveraged for practices of ASL by regular persons to practice and learn sign language. Marzouk et al. (2022) intended an ASODCAE-SLR method, expanded as atom search optimization include deep convolution AE-enabled sign language detection, for hearing and speaking disabled persons. The presented approach targets to support the interaction of speaking and hearing disabled people through the SLR procedure. As well, the devised method uses CapsNet feature extractors to generate a set of feature vectors.

MATERIALS AND METHODS

In this study, we have aimed to develop an automated sign language recognition using the EBESO-TLSLR technique for hearing-impaired persons. The presented EBESO-TLSLR technique is intended to offer effective communication among hearing-impaired persons and normal persons using the DL models. In the EBESO-TLSLR technique, a three-stage process is derived namely SqueezeNet feature extraction, LSTM-based sign recognition, and EBESO-based parameter tuning. Figure 1 depicts the workflow of the EBESO-TLSLR approach.

Figure 1:

Workflow of the EBESO-TLSLR approach. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

Feature extraction process

In the EBESO-TLSLR technique, the SqueezeNet model is used for feature map generation. SqueezeNet model can be a DCNN, which has a small amount of parameters, compressed structure, and attained higher precision than ImageNet and AlexNet with a similar amount of parameters ( Thanki, 2023). The major advantages are easier to deploy on a cloud platform, can be used and customized on hardware with restricted memory, and has fewer communication channels for training. This original SqueezeNet consists of 14 layers namely 3 max-pooling layers, 8 fire layers, 2 conventional convolution layers, 1 softmax, and a global avg-pooling layer. In this model, the convolution layer is exploited for extracting features from the input color retinal imageries. This sequential method comprises of a convolution layer, ith standalone Conv layer, and eight fire components. The amount of filters per fire component is increased progressively. The max pooling having the stride of 2 is implemented afterwards layers Conv layer 2, Conv layer 1, fire 3, and fire 7. The fire model is a squeeze Conv layer which comprises a 1×1 size fed into the expanding layer that has a mix of 1×1 and 3×3 convolutional filters. There are three tunable hyperparameters namely s1×1, e1×1, and e3×3. The e1×1 and e3×3 are expanded layers with filter sizes of 1×1 and 3×3, correspondingly, while s1×1 is a squeeze layer with a filter size of 1×1. Here, a fire module with hyperparameter s1×1 is used to limit the amount of input channels to the 3×3 size. After extracting features from the input retinal images, the flattened layer is used to convert features into a 1D0 layer.

Recognition process using the optimal LSTM model

To recognize various sign language classes, the LSTM approach was used. LSTM is a special kind of RNN used in the domain of DL. The main factor of the LSTM method is to present a storage unit for cyclic data communication, which records every past data up to the present moment ( Wang et al., 2022). Thus, in comparison to the short-term memory of conventional RNN, LSTM with long-term memory abilities: a gating model (including forget, input, and output gates) with a value between [0, 1] can be used for controlling the communication path of the internal data in the method.

Where tanh denotes the activation function. c _t− ₁ and h _t− ₁ denote the output of the memory unit and hidden layer (HL) at t–1 time. c _t , x _t , it, f _ta _t h _t , and o _t characterize storage unit, input, forget, candidate state, input gate, the output of HL, and output gate at t time, correspondingly. δ shows the logistic sigmoid function. Figure 2 represents the infrastructure of LSTM.

Figure 2:

LSTM architecture. Abbreviation: LSTM, long short-term memory.

The weights between HL and the output, forget, memory unit, and input gates, are represented as W _ho , W _hf , W _hc , and W _hi . W _xo , W _xf, W _xi , and W _xc correspondingly indicate the weight connection of the output gate, forget gate, input gate, and storage unit. ⊙ shows the product of vector components. b _f , b _i , and b _o indicate the bias. The LSTM loop structure controls the data flow by controlling the degree of closing and opening of forget, input, and output gates. Steps 1 to 6 are given as follows:

Step1: The forget gate f _t takes input x _t of the existing layer and the output h _t ₋₁ of HL at the prior moment as input, and the resultant output of forget gate can be multiplied with c _t ₋₁ for controlling what amount of data should be forgotten in the internal state c _t ₋₁ at the prior moment.

(1)

$f_{t} = σ (W_{x f} x_{t} + W_{h f} h_{t - 1} + b_{f})$

Step2: The input gate stores the present input data and the resultant output is utilized as the data to be updated.

(2)

$i_{t} = σ (W_{x i} x_{t} + W_{h i} h_{t - 1} + b_{i})$

Step3: The output gate 0 _fj controls internal state c _fj at the present moment, for controlling what amount of data should be outputted to the external state h _fj .

(3)

$o_{t} = σ (W_{x o} x_{t} + W_{h o} h_{t - 1} + b_{o})$

Step4: The output gate o _t is multiplied by the state of the storage unit processed by the tanh function. The output h _fj of HL is evaluated using Eq. (4):

(4)

$h_{t} = o_{t} \tanh ⊙ (c_{t})$

Step5: The memory unit c _fj records the prior data up to the present moment that is evaluated as follows:

(5)

$c_{t} = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ a_{t}$

Step6: The candidate state can be formulated as follows:

(6)

$a_{t} = t a n h (W_{x c} x_{t} + W_{h c} h_{t - 1} + b_{c})$

Finally, the EBESO algorithm is used for optimal hyperparameter tuning. Bald eagles often feast on protein-rich food. It targets fish ( Alsaidan et al., 2022). Bald eagle chooses a space between the deep water and the land surface. The search step begins after the hunter attains the specified region. As well, bald eagles have outstanding eyesight that allows them to mark the fish inside the water from a higher distance in the air. Once the hunter specifies its targets, it begins to slowly descend to grab the fish and to catch its victim. The mathematical representation and equation of the BESO algorithm label the three phases of hunting by the bald eagle.

The first stage (selection)

The bald eagle defines the region where it could capture the fish. This phase is shown in mathematical method as follows:

(7)

$P_{n e w}, i = P_{b e s t} + α \times r (P_{m e a n} - P_{i})$

In Equation (7), α denotes the location change control parameter ϵ [1.5, 2]. r ϵ [ O, 1] at random. Another exploration region is chosen that is nearby to the formerly chosen one. P _best is an existing exploration space defined by the bald eagles. It can be selected based on the previous location defined. P _mean demonstrates that the eagle benefits from the data of the prior position. This initial phase significantly improves the candidate solution based on the mean location and the better location.

The second stage (search)

In this stage, the hunter searches for the victim. The search can be performed in the previously defined exploration region. Next, the eagle travels in different directions in the spiral region so that the searching process can be speeded up:

(8)

$\begin{array}{l} P_{i, n e w} = P_{i} + y (i) \times (P_{i} - P_{i + 1}) + x (i) \times (P_{i} - P_{m e a n}) \\ x (i) = \frac{x r (i)}{max(| x r |)}, y (i) = \frac{y r (i)}{max(| x r |)} \\ x r (i) = r (i) \times sin(θ (i)), y r (i) = r (i) \times cos(θ (i)) \\ θ (i) = a \times π \times r a n d \\ r (i) = θ (i) + R × r a n d \end{array}$

In Eq. (8), a denotes the random variables ϵ [5, 10]. ‘a’ and ‘R’ parameters control the variation in spiral shape. Likewise, the variable R ϵ [0.5, 2] controlled the determination of the exploration cycle number.

In this stage, the location moves toward the center. Once the ‘a’ and ‘R’ change, the BESO algorithm diversifies to seek more accurate solutions and to prevent trapping in the local solution.

The third stage (swooping)

In the hunting process, swooping is the third and last phase, where the eagles move from the better position toward the targeted prey.

(9)

$\begin{array}{l} P_{i n e w} = r a n d \times P_{b e s t} + x 1 (i) \times (P_{i} - c 1 \times P_{m e a n}) + y 1 (i) \times (P_{i} - c 2 \times P_{b e s t}) \\ x 1 (i) = \frac{x r (i)}{max(| x r |)}, y 1 (i) = \frac{y r (i)}{max(| y r |)} \\ x r (i) = r (i) \times sinh[θ (i)], y r (i) = r (i) \times cosh[θ (i)] \\ θ (i) = a \times π \times r a n d \\ r (i) = θ (i) \end{array}$

From the expression, the two random parameters, c1 and c2, change between 1 and 2 to further intensify the motion of the hunting eagle toward the better position. The mean solution might push the presented method to diverse or intense toward the optimum solution.

Finally, the optimum solution is accomplished by a minimal amount of iterations. All the stages are affected by intensification and diversification factors. They are significant for the continual upgrade to the candidate solution and lastly, to obtain the optimum one.

In the EBESO algorithm, the levy function can be included for improving the outcome of the presented BES method. This can be performed by altering the search agent-utilizing factor “LF”. This factor is arithmetically evaluated using the following expression:

(10)

$\begin{array}{l} L F (γ) = 0.01 \times \frac{u \times σ}{| v |^{\frac{1}{γ}}}, \\ σ = {(\frac{Γ (1 + γ) × s i n (\frac{π γ}{2})}{Γ (\frac{1 + γ}{2}) \times γ \times 2^{(\frac{γ - 1}{2})}})}^{\frac{1}{γ}} \end{array}$

Where v and u denote random values within [0, 1].

In the third step, the Levy function is inserted, the Swoop step. The new population can be evaluated using Equation (11).

(11)

$P_{i, n e w} = {\begin{matrix} P_{b e s t} + P 1 \times C F \times P_{i, n e w} r_{1} < 0.5 \\ P_{i, n e w} + P \times C F \times s t e p s i z e 2_{l} r_{1} \geq 0.5 \end{matrix}$

Where r ₁ denotes a random integer within [0, 1]. P1 and CF are constants. The stepsize _l is evaluated using Equation (12):

(12)

$s t e p s i z e 2_{l} = L F *(P_{b e s t} - L F * P_{i, n e w})$

RESULTS AND DISCUSSION

The sign language detection and classification outcomes of the EBESO-TLSLR method are demonstrated.

In Table 1 and Figure 3, the overall sign language recognition results of the EBESO-TLSLR technique are illustrated. The experimental values suggest that the EBESO-TLSLR technique reaches improved recognition rates. For instance, with 70% of TRP, the EBESO-TLSLR technique obtains average accu _y , prec _n , reca _l , and F _score of 99.42, 99.30, 99.37, and 99.39%, respectively. On the other hand, with 30% of TSP, the EBESO-TLSLR method attains average accu _y , prec _n , reca _l , and F _score of 98.46, 98.55, 98.54, and 98.41%, correspondingly.

Figure 3:

Average outcome of the EBESO-TLSLR approach on 70:30 of TRP/TSP. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

Table 1:

Sign language recognition outcome of the EBESO-TLSLR system on 70:30 of TRP/TSP.

Training phase (70%)
Sign	Accu _y	Prec _n	Reca _l	F _score	Sign	Accu _y	Prec _n	Reca _l	F _score
A	99.03	99.26	99.72	99.15	P	99.67	99.22	99.68	99.28
B	99.53	99.67	99.26	99.40	Q	99.44	99.05	99.31	99.20
C	99.60	99.59	99.72	99.51	R	99.41	99.25	99.64	99.51
D	99.31	99.45	99.00	99.65	S	99.71	99.37	99.09	99.09
E	99.56	99.08	99.74	99.23	T	99.14	99.57	99.07	99.23
F	99.31	99.05	99.27	99.30	U	99.34	99.45	99.24	99.77
G	99.00	99.06	99.50	99.53	V	99.65	99.43	99.22	99.62
H	99.28	99.10	99.10	99.79	W	99.57	99.30	99.71	99.07
I	99.71	99.26	99.62	99.79	X	99.38	99.73	99.79	99.40
J	99.57	99.24	99.03	99.10	Y	99.74	99.68	99.07	99.54
K	99.24	99.13	99.16	99.13	Z	99.28	99.72	99.04	99.79
L	99.54	99.72	99.36	99.55	Space	99.30	99.55	99.08	99.59
M	99.42	99.48	99.01	99.28	Nothing	99.53	99.25	99.48	99.64
N	99.65	99.06	99.27	99.06	Delete	99.44	99.34	99.30	99.28
O	99.51	99.31	99.73	99.42	Average	99.42	99.30	99.37	99.39

Testing phase (30%)
Sign	Accu _y	Prec _n	Reca _l	F _score	Sign	Accu _y	Prec _n	Reca _l	F _score
A	98.43	98.49	98.49	98.76	P	98.54	98.79	98.19	98.68
B	98.66	98.46	98.61	98.29	Q	98.72	98.18	98.13	98.39
C	98.14	98.38	98.71	98.51	R	98.85	98.35	98.08	98.51
D	98.58	98.68	98.58	98.24	S	98.70	98.32	98.91	98.31
E	98.62	98.56	98.13	98.48	T	98.51	98.45	98.52	98.65
F	98.07	98.26	98.04	98.52	U	98.96	98.56	98.85	98.68
G	98.29	98.59	98.21	98.15	V	98.01	98.80	98.60	98.65
H	98.25	98.44	98.79	98.54	W	98.57	98.61	98.53	98.40
I	98.46	98.15	98.71	98.87	X	98.60	98.03	98.12	98.59
J	98.14	98.61	98.42	98.66	Y	98.63	98.88	98.18	98.09
K	98.75	98.97	98.08	98.05	Z	98.43	98.80	98.79	98.67
L	98.93	98.87	98.94	98.79	Space	99.00	98.44	98.02	98.20
M	98.53	98.90	98.82	98.05	Nothing	98.56	98.51	98.89	98.61
N	98.60	98.70	98.95	98.14	Delete	98.75	98.81	98.19	98.60
O	98.44	98.21	98.57	98.17	Average	98.46	98.55	98.54	98.41

Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

Figure 4 inspects the accuracy of the EBESO-TLSLR method in the training and validation of the test database. The result specified that the EBESO-TLSLR technique reach higher accuracy values over greater epochs. Also, the higher validation accuracy over training accuracy portrayed that the EBESO-TLSLR technique learns productively on the test database.

Figure 4:

Accuracy curve of the EBESO-TLSLR approach. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

The loss analysis of the EBESO-TLSLR technique in the training and validation is given on the test database in Figure 5. The result points out that the EBESO-TLSLR method reaches closer values of training and validation loss. The EBESO-TLSLR method learns productively on a test database.

Figure 5:

Loss curve of the EBESO-TLSLR approach. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

In Table 2, a detailed comparison study of the EBESO-TLSLR method is reported with recent approaches ( Alrowais et al., 2022).

Table 2:

Comparative outcome of the EBESO-TLSLR approach with other methods.

Methods	Accuracy (%)	Computation time (minutes)
KNN algorithm	96.25	16.54
SVM model	98.10	14.34
ANN model	98.11	15.41
CNN model	99.09	11.21
ODTL-SLRC	99.03	06.44
EBESO-TLSLR	99.42	03.01

Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

Figure 6 illustrates the accu _y investigation of the EBESO-TLSLR method with existing techniques. The result demonstrates that the EBESO-TLSLR technique reaches improved performance. Based on accu _y , the EBESO-TLSLR technique obtains accu _y of 99.42% while the existing KNN, SVM, ANN, CNN, and ODTL-SLRC techniques attained accu _y of 96.25, 98.10, 98.11, 99.09, and 99.03%, correspondingly.

Figure 6:

Accu _y outcome of the EBESO-TLSLR approach with other approaches. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

Figure 7 illustrates the CT investigation of the EBESO-TLSLR method with existing approaches. The result highlighted that the EBESO-TLSLR method reaches improved performance. Based on CT, the EBESO-TLSLR approach gains CT of 3.01 m while the existing KNN, SVM, ANN, CNN, and ODTL-SLRC algorithms attained CT of 16.54, 14.34, 15.41, 11.21, and 6.44 m correspondingly. These outcomes reassured the better performance of the EBESO-TLSLR method over other approaches.

Figure 7:

CT outcome of the EBESO-TLSLR approach with other methods. Abbreviation: EBESO-TLSLR, Enhanced Bald Eagle Search Optimizer with Transfer Learning Sign Language Recognition.

CONCLUSION

In this study, we have aimed to develop automated sign language recognition using the EBESO-TLSLR technique for hearing-impaired persons. The presented EBESO-TLSLR technique is intended to offer effective communication among hearing-impaired persons and normal persons using the DL models. In the EBESO-TLSLR technique, the SqueezeNet model is used for feature map generation. For recognition of sign language classes, the LSTM method is used. Finally, the EBESO method is exploited for the optimal hyperparameter election of the LSTM model. The simulation results of the EBESO-TLSLR approach are validated on the sign language dataset. The simulation outcomes illustrate the superior results of the EBESO-TLSLR technique with a maximum accuracy of 96.25%. In future, the detection rate of the EBESO-TLSLR method can be boosted by the feature fusion process.

[1] Alnfiai MM. 2023. Deep learning-based sign language recognition for hearing and speaking impaired people. Intell. Autom. Soft Comput. Vol. 36(2):1653–1669

[2] Alrowais F, Alotaibi SS, Dhahbi S, Marzouk R, Mohamed A, Hilal AM. 2022. Sign language recognition and classification model to enhance quality of disabled people. Comput. Mater. Continua. Vol. 73(2):3419–3432

[3] Alsaidan I, Shaheen MA, Hasanien HM, Alaraj M, Alnafisah AS. 2022. A PEMFC model optimization using the enhanced bald eagle algorithm. Ain. Shams. Eng. J. Vol. 13(6):101749

[4] Aly S, Aly W. 2020. DeepArSLR: a novel signer-independent deep learning framework for isolated Arabic sign language gestures recognition. IEEE Access. Vol. 8:83199–83212

[5] Bilgin M, Mutludoğan K. 2019. American sign language character recognition with capsule networks2019 3rd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT); IEEE. Ankara, Turkey. 11-13 October 2019; p. 1–6

[6] Das S, Imtiaz MS, Neom NH, Siddique N, Wang H. 2023. A hybrid approach for Bangla sign language recognition using deep transfer learning model with random forest classifier. Expert Syst. Appl. Vol. 213:118914

[7] Elakkiya R. 2021. RETRACTED ARTICLE: Machine learning based sign language recognition: a review and its research frontier. J. Ambient Intell. Hum. Comput. Vol. 12(7):7205–7224

[8] Hameed H, Usman M, Khan MZ, Hussain A, Abbas H, Imran MA, et al.. 2022. Privacy-preserving British sign language recognition using deep learning2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC); IEEE. Glasgow, Scotland, UK. July 11-15; p. 4316–4319

[9] Hossain S, Sarma D, Mittra T, Alam MN, Saha I, Johora FT. 2020. Bengali hand sign gestures recognition using convolutional neural network2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA); IEEE. Coimbatore, India. July 15-17; p. 636–641

[10] Latif G, Mohammad N, AlKhalaf R, AlKhalaf R, Alghazo J, Khan M. 2020. An automatic Arabic sign language recognition system based on deep CNN: an assistive system for the deaf and hard of hearing. Int. J. Comput. Digital Syst. Vol. 9(4):715–724

[11] Latif G, Alghazo J, Mohammad N, Alghazo R. 2021. Communicating with the deaf and hard of hearing through automatic Arabic sign language translator. J. Phys. Vol. 1962(1):012055

[12] Lee CK, Ng KK, Chen CH, Lau HC, Chung SY, Tsoi T. 2021. American sign language recognition and training method with recurrent neural network. Expert Syst. Appl. Vol. 167:114403

[13] Li D, Rodriguez C, Yu X, Li H. 2020. Word-level deep sign language recognition from video: a new large-scale dataset and methods comparisonProceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision; Snowmass, CO, USA. March 1-5; p. 1459–1469

[14] Mannan A, Abbasi A, Javed AR, Ahsan A, Gadekallu TR, Xin Q. 2022. Hypertuned deep convolutional neural network for sign language recognition. Comput. Intell. Neurosci. Vol. 2022:1450822

[15] Marzouk R, Alrowais F, Al-Wesabi FN, Hilal AM. 2022. Atom search optimization with deep learning enabled Arabic sign language recognition for speaking and earing disability persons. Healthcare. Vol. 10(9):1606

[16] Pandey A, Chauhan A, Gupta A. 2023. Voice based sign language detection for dumb people communication using machine learning. J. Pharm. Negat. Results. 22–30

[17] Rastgoo R, Kiani K, Escalera S. 2020. Hand sign language recognition using multi-view hand skeleton. Expert Syst. Appl. Vol. 150:113336

[18] Selvanambi R, Karuppiah M, Islabudeen M. 2023. Mobile application-based sign language detector for deaf peopleDesigning and Developing Innovative Mobile Applications. IGI Global. p. 329–350

[19] Sharma S, Kumar K. 2021. ASL-3DCNN: American sign language recognition technique using 3-D convolutional neural networks. Multimedia Tools Appl. Vol. 80(17):26319–26331

[20] Thanki R. 2023. A deep neural network and machine learning approach for retinal fundus image classification. Healthcare Anal. Vol. 3:100140

[21] Wang J, Zhang D, Zhou Y. 2022. Ensemble deep learning for automated classification of power quality disturbances signals. Electr. Power Syst. Res. Vol. 213:108695

Journal of Disability Research

Enhanced Bald Eagle Search Optimizer with Transfer Learning-based Sign Language Recognition for Hearing-impaired Persons

Abstract

Main article text

INTRODUCTION

RELATED STUDIES

MATERIALS AND METHODS

Feature extraction process

Recognition process using the optimal LSTM model

The first stage (selection)

The second stage (search)

The third stage (swooping)

RESULTS AND DISCUSSION

CONCLUSION

REFERENCES

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Comments

Comment on this article