Assisting Visually Impaired People Using Deep Learning-based Anomaly Detection in Pedestrian Walkways for Intelligent Transportation Systems on Remote Sensing Images

Alsolai, Hadeel; Al-Wesabi, Fahd N.; Motwakel, Abdelwahed; Drar, Suhanda

doi:10.57197/JDR-2023-0021

INTRODUCTION

Currently, in metropolitan areas, intelligent transportation systems were devised to help decrease traffic, the rate of injuries, accidents, and deaths, diminish environmental pollution, and reduce fuel consumption, etc. ( Khan et al., 2022). Such structures use diverse technologies [deep learning (DL), Internet of Things, image processing, and neural network (NN)] for many applications. Alternatively, technology companies (namely Tesla and Google) are attempting to develop self-driving cars that could offer safe travel for persons while the driver is sleepy, which could save his life along with other travelers’ lives, with automatic control cars to avoid accidents ( Pustokhina et al., 2021). Such automobiles should be fortified with sensors for sensing the atmosphere, finding objects close to the car, and notifying the driver, and have actuators to do real-time processes while the driver is sleepy or fails to pay attention to hazards to avoid accidents ( Mansour et al., 2021). Intelligent transportation systems were advanced for real-time operations and automatic road management in the situation of a natural accident (like an icy road, a mountain fall, and an avalanche) or unnatural disasters (namely car traffic accidents and road repairs) ( Chang et al., 2022). Such systems utilize sensors as a tool to understand and find the state of travel atmosphere. The data gathered by such sensors by means of communication technologies [dedicated short-range communication (DSRC and WiFi)] to other vehicles on the route are transferred to control centers ( Khan et al., 2022).

The captured visual data have details that are precise when compared with substitute data sources namely radar signals, global positioning system (GPS), mobile transmission, etc. ( Avola et al., 2022). In addition, it serves a main role in predicting accidents and other abnormal actions by collecting particulars concerning road traffic circumstances. The usage of closed-circuit television (CCTV) is useful in different real-time applications like accidental falls and grade crossing trespassing ( Sabuhi et al., 2021). Several computer vision (CV) relies on works were modeled by focusing processes namely behavioral learning, data acquisition, and feature extraction. The intention of studies is to calculate processes like vehicle prediction and observation, scene detection, anomaly prediction approaches, activity examination, multi camera-relied schemes and difficulties, video processing models, traffic observation, and human behavior learning ( Dey et al., 2019). Now, anomalous prediction is a sub-field of behavior learning from captured visual scenes. Machine learning (ML) refers to a domain of artificial intelligence (AI) that utilizes statistical approaches to learn hidden patterns from prevailing data and to make decision regarding unobserved records ( Abdullah and Jalal, 2023). The core task of machine learners is constructing a general model on the possible dispersal of training samples and generalizing experience to unseen instances. The learning procedure relies upon the data quality exhibited ( Aldayri and Albattah, 2022). An instance is presented in a dataset with various properties. Inappropriately, mining proficient features seems to be hard for a few tasks.

This manuscript presents a new sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways (SCADL-ADPW) method. The SCADL-ADPW techniques focus on the identification and classification of anomalies, i.e. vehicles in the pedestrian walkways of visually impaired people (VIP). To accomplish this, the SCADL-ADPW technique uses the VGG-16 model for feature vector generation. In addition, the SCA approach is designed for the optimal hyperparameter tuning process. For anomaly detection, long short-term memory (LSTM) method can be exploited. The experimental results of the SCADL-ADPW technique are studied on the UCSD anomaly detection dataset.

RELATED WORKS

Yang et al. (2018) devised to seize convolutional neural network (CNN)-related per-pixel semantic segmenters to cover navigational perception necessities in a unified way. It is not critical to observe crosswalk location or traffic light signals, but analyze the states of other vehicles and pedestrians. At the centroid of unification proposal is a DL structure, aspired to achieve robust and potential semantic understanding. Shimakawa et al. (2019) deal with obstacle recognition to support VIP walks. Previous studies had adopted a method of utilizing some image processing methods for finding the steps of staircase on depth imageries by an RGB-D camera. Still, it was hard to carry the RGB-D camera. This study implemented CNN, a type of DL approach to hindrance detection, and advanced an application program functioning on smartphones.

Lin et al. (2018) devised a visual localizer that has global optimization and ConvNet descriptor for attaining strong visual localization for aided navigation. Through concatenation of two compressed convolution layers of GoogLeNet, thousands of bytes are utilized for denoting image proficiently. Hsieh et al. (2021) developed an assistive mechanism to help VIP walk outdoors. Depending on the depth map acquired by ZED 2 and CNN-NN-FAST-SCNN, the image of the environment in front of the VIP can be devised into seven equal divisions.

Bhattacharya and Asari (2021) devised a method that incorporated wearable devices and Android devices to support VIP and make it simpler to navigate safely. The glove sync and the Android device are connected through Bluetooth connection, and the vibration actuator needs haptic feedback from the user based on the instruction from the paired devices. Le et al. (2022) recommend a novel BGN for camera-related recognition of pedestrian lanes in unstructured scene. For the VIP safety, along with an output segmentation map, network provided two complete-resolution maps of epistemic and aleatoric uncertainty in addition to well-calibrated confidence measure.

PROPOSED ANOMALY DETECTION MODEL

In this manuscript, we have devised a new SCADL-ADPW technique for anomaly detection purposes. The proposed SCADL-ADPW technique identifies the occurrence of anomalies in the pedestrian walkways on remote sensing images. The SCADL-ADPW techniques focus on the identification and classification of anomalies, i.e. vehicles in the pedestrian walkways of VIP. Figure 1 represents the workflow of the SCADL-ADPW algorithm.

Figure 1:

Workflow of the SCADL-ADPW approach. Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

Feature extraction module

The SCADL-ADPW techniques use the VGG-16 model for feature vector generation. The VGG-16 is the essential basis of CNN ( Deepa and Chokkalingam, 2022). VGG designs and trains object detection where the most important CNN is VGG-16. In 2015, Google researchers introduce the InceptionV3 network. The high computation cost related to CNN can be overcome by the pretrained structure namely VGG-16. Figure 2 illustrates the infrastructure of VGG-16. In the presented method, the pooling, convolution, output and fully connected layers are the four main elements included in CNNs.

Figure 2:

Structure of VGG16.

Convolutional layer

The building block in CNNs are a convolution layer. The convolution-related method convert inputs into outputs. The fixed height and width with plane composes the input volume. The depth D can be determined by number of planes. The amount of M planes with the volume structures presented in the convolution layer. The amount of filters or kernels with a small size (K) determines all the planes. The weight with filter size is 3 × 3. Y _K shows the matrix value in terms of k ^th input plane.

(1)

$X_{l} = F (b_{l} + \sum_{k = 1}^{D} w_{k l} * y_{k}); l = 1, \dots, M,$

where w _kl denotes the parametric weight. Over kth input planes, the 2D matrix w _kl * Y _k is generated by means of convolution of these filters. The F activation function transform the outcome and bias function b _l .

The feature map resultant matrix X _l . The feature map denotes M. In this work, the activation function was ReLU. While keeping all the positive input, it changes every value of input to 0.

(2)

$F (y) = Max (0, y)$

The convolution layer output employs the ReLU function. The network learning speed and classification accuracy improved.

Pooling layer

The feature map size can be decreased by means of the pooling layer after all the blocks of a layer or convolution layer. The crucial data are preserved while avoiding inappropriate details. The squared section and the stride (s) of feature map size ( F) are the two crucial parameter to determine pooling.

The s = 2 and F = 2 with five pooling layers existing in VGG-16. The block organizes 13 convolution layers which comprise the VGG-16.

Fully connected and output layer

Attain feature map set afterwards the successive and pooling convolution layer. The input–output function implemented by all the neurons of the fully connected (FC) layer is represented as follows:

(3)

$Z = F (b + \sum_{j}^{n} w_{j} Y_{j})$

Therefore, the w _j and Y _j neuron denote input features and neuron weight. The FC layer output is Z:

(4)

$F (x_{j}) = S o f t m a x (x_{l}) = \frac{e^{(x_{l})}}{\sum_{k = 1}^{8} e^{(x_{k})}}, f o r l = 1, \dots .., 8$

The probability distribution outcome can be attained by using the output layer of all the node’s outcomes.

Hyperparameter tuning module

Here, the SCA approach is designed for the optimal hyperparameter tuning process. SCA is a population-based optimization technique which creates several initial choices and enables them to fluctuate toward optimum solution through the sin and cos operators ( Pham and Nguyen, 2023). The optimization method of population-related optimization algorithm split into two stages: exploration and exploitation.

The exploitation and exploration stages are as follows:

(5)

$X_{i}^{t + 1} = X_{i}^{t} + r_{1} * sin (r_{2}) * | r_{3} * P - X_{i}^{t} |,$

(6)

$X_{i}^{t + 1} = X_{i}^{t} + r_{1} * cos (r_{2}) * | r_{3} * P - X_{i}^{t} | .$

(7)

$X_{i}^{t + 1} = {\frac{X_{i}^{t} + r_{1} * sin (r_{2}) * | r_{3} * P - X_{i}^{t} |; r_{4} \leq 0..5}{X_{i}^{t} + r_{1} * cos (r_{2}) * | r_{3} * P - X_{i}^{t} |; r_{4} \geq 05},$

where r ₄ shows the random value in [0,1].

The key parameters are r ₁, r ₂, r ₃, and r ₄.

r ₁ determines various directions within space between the solution and the space outside the solution or the destination.
r ₂ provides distance to travel toward or outward the destination.
r ₃ shows random weight to emphasize weight of the exploitation ( r ₃ < 1) or exploration ( r ₃ > 1).

Effect of sin and cos functions in Eqs. (5) and (6). Eqs. (5) and (6) determine the spatial area between two solutions in the search range. But, the search space was expanded to high dimension. The cyclical model of sin and cos functions enables relocating around various solutions. This method guarantees exploitation of space determined among two solutions.

Furthermore, to avoid local optima, solution should be searched outside the space between the solution and the destination.

The range of sin and cos functions in Eqs. (5) to (6) is changed to balance the exploration and exploitation stages:

(8)

$r_{1} = a - t \frac{a}{T},$

In Eq. (8), t denotes the existing iteration, and T shows the maximum amount of iterations.

Anomaly detection module

For anomaly detection, the LSTM model is utilized. The key features of LSTM-NN involve the usage of gating modules to optimize the problem of RNN being incapable of transmitting prior data for a longer time, resolve the problems of gradient exploding and vanishing at longer sequence training, and accomplish long-term time series learning ( Hong and Tian, 2023). It includes forgetting, input, and output gates that could selectively learn input data and store relevant data in the storage unit. LSTM is often employed in different domains like speech recognition, unmanned driving, intelligent robots, etc.

(9)

$t a n h (x) = \frac{\sinh (x)}{\cosh (x)}$

(10)

$σ (x) = \frac{1}{1 + e^{- x}}$

(11)

$h_{t} = o_{t} \times t a n h (C)$

(12)

$C_{t} = f_{t} \times C_{t - 1} + i_{t} \times c_{t}$

(13)

$f_{t} = σ (w_{f} \cdot [h_{t - 1^{'}} X_{t}] + b_{f})$

(14)

$i_{t} = σ (w_{i} \cdot [h_{t - 1^{'}} X_{t}] + b_{i})$

(15)

$o_{t} = σ (w_{o} \cdot [h_{t - 1^{'}} X_{t}] + b_{0}),$

where C _t _–1 and C _t correspondingly signify the cell state at t–1 and t time, c _t denotes the cell status update value at t time, h _t _–1 and h _t correspondingly characterize the output at t−1 and t time, 0 _t, i _t and f _t signify output, input, and forgetting gates, correspondingly; X _t shows the input at t time, 0 _t, i _t and f _t denotes each matrix within [0,1] tanh and σ denotes activation functions. σ shows the sigmoid function that is S-type function could map the output of real number. The tanh function ranges within (–1,1). w ₀, w _j , and w _f correspondingly signify the weight matrices of the three gates; b ₀, b _i, and b _f signify the bias terms of the three gating functions.

Based on the abovementioned derived equations and LSTM structure diagram, the input of LSTM-NN at t time can be defined using the input X _t at the existing time and output h _t _–1 at t−1 time. The cell status update value c _t at t time can be defined using i _t input gate that can be used for selecting relevant data for updating unit state C _t . The forget gate f _t is used for controlling cell state at t−1 time and filter storage data.

The joint effect of f _t and i _t forgetting and input gates is to forget worthless data and update relevant data for obtaining unit state C _t at time t. The output gate 0 _t can be leveraged for controlling the cell state C _t at t time. Initially, the tanh function maps the value of the C _t cell state at (–1,1); then the input gate o _t analyzes the output h _t at t time.

EXPERIMENTAL EVALUATION

The proposed model is simulated using Python 3.6.5 tool on PC i5-8600k, GeForce 1050Ti 4GB, 16GB RAM, 250GB SSD, and 1TB HDD. The parameter settings are given as follows: learning rate: 0.01, dropout: 0.5, batch size: 5, epoch count: 50, and activation: ReLU.

In this section, the anomaly recognition in pedestrian walkways results is examined on the UCSD anomaly detection dataset. Figure 3 illustrates the sample images.

Figure 3:

Sample images.

In Table 1 and Figure 4, the recognition accuracy of the SCADL-ADPW approach is examined with current methods on the Test004 dataset. The outcomes guaranteed the improvement of the SCADL-ADPW technique on the detection process. S-40 sequence had an increased detection accuracy of 99.11%, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methods have reported decreased detection accuracy of 96, 92.51, 80.41, and 78.21% respectively. Likewise, on S-46 sequence, an increased detection accuracy of 99.85% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methodologies have reported decreased detection accuracy of 98.14, 96.60, 85.17, and 75.48% respectively. Similarly, on S-106 sequence, an increased detection accuracy of 99.23% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methodologies have reported decreased detection accuracy of 97.62, 100.55, 90.77, and 82.73% respectively. Eventually, S-137 sequence had an increased detection accuracy of 99.37%, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methods have reported decreased detection accuracy of 98.04, 99.49, 90.81, and 84.76%, respectively. Finally, on S-180 sequence, an increased detection accuracy of 99.10% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methods had decreased detection accuracy of 99.39, 99.42, 85.96, and 84.24%, respectively.

Table 1:

Recognition accuracy of the SCADL-ADPW approach with other systems on Test004 dataset.

Test004
Sequences	DLADT-PW	RS-CNN	Fast R-CNN	MDT	SCADL-ADPW
S-40	96.00	92.51	80.41	78.21	99.11
S-42	97.31	94.10	81.55	77.17	99.57
S-46	98.14	96.60	85.17	75.48	99.85
S-51	98.18	96.39	80.88	90.68	99.44
S-75	99.81	99.74	76.92	81.66	99.15
S-106	97.62	100.55	90.77	82.73	99.23
S-123	96.52	97.93	90.97	87.77	99.32
S-135	99.67	99.16	91.05	81.93	99.78
S-136	96.92	97.61	93.98	83.39	99.52
S-137	98.04	99.49	90.81	84.76	99.37
S-149	98.79	99.42	84.06	80.15	99.61
S-158	98.78	99.25	78.58	78.54	99.77
S-177	97.87	98.12	79.05	76.71	98.91
S-178	98.07	100.12	82.81	76.59	99.16
S-180	99.39	99.42	85.96	84.24	99.10
Average	98.07	98.03	84.86	81.33	99.24

Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

Figure 4:

Accu _y outcome of the SCADL-ADPW approach on Test004 dataset. Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

In Table 2 and Figure 5, the recognition accuracy of the SCADL-ADPW approach is inspected with current approaches on the Test004 dataset. The results guaranteed the improvement of the SCADL-ADPW technique on the detection process. On S-40 sequence, an increased detection accuracy of 96.78% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methods had decreased detection accuracy of 99.36, 97.44, 92.26, and 87.73%, respectively. Similarly, on S-46 sequence, an increased detection accuracy of 96.97% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methodologies have reported decreased detection accuracy of 99.89, 99.91, 95.76, and 89.31%, correspondingly. Likewise, on S-106 sequence, an increased detection accuracy of 96.35% was observed, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT methods had decreased detection accuracy of 89.42, 86.97, 85.98, and 85.46%, correspondingly. Eventually, S-137 sequence had an increased detection accuracy of 96.80%, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT algorithms had decreased detection accuracy of 88.96, 85.52, 84.48, and 83%, respectively. Finally, S-180 sequence had an increased detection accuracy of 96.47%, while the DLADT-PW, RS-CNN, Fast R-CNN, and MDT systems had decreased detection accuracy of 90.05, 86.15, 83.47, and 83.78%, correspondingly.

Table 2:

Recognition accuracy of the SCADL-ADPW approach with other systems on Test007 dataset.

Test007
Sequence	DLADT-PW	RS-CNN	Fast R-CNN	MDT	SCADL-ADPW
S-40	99.36	97.44	92.26	87.73	96.78
S-42	99.15	98.77	95.21	88.18	96.14
S-46	99.89	99.91	95.76	89.31	96.97
S-51	99.02	98.32	94.59	85.60	96.78
S-75	96.59	96.01	91.31	87.57	96.32
S-106	89.42	86.97	85.98	85.46	96.35
S-123	99.16	99.44	95.08	91.38	96.65
S-135	99.13	98.67	97.13	84.64	96.18
S-136	86.85	86.68	82.49	77.64	96.77
S-137	88.96	85.52	84.48	83.00	96.80
S-149	76.64	70.81	58.48	56.15	96.93
S-158	96.81	90.84	85.80	84.07	96.15
S-177	82.82	76.23	72.65	64.98	96.95
S-178	81.53	75.88	66.15	64.45	96.65
S-180	90.05	86.15	83.47	83.78	96.47
Average	92.36	89.84	85.39	80.93	96.59

Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

Figure 5:

Accu _y outcome of the SCADL-ADPW approach on Test007 dataset. Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

The overall anomaly detection results of the SCADL-ADPW technique have been compared with other DL models in Table 3 and Figure 6 ( Pustokhina et al., 2021). The outcomes notify the improved average accuracy results over other models. On the Test004 dataset, the SCADL-ADPW technique achieves improved average accu _y of 99.24% while the RS-CNN, Fast R-CNN, MDT, and MPPCA techniques attain decreased average accu _y of 98.07, 98.03, 84.86, and 81.33%, respectively. Besides, on the Test007 dataset, the SCADL-ADPW method achieves improved average accu _y of 96.59%, while the RS-CNN, Fast R-CNN, MDT, and MPPCA approaches decreased average accu _y of 92.36, 89.84, 85.39, and 80.93%, respectively.

Table 3:

Average accuracy of the SCADL-ADPW approach with other systems on two datasets.

Techniques	RS-CNN	Fast R-CNN	MDT	MPPCA	SCADL-ADPW
Test004	98.07	98.03	84.86	81.33	99.24
Test007	92.36	89.84	85.39	80.93	96.59

Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

Figure 6:

Average accu _y outcome of the SCADL-ADPW approach on two datasets. Abbreviation: SCADL-ADPW, sine cosine algorithm with deep learning-based anomaly detection in pedestrian walkways.

These results ensured the effectual anomaly detection performance of the SCADL-ADPW technique.

CONCLUSION

In this manuscript, we have designed a new SCADL-ADPW technique for anomaly detection purposes. The proposed SCADL-ADPW technique identifies the occurrence of anomalies in the pedestrian walkways on remote sensing images. The SCADL-ADPW techniques focus on the identification and classification of anomalies, i.e. vehicles in the pedestrian walkways of VIP. To accomplish this, the SCADL-ADPW technique uses the VGG-16 model for feature vector generation. In addition, the SCA approach is designed for the optimal hyperparameter tuning process. For anomaly detection, the LSTM model can be exploited. The experimental results of the SCADL-ADPW technique are studied on the UCSD anomaly detection dataset. The comparative outcomes stated the improved anomaly detection results of the SCADL-ADPW technique.

[1] Abdullah F, Jalal A. 2023. Semantic segmentation based crowd tracking and anomaly detection via neuro-fuzzy classifier in smart surveillance system. Arab. J. Sci. Eng. Vol. 48(2):2173–2190

[2] Aldayri A, Albattah W. 2022. Taxonomy of anomaly detection techniques in crowd scenes. Sensors. Vol. 22(16):6080

[3] Avola D, Cannistraci I, Cascio M, Cinque L, Diko A, Fagioli A, et al.. 2022. A novel GAN-based anomaly detection and localization method for aerial video surveillance at low altitude. Remote Sens. Vol. 14(16):4110

[4] Bhattacharya A, Asari VK. 2021. Wearable walking aid system to assist visually impaired persons to navigate sidewalks2021 IEEE Applied Imagery Pattern Recognition Workshop (AIPR); IEEE. Washington, DC, USA. p. 1–7

[5] Chang Y, Tu Z, Xie W, Luo B, Zhang S, Sui H, et al.. 2022. Video anomaly detection with spatio-temporal dissociation. Pattern Recognit. Vol. 122:108213

[6] Deepa N, Chokkalingam SP. 2022. Optimization of VGG16 utilizing the arithmetic optimization algorithm for early detection of Alzheimer’s disease. Biomed. Signal Process. Control. Vol. 74:103455

[7] Dey A, Mohammad F, Ahmed S, Sharif R, Saif AS. 2019. Anomaly detection in crowded scene by pedestrians behaviour extraction using long short term method: a comprehensive study. Int. J. Educ. Manag. Eng. Vol. 9(1):51

[8] Hong J, Tian W. 2023. Prediction in catalytic cracking process based on swarm intelligence algorithm optimization of LSTM. Processes. Vol. 11(5):1454

[9] Hsieh I, Cheng HC, Ke HH, Chen HC, Wang WJ. 2021. A CNN-based wearable assistive system for visually impaired people walking outdoors. Appl. Sci. Vol. 11(21):10026

[10] Khan AA, Nauman MA, Shoaib M, Jahangir R, Alroobaea R, Alsafyani M, et al.. 2022. Crowd anomaly detection in video frames using fine-tuned AlexNet model. Electronics. Vol. 11(19):3105

[11] Le HT, Phung SL, Bouzerdoum A. 2022. Bayesian Gabor network with uncertainty estimation for pedestrian lane detection in assistive navigation. IEEE Trans. Circuits Syst. Video Technol. Vol. 32(8):5331–5345

[12] Lin S, Cheng R, Wang K, Yang K. 2018. Visual localizer: outdoor localization based on convnet descriptor and global optimization for visually impaired pedestrians. Sensors. Vol. 18(8):2476

[13] Mansour RF, Escorcia-Gutierrez J, Gamarra M, Villanueva JA, Leal N. 2021. Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model. Image Vis. Comput. Vol. 112:104229

[14] Pham VHS, Nguyen VN. 2023. Cement transport vehicle routing with a hybrid sine cosine optimization algorithm. Adv. Civ. Eng. 2023

[15] Pustokhina IV, Pustokhin DA, Vaiyapuri T, Gupta D, Kumar S, Shankar K. 2021. An automated deep learning based anomaly detection in pedestrian walkways for vulnerable road users safety. Saf. Sci. Vol. 142:105356

[16] Sabuhi M, Zhou M, Bezemer CP, Musilek P. 2021. Applications of generative adversarial networks in anomaly detection: a systematic literature review. IEEE Access. Vol. 9:161003–161029

[17] Shimakawa M, Taguchi I, Okuma C, Kiyota K. 2019. Smartphone application program of obstacle detection for visually impaired people. ICIC Express Lett. Part B Appl. Vol. 10(3):219–226

[18] Yang K, Cheng R, Bergasa LM, Romera E, Wang K, Long N. 2018. Intersection perception through real-time semantic segmentation to assist navigation of visually impaired pedestrians2018 IEEE International Conference on Robotics and Biomimetics (ROBIO); December 12–15, 2018; IEEE. Kuala Lumpur, Malaysia. p. 1034–1039

Journal of Disability Research

Assisting Visually Impaired People Using Deep Learning-based Anomaly Detection in Pedestrian Walkways for Intelligent Transportation Systems on Remote Sensing Images

Abstract

Main article text

INTRODUCTION

RELATED WORKS

PROPOSED ANOMALY DETECTION MODEL

Feature extraction module

Convolutional layer

Pooling layer

Fully connected and output layer

Hyperparameter tuning module

Anomaly detection module

EXPERIMENTAL EVALUATION

CONCLUSION

CONFLICTS OF INTEREST

REFERENCES

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Comments

Comment on this article