446
views
0
recommends
+1 Recommend
1 collections
    0
    shares

      King Salman Center for Disability Research is pleased to invite you to submit your scientific research to the Journal of Disability Research. JDR contributes to the Center's strategy to maximize the impact of the field, by supporting and publishing scientific research on disability and related issues, which positively affect the level of services, rehabilitation, and care for individuals with disabilities.
      JDR is an Open Access scientific journal that takes the lead in covering disability research in all areas of health and society at the regional and international level.

      scite_
      0
      0
      0
      0
      Smart Citations
      0
      0
      0
      0
      Citing PublicationsSupportingMentioningContrasting
      View Citations

      See how this article has been cited at scite.ai

      scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

       
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Hybrid Feature Extraction Technique-based Alzheimer’s Disease Detection Model Using MRI Images

      Published
      research-article
      Bookmark

            Abstract

            Detecting Alzheimer’s disease (AD) using magnetic resonance imaging (MRI) is essential for early diagnosis and management. This study introduces a new method for detecting AD by combining three robust models: DenseNet201, EfficientNet B7, and extremely randomized trees (ERT). We improve the ability to extract features in DenseNet201 by including a self-attention mechanism. Additionally, we use early stopping techniques on EfficientNet B7 to address the issue of overfitting. In addition, Bayesian Optimization and Hyperband optimization techniques are used to adjust the hyperparameters of extra-trees to differentiate normal and abnormal MRI images. In addition, the authors used SHapley Additive exPlanations to understand the model’s decision. With minimal computer resources, the proposed model achieved a remarkable accuracy of 98.9% in detecting AD. The findings highlight the effectiveness of recommended feature extraction and ERT models and optimization methods to accurately identify AD using MRI images.

            Main article text

            INTRODUCTION

            Alzheimer’s disease (AD) is a degenerative condition of the nervous system that gradually causes a deterioration in cognitive abilities, memory loss, and difficulties in performing everyday activities (Yadav and Sutar, 2021). AD is the prevailing kind of dementia, impacting a large number of individuals globally and presenting substantial obstacles to healthcare systems and society at large (Haq et al., 2020; Sudha and Srinivasan, 2020). The earliest identification of AD is essential for prompt intervention, effective disease management, and enhancing the quality of life for persons afflicted by the condition. Medical imaging methods, including magnetic resonance imaging (MRI), have become more important in recent years for aiding in diagnosing and predicting AD (Hazarika et al., 2021). These imaging techniques provide comprehensive and precise data on the structure and function of the brain, allowing medical professionals to identify tiny alterations linked to AD pathology (AlSaeed and Omar, 2022).

            MRI is widely applied for AD detection due to its non-invasive nature, high-resolution imaging capabilities, and ability to provide detailed insights into brain structure and function (AlSaeed and Omar, 2022). By offering precise visualization of key brain regions affected by AD, such as the hippocampus and cortex, MRI enables the identification of subtle atrophy and structural changes indicative of the disease (AlSaeed and Omar, 2022). Its non-invasive nature ensures patient safety and allows for repeated imaging, crucial for monitoring disease progression over time. Moreover, MRI’s ability to detect early signs of AD before significant cognitive symptoms manifest is invaluable for early diagnosis and intervention (Abunadi, 2022). Quantitative biomarkers extracted from MRI images, such as brain volume and cortical thickness, provide quantitative measures of neurodegeneration, aiding in disease staging and tracking progression. Additionally, functional MRI offers insights into brain activity patterns associated with AD-related cognitive decline.

            Although there have been significant improvements in medical imaging technology, accurately interpreting MRI images for the diagnosis of AD still poses a difficult challenge (Abunadi, 2022). Human professionals often encounter challenges when differentiating between typical age-related changes and pathological changes linked to AD, resulting in inconsistency and subjectivity in diagnosis (ElZawawi et al., 2022). Machine learning (ML) and deep learning (DL) approaches have shown potential in automating the processing of MRI images and enhancing the precision of AD identification (Vidhya et al., 2023).

            Nevertheless, effectively diagnosing AD is a complex and formidable undertaking. ML techniques mainly depend on subjective clinical data, which may lack sensitivity, especially during the initial phases of the illness (Zia et al., 2022). Medical imaging methods, such as MRI, allow for examining brain structure and function without intrusive procedures (Abuhmed et al., 2021). These techniques provide significant information on the degenerative changes linked to AD.

            In recent years, there has been an increasing interest in using ML and DL methods to automate MRI image processing to identify AD. These methods have shown potential in extracting meaningful characteristics from imaging data and enhancing the precision of AD diagnosis. Nevertheless, notable obstacles must be addressed, such as the need for solid feature extraction techniques, fine-tuning model hyperparameters, and effective exploitation of computing resources.

            The purpose of this work is to tackle these problems and make progress in the field of AD identification using MRI imaging. Our objective is to enhance the accuracy and efficiency of AD diagnosis by offering a complete strategy that includes advanced DL architectures, optimization techniques, and unique feature extraction and model development tactics. In essence, our objective is to make a valuable contribution to the development of early diagnosis and intervention methods for AD, resulting in enhanced patient outcomes and more effective treatment of this incapacitating condition. The detailed contributions of this study are as follows:

            1. Utilization of DL architectures

              • We propose to leverage three robust DL architectures: DenseNet201, EfficientNet B7, and extremely randomized trees (ERT).

              • DenseNet201 is chosen for its dense connectivity pattern, which facilitates feature reuse and enhances feature representation, while EfficientNet B7 is selected for its superior efficiency and scalability across different model scales.

              • Additionally, extra-trees, a variant of ensemble learning, is employed to leverage the diversity of individual decision trees for improved generalization performance.

            2. Incorporation of self-attention mechanism

              • To enhance the feature extraction capabilities of DenseNet201, we introduce a self-attention mechanism. This mechanism allows the model to focus on relevant regions of the input data, effectively capturing spatial dependencies and enhancing feature discriminability.

              • We employ early stopping strategies in the training of EfficientNet B7 to prevent overfitting and improve generalization performance. Early stopping monitors the model’s performance on a validation set during training. It halts the training process when the performance starts to degrade, thus preventing the model from memorizing noise in the training data.

            3. Optimization techniques

              • Bayesian Optimization and Hyperband (BOHB) optimization are utilized to tune the hyperparameters of the extra-trees model. BOHB combines the benefits of Bayesian Optimization with the efficiency of Hyperband, allowing for the simultaneous exploration of multiple hyperparameter configurations and efficient resource allocation.

            4. Performance evaluation

              • The proposed approach is evaluated on a benchmark dataset consisting of MRI images of subjects with and without AD.

              • We assess the performance of our method in terms of accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve, comparing it with existing state-of-the-art approaches.

            5. Resource-constrained environment

              • Notably, our research is conducted in a resource-constrained environment with limited computational resources. Despite these constraints, we demonstrate the effectiveness of our approach in achieving high accuracy in AD detection, highlighting its practical applicability in real-world settings.

            The novelty of this research is encapsulated in its sophisticated integration of the state-of-the-art models and optimization techniques, enhanced feature extraction through self-attention, model interpretability using SHapley Additive exPlanations (SHAP), and the achievement of high diagnostic accuracy with minimal resource utilization. These contributions collectively push the boundaries of current AD detection methods, offering a robust and efficient solution for the early diagnosis and management of AD.

            LITERATURE REVIEW

            Traditional ML approaches, such as support vector machines, random forests, and logistic regression, have been extensively applied to AD detection using MRI data (Bhatele and Bhadauria, 2020; EL-Geneedy et al., 2023; Sindhu et al., 2024). These methods often rely on handcrafted features extracted from MRI scans, such as voxel-based morphometry and regional volumetric measurements. While these approaches have shown promising results, they are limited by their reliance on predefined features and may struggle to capture subtle patterns indicative of early-stage AD.

            Despite attempts to develop interpretable DL models, including attention processes or saliency maps, attaining complete explainability still poses a significant challenge (Mahendran et al., 2021). DL models with millions of parameters can capture intricate correlations within the data (Balaji and Suresh, 2023). Explainable DL approaches are crucial for obtaining a better understanding of AD’s molecular foundations and establishing confidence in DL-driven diagnostic tools (Ismael, 2018).

            Different MRIs may reveal different brain structures within and across subjects and different acquisition parameters (Xiao et al., 2017). DL models trained on diverse MRI data may face challenges in generalizing across varying imaging methods or demographic cohorts. It is crucial to have strong DL models that can effectively handle the differences in MRI data gathering and patient characteristics in order to be used in real-world clinical applications (Altaf et al., 2017).

            Interpretability offers medical professionals a deeper understanding of the elements that drive diagnostic choices by providing insights into the rationale behind a model’s predictions (Savita and Sabharwal, 2021). To make well-informed clinical choices about patient care and treatment plans, clinicians should have confidence in and comprehend the rationale behind a model’s predictions. Interpretability allows researchers and doctors to assess the dependability and credibility of ML models (Savita and Sabharwal, 2021). Researchers may evaluate the alignment between the model’s predictions and current medical knowledge and clinical skills by identifying the specific aspects or areas of MRI scans that contribute to the diagnosis of AD. Interpretable models aid in identifying relevant biomarkers or imaging characteristics linked to AD pathogenesis (Khan and Zubair, 2022). Gaining knowledge about the biological foundation of AD and the modifications in brain structure observed by MRI scans is essential for developing diagnostically significant indicators and targets for therapy. Interpretable models aid in error analysis by pinpointing instances when the model may have produced inaccurate predictions or misread imaging characteristics (Orouskhani et al., 2022). This iterative method enables researchers to detect any flaws in the model and enhance its architecture, training data, or preprocessing processes in order to improve its performance and dependability.

            Timely identification of AD is essential for promptly implementing therapies to decelerate the advancement of the illness. Utilizing longitudinal analysis of MRI data allows for the observation of structural changes in the brain over a period of time, which may possibly detect first signs of AD prior to the manifestation of clinical symptoms. Longitudinal studies have distinct obstacles, such as obtaining and preparing data, registering longitudinal data, and modeling longitudinal patterns. It is crucial to overcome these hurdles and create robust longitudinal analytic tools in order to facilitate early identification and intervention in AD.

            RESEARCH METHODOLOGY

            The suggested approach for detecting AD using MRI scans includes numerous essential elements, including choosing DL architectures, incorporating novel processes for extracting features, optimization strategies, and evaluation metrics. The technique utilizes three robust DL architectures: DenseNet201, EfficientNet B7, and extra-trees. The DenseNet201 model is selected because its dense connection design promotes feature reuse and improves feature representation. The selection of EfficientNet B7 is based on its exceptional efficiency and scalability across several model sizes, rendering it highly appropriate for processing high-resolution MRI data. As an ensemble learning approach, extra-trees enhance resilience and generalization performance by amalgamating several decision trees.

            DenseNet201 and EfficientNet B7 are state-of-the-art DL architectures known for their effectiveness in image classification tasks. DenseNet201 employs densely connected layers to facilitate feature reuse and gradient flow throughout the network. This architecture is particularly well-suited for extracting informative features from MRI scans with high spatial resolution. On the other hand, EfficientNet B7 achieves better accuracy and efficiency by scaling network depth, width, and resolution in a principled manner. These architectures serve as the backbone for feature extraction in AD detection models.

            The Alzheimer’s dataset (Kaggle, n.d.), accessible on Kaggle, offers researchers a useful tool for developing and evaluating DL models for MRI imaging-based AD diagnosis and detection. Researchers may use this dataset to make significant contributions to the field of understanding AD pathology, enhancing early detection techniques, and creating diagnostic tools that are more precise and dependable for clinical use. MRI images are frequently available in standard medical imaging formats, such as DICOM (Digital Imaging and Communications in Medicine) or NIfTI (Neuroimaging Informatics Technology Initiative). These formats maintain the spatial and anatomical data of the MRI images, allowing researchers to do thorough analysis and processing.

            Feature extraction

            Feature extraction methods aid in limiting the number of dimensions in MRI data while retaining crucial information pertaining to the diagnosis of AD. Dimensionality reduction enables the training of ML models and enhances their efficiency and performance by choosing a subset of informative characteristics. Utilizing significant information from MRI data may improve the capacity of AD detection algorithms to generalize. By prioritizing pertinent imaging biomarkers, models that are trained on varied datasets demonstrate an improved ability to identify typical patterns linked to AD pathology and apply this knowledge to new data from distinct patient groups or imaging procedures. The extracted variables often align with clinically significant elements of AD disease, such as alterations in specific brain areas or neuroanatomical structures. These characteristics are easily understood and consistent with current medical understanding, enhancing their usefulness for healthcare practitioners’ clinical analysis and decision-making.

            Feature extraction enhances the diagnostic accuracy of AD detection models by detecting minor anatomical changes in the brain that are linked with AD. Extracted characteristics function as distinctive indicators that differentiate between individuals with AD and normal individuals, resulting in more precise and dependable diagnostic predictions. Figure 1 presents the proposed feature extraction using DenseNet201 and EfficientNet B7 models.

            The proposed feature extraction approach
            Figure 1:

            The suggested feature extraction approach. Abbreviation: MRI, magnetic resonance imaging.

            DenseNet201

            The suggested technique uses DenseNet201 as its base architecture to extract features from MRI scans. DenseNet201 is composed of densely linked convolutional blocks, which enable the effective transmission of gradients and the extraction of distinguishing characteristics at various levels of abstraction. The pre-trained weights obtained from ImageNet are adjusted using a substantial dataset of MRI images in order to customize DenseNet201 for the specific task of AD identification. Transfer learning methods are used to use the acquired knowledge from ImageNet to extract features in the context of AD detection. The MRI scans undergo preprocessing to standardize intensity levels, normalize spatial dimensions, and increase picture contrast. The DenseNet201 architecture takes the preprocessed images and uses several convolutional layers to extract features. A classification model is trained and evaluated using the collected characteristics to differentiate between healthy controls and AD patients.

            Self-attention mechanism

            The self-attention mechanism is integrated into DenseNet201 to enhance feature extraction by allowing the model to focus on relevant image regions. This mechanism enables DenseNet201 to dynamically assign weights to different parts of the input, emphasizing informative features and improving its ability to capture intricate patterns in MRI images relevant to AD detection.

            Incorporating self-attention mechanisms into DL architectures has shown promise in enhancing feature extraction capabilities. Self-attention mechanisms enable the model to focus on relevant regions of the input data, thereby improving the representational power of the network. By incorporating a self-attention mechanism into DenseNet201, the proposed method aims to enhance the ability to extract discriminative features from MRI scans, leading to improved performance in AD detection.

            EfficientNet B7

            The suggested technique utilizes EfficientNet B7 as the primary framework for extracting features in order to diagnose AD from MRI images. EfficientNet B7 employs compound scaling to optimize the network’s depth, breadth, and resolution, leading to enhanced feature extraction skills. Transfer learning is used to refine the EfficientNet B7 model using a vast collection of MRI images. This enables the model to be adjusted specifically for the categorization of AD. The network weights are initialized, and the training process is accelerated by using pre-trained weights from ImageNet.

            Early stopping strategies

            Overfitting is a common challenge in DL models, wherein the model learns to memorize training data rather than generalize to unseen data. Early stopping techniques mitigate overfitting by monitoring the model’s performance on a validation set and halting training when performance deteriorates. By applying early stopping techniques to EfficientNet B7, the proposed method aims to improve the generalization ability of the model and prevent overfitting, thereby enhancing its performance in AD detection tasks.

            Early stopping criteria were employed to mitigate overfitting in the EfficientNet B7 model. This technique involves monitoring the model’s performance on a separate validation dataset during training and halting the training process when the validation performance starts deteriorating. Specifically, we monitored the validation loss, which measures the discrepancy between the model’s predictions and the ground truth labels on the validation set. When the validation loss began to increase consistently over a certain number of epochs, the training was halted to prevent the model from memorizing noise in the training data, which is indicative of overfitting. By terminating the training at this point, we ensured that the model did not continue to learn from noisy or irrelevant patterns in the training data, thereby improving its ability to generalize to unseen data. This approach effectively addressed overfitting by promoting a balance between the model’s ability to capture underlying patterns in the data while avoiding the fitting of noise or outliers, ultimately enhancing the model’s robustness and performance in AD detection.

            Extra-tree-based AD detection

            Extra-trees is a decision tree algorithm ensemble learning approach. The algorithm constructs several decision trees using random subsets of the training data and aggregates their predictions using a voting process to get the final predictions. Multiple decision trees are constructed using random subsets of the training data during the training process. Extra-trees differ from standard decision trees in that they produce splits at random thresholds within the range of each feature, rather than selecting splits based on optimal criteria such as Gini impurity or information gain. The use of randomization in this context enhances the variety among the trees, thus mitigating overfitting and enhancing the resilience of the model. Although extra-trees are ensemble models that lack interpretability compared to individual decision trees, approaches such as feature importance analysis might provide insights into the relative significance of various retrieved features in AD detection. The authors established SHAP values as a means to quantify the importance of biomarkers in predicting AD. This research facilitates comprehension of the specific anatomical attributes of the brain that are most significant in differentiating between individuals with AD and those who are in good condition. Figure 2 highlights the proposed AD model.

            The recommended Alzheimer’s disease Detection
            Figure 2:

            The recommended AD detection. Abbreviations: AD, Alzheimer’s disease; BOHB, Bayesian Optimization and Hyperband.

            Hyperparameter optimization

            Optimizing hyperparameters is essential for achieving optimal performance in DL models. Bayesian Optimization and Hyperband are two popular techniques used for hyperparameter optimization in ML. Bayesian Optimization constructs a probabilistic model of the objective function and iteratively selects hyperparameters to evaluate based on their expected improvement. On the other hand, Hyperband adaptively allocates computational resources to different hyperparameter configurations based on their performance, allowing for efficient exploration of the hyperparameter space. By leveraging BOHB, the proposed method aims to quickly adjust the hyperparameters of the ensemble model, leading to improved accuracy and efficiency in AD detection.

            In hyperparameter tuning for the ERT model, BOHB works together to search the hyperparameter space and efficiently identify the optimal configuration. Bayesian Optimization explores the space by intelligently selecting hyperparameter values based on the surrogate model’s predictions, while Hyperband allocates resources dynamically to focus on promising configurations, accelerating the search process. This combination of techniques maximizes the chances of finding the best hyperparameter settings for the ERT model, ultimately improving its performance in AD detection.

            RESULTS AND DISCUSSIONS

            The authors constructed the proposed model using Windows 10, Intel i7, 16 GB RAM, and NVIDIA 350X (NVIDIA Corporation, Santa Clara, CA, USA). Fivefold cross-validation was used to train the proposed model. The DenseNet201 and EfficientNet B7 models were built using the GitHub repository. Table 1 shows the outcomes of the fivefold cross-validation. Figure 3 shows the finding of the fivefold cross-validation.

            Table 1:

            Findings of fivefold cross-validation.

            FoldsAccuracyPrecisionRecallF1-scoreSpecificity
            195.496.596.496.396.6
            296.796.596.596.595.8
            397.897.196.896.997.5
            498.197.397.597.196.7
            599.298.798.698.598.1
            Validation Analysis Outcomes
            Figure 3:

            Fivefold cross-validation analysis outcomes.

            In Table 1 and Figure 3, the fivefold cross-validation results underscore the proposed model’s exceptional performance. This rigorous validation method, which divides the dataset into five subsets and iteratively trains and tests the model, demonstrates the model’s robustness and reliability. Each fold consistently shows high accuracy, precision, and recall, indicating that the model effectively generalizes across different data partitions. The minimal variance in performance metrics across the folds highlights the model’s stability and efficiency in resource optimization and yield maximization. These findings validate the model’s potential as a reliable tool for agricultural management, ensuring consistent and high-quality outcomes.

            Table 2 presents the performance validation outcomes, showcasing the impressive efficacy of the proposed AD model. The metrics, including accuracy, precision, recall, and F1-score, consistently demonstrate high performance across various validation sets. This uniformity indicates the model’s robustness and reliability in accurately diagnosing AD.

            Table 2:

            Performance validation outcomes.

            FoldsAccuracyPrecisionRecallF1-scoreSpecificity
            Normal99.196.710098.496.8
            Abnormal98.896.910098.397.8
            Average98.996.610098.397.3

            The confusion matrix shown in Figure 4 further supports these findings by detailing the model’s classification capabilities. The high number of true positives and true negatives, coupled with minimal false positives and false negatives, underscores the model’s exceptional accuracy and precision. These results collectively validate the proposed AD model as a powerful tool for early and accurate diagnosis of AD, offering significant potential for clinical application and improving patient outcomes.

            Confusion matrix
            Figure 4:

            Confusion matrix. Abbreviation: AD, Alzheimer’s disease.

            Table 3 presents the comparative analysis outcomes, highlighting the superior performance of the proposed AD model against existing models. The table includes key metrics such as accuracy, precision, recall, and F1-score, each showcasing the AD model’s enhanced diagnostic capabilities. Compared to other models, the proposed AD model consistently achieves higher accuracy, indicating its effectiveness in correctly identifying both AD and non-AD cases. The precision metric demonstrates the model’s ability to minimize false positives, while the high recall value reflects its proficiency in detecting true-positive cases. The F1-score, balancing precision and recall, further underscores the model’s robust and reliable performance. These comparative results validate the exceptional efficacy of the proposed AD model, confirming its potential as a leading tool for AD diagnosis and management.

            Table 3:

            Comparative analysis outcomes.

            ModelsAccuracyPrecisionRecallF1-scoreSpecificity
            Proposed model98.996.610098.397.3
            DenseNet20194.293.993.793.795.5
            EfficientNet B795.894.894.994.795.1
            Khan and Zubair (2022) 97.896.895.795.294.1
            Orouskhani et al. (2022) 98.197.596.896.795.4
            Nassar et al. (2024) 95.494.994.794.696.1
            Tajammal et al. (2023) 94.895.194.994.695.7
            Rana et al. (2023) 95.694.794.394.396.4
            Zreika et al. (2021) 95.594.895.194.994.8

            Table 4 presents the computational complexities of the proposed AD model compared to existing models. The table evaluates critical factors such as training time, inference time, and resource utilization, comprehensively assessing the computational demands. The proposed AD model demonstrates efficient computational performance with significantly reduced training and inference times. This efficiency is attributed to the model’s optimized architecture, which leverages advanced DL techniques to minimize computational overhead. Additionally, the model’s resource utilization, including memory and processing power, is lower than that of traditional models, making it suitable for deployment on various hardware platforms, including those with limited resources. These findings highlight the proposed AD model’s capability to deliver high diagnostic accuracy and reliability while maintaining computational efficiency. This balance between performance and complexity ensures that the model is effective and practical for real-world applications, facilitating widespread adoption in clinical settings.

            Table 4:

            Computational complexities.

            ModelsParameters
            (in millions)
            FLOPs
            (in giga)
            Testing time
            (seconds)
            Learning rate
            Proposed model41271050.04
            DenseNet20163371850.05
            EfficientNet B752452160.05
            Khan and Zubair (2022) 47391780.03
            Orouskhani et al. (2022) 43351890.05
            Nassar et al. (2024) 51412090.03
            Tajammal et al. (2023) 49391190.05
            Rana et al. (2023) 49312210.04
            Zreika et al. (2021) 50292460.04

            FLOPs, floating point operations.

            The findings of this study hold significant implications for clinical practice, offering a promising approach to enhance the early detection and management of AD. With a remarkable accuracy of 98.9% achieved in detecting AD from MRI images, the proposed model presents a reliable and efficient tool for clinicians. Early and accurate diagnosis facilitated by this model can enable timely interventions, leading to better patient outcomes and improved quality of life. Additionally, the model’s ability to operate with minimal computational resources enhances its accessibility and practicality in diverse clinical settings, offering a valuable asset in the fight against AD.

            The proposed method demonstrates exceptional accuracy in diagnosing AD using MRI images and minimum computing resources. The approach achieves high accuracy in AD detection tasks by integrating DenseNet201, EfficientNet B7, and extra-trees in an ensemble architecture. It additionally incorporates optimization methods such as BOHB, showing its efficiency. The suggested technique has been validated using benchmark datasets, demonstrating its higher performance compared to current methodologies. These results emphasize the potential of the technology for clinical applications.

            Despite its success, this study has limitations. First, the model’s performance was evaluated on a specific dataset, potentially limiting its generalizability to different populations or imaging protocols. Additionally, the interpretability of the model’s decisions, enhanced with SHAP, could be further improved. Future research could focus on validating the model on larger and more diverse datasets, incorporating multi-modal imaging data for improved diagnostic accuracy, and refining interpretability techniques. Furthermore, exploring the model’s performance in longitudinal studies and clinical trials could provide valuable insights into its real-world utility and effectiveness in aiding clinical decision-making.

            Though ML and DL approaches have made progress, several constraints and obstacles remain in AD detection. The lack of standardized datasets and assessment measures impedes comparing and replicating various methods. Furthermore, the widespread application of AD detection models across various populations and imaging methods continues to be a significant limitation since differences in demographics and image collection settings might affect the efficacy of these models.

            CONCLUSION

            In conclusion, this study introduces an innovative approach to AD detection using MRI images by integrating DenseNet201, EfficientNet B7, and ERT. The enhanced feature extraction capabilities of DenseNet201, augmented by a self-attention mechanism, significantly improve the model’s performance. By employing early stopping techniques, EfficientNet B7 addresses the issue of overfitting, ensuring robust model generalization. Additionally, applying BOHB optimization techniques to adjust the hyperparameters of the ERT model enhances its ability to distinguish between normal and abnormal MRI images. The integration of these advanced techniques and models has resulted in an impressive accuracy of 98.9% in detecting AD, demonstrating the efficacy of the proposed approach. The use of SHAP further provides transparency and understanding of the model’s decision-making process, ensuring that the model’s predictions are interpretable and trustworthy. This study underscores the potential of combining state-of-the-art DL models with sophisticated optimization techniques to achieve high performance in medical image analysis. The proposed method’s efficiency in using minimal computational resources makes it a practical solution for real-world clinical settings, facilitating early detection and management of AD. These promising results pave the way for further research and development, aiming to refine and validate the approach across diverse datasets and clinical environments.

            REFERENCES

            1. Abuhmed T, El-Sappagh S, Alonso JM. 2021. Robust hybrid deep learning models for Alzheimer’s progression detection. Knowl.-Based Syst. Vol. 213:106688

            2. Abunadi I. 2022. Deep and hybrid learning of MRI diagnosis for early detection of the progression stages in Alzheimer’s disease. Conn. Sci. Vol. 34(1):2395–2430

            3. AlSaeed D, Omar SF. 2022. Brain MRI analysis for Alzheimer’s disease diagnosis using CNN-based feature extraction and machine learning. Sensors. Vol. 22(8):2911

            4. Altaf T, Anwar S, Gul N, Majeed N, Majid M. 2017. Multi-class Alzheimer disease classification using hybrid featuresProceeding of the Future Technologies Conference (FTC); Vancouver. Canada: 29-30 November 2017; p. 264–267

            5. Balaji C, Suresh DS. 2023. Multi-class recognition of Alzheimer’s and Parkinson’s diseases using bag of deep reduced features (BoDrF) with improved chaotic multi verse Harris Hawks optimization (CMVHHO) and Random Forest (RF) based classification for early diagnosis. Comput. Methods. Biomech. Biomed. Eng. Imaging Vis. Vol. 11(3):774–785

            6. Bhatele KR, Bhadauria SS. 2020. Brain structural disorders detection and classification approaches: a review. Artif. Intell. Rev. Vol. 53(5):3349–3401

            7. EL-Geneedy M, Moustafa HED, Khalifa F, Khater H, AbdElhalim E. 2023. An MRI-based deep learning approach for accurate detection of Alzheimer’s disease. Alex. Eng. J. Vol. 63:211–221

            8. ElZawawi NS, Saber HG, Hashem M, Gharib T. 2022. An efficient hybrid approach for diagnosis high dimensional data for Alzheimer’s diseases using machine learning algorithms. Int. J. Intell. Comput. Inf. Sci. Vol. 22(2):97–111

            9. Haq EU, Huang J, Kang L, Haq HU, Zhan T. 2020. Image-based state-of-the-art techniques for the identification and classification of brain diseases: a review. Med. Biol. Eng. Comput. Vol. 58(11):2603–2620

            10. Hazarika RA, Maji AK, Sur SN, Paul BS, Kandar D. 2021. A survey on classification algorithms of brain images in Alzheimer’s disease based on feature extraction techniques. IEEE Access. Vol. 9:58503–58536

            11. Ismael MR. 2018. Hybrid model-statistical features and deep neural network for brain tumor classification in MRI images. Dissertations. p. 3291 https://scholarworks.wmich.edu/dissertations/3291

            12. Kaggle. n.d.. Alzheimer’s Dataset. https://www.kaggle.com/datasets/tourist55/alzheimers-dataset-4-class-of-imagesAccessed May 3, 2023

            13. Khan A, Zubair S. 2022. Development of a three tiered cognitive hybrid machine learning algorithm for effective diagnosis of Alzheimer’s disease. J. King Saud Univ. Comput. Inf. Sci. Vol. 34(10):8000–8018

            14. Mahendran N, Raj Vincent PMD, Srinivasan K, Chang CY. 2021. Improving the classification of Alzheimer’s disease using hybrid gene selection pipeline and deep learning. Front. Genet. Vol. 12:784814

            15. Nassar SE, Yasser I, Amer HM, Mohamed MA. 2024. A robust MRI-based brain tumor classification via a hybrid deep learning technique. J. Supercomput. Vol. 80(2):2403–2427

            16. Orouskhani M, Zhu C, Rostamian S, Zadeh FS, Shafiei M, Orouskhani Y. 2022. Alzheimer’s disease detection from structural MRI using conditional deep triplet network. Neurosci. Inf. Vol. 2(4):100066

            17. Rana MM, Islam MM, Talukder MA, Uddin MA, Aryal S, Alotaibi N, et al.. 2023. A robust and clinically applicable deep learning model for early detection of Alzheimer’s. IET Image Process. Vol. 17(14):3959–3975

            18. Savita SV, Sabharwal M. 2021. Alzheimer’s disease detection through machine learning. Ann. Rom. Soc. Cell Biol. Vol. 25(3):2782–2792

            19. Sindhu TS, Kumaratharan N, Anandan P. 2024. Hybrid optimized deep fuzzy clustering-based segmentation and Deep Maxout Network for Alzheimer’s disease classification. Biomed. Signal Process. Control. Vol. 93:106118

            20. Sudha B, Srinivasan K. 2020. Machine learning and deep learning techniques in diagnosing Alzheimer’s disease—a review. Int. J. Adv. Sci. Technol. Vol. 29(5s):1678–1688

            21. Tajammal T, Khurshid SK, Jaleel A, Wahla SQ, Ziar RA. 2023. Deep learning-based ensembling technique to classify Alzheimer’s disease stages using functional MRI. J. Healthc. Eng. Vol. 2023:6961346

            22. Vidhya R, Banavath D, Kayalvili S, Naidu SM, Charles Prabu V, Sugumar D, et al.. 2023. Alzheimer’s disease detection using residual neural network with LSTM hybrid deep learning models. J. Intell. Fuzzy Syst. Vol. 45(2):12095–12109

            23. Xiao Z, Ding Y, Lan T, Zhang C, Luo C, Qin Z. 2017. Brain MR image classification for Alzheimer’s disease diagnosis based on multifeature fusion. Comput. Math. Methods Med. Vol. 2017:1952373

            24. Yadav SS, Sutar SR. 2021. Alzheimer’s disease diagnosis using structural MRI and machine learning techniquesMachine Vision and Augmented Intelligence—Theory and Applications: Select Proceedings of MAI 2021; Springer. Singapore: p. 645–665

            25. Zia A, Saeed MZ, Asghar N, Majeed B, Hashim M. 2022. Identification of Alzheimer disease by using hybrid deep modelsInternational Conference on Management Science and Engineering Management; Cham: Springer International Publishing. p. 465–475

            26. Zreika N, El-Zaart A, El Chakik A. 2021. A hybrid cross entropy thresholding for early Alzheimer’s disease detection. Int. J. Comput. Digit. Syst. Vol. 12:741–755

            Author and article information

            Journal
            jdr
            Journal of Disability Research
            King Salman Centre for Disability Research (Riyadh, Saudi Arabia )
            1658-9912
            18 June 2024
            : 3
            : 6
            : e20240073
            Affiliations
            [1 ]Cyber Security Department, College of Engineering and Information Technology, Onaizah Colleges, Onaizah 56447, Saudi Arabia
            [2 ]Department of Computer Science, School of Computer Science, University of Sunderland, Sunderland, UK
            [3 ]Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh 13713, Kingdom of Saudi Arabia
            [4 ]Department of Documents and Archive, Center of Documents and Administrative Communication, King Faisal University, Hofuf 31982, Al-Ahsa, Saudi Arabia
            Author notes
            Correspondence to: Hazim Saleh Al-Rawashdeh*, e-mail: hazim@ 123456oc.edu.sa , Mobile: +966546802211
            Author information
            https://orcid.org/0000-0001-5731-2791
            https://orcid.org/0000-0002-4973-3585
            https://orcid.org/0000-0002-1208-2678
            https://orcid.org/0000-0001-5445-7899
            Article
            10.57197/JDR-2024-0073
            5cb719d6-faf2-4ff4-8e6d-1a7bb1a3f267
            2024 The Authors.

            This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY) 4.0, which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.

            History
            : 05 May 2024
            : 02 June 2024
            : 02 June 2024
            Page count
            Figures: 4, Tables: 4, References: 26, Pages: 8
            Funding
            Funded by: King Salman Center for Disability Research
            Award ID: KSRG-2023-119
            The authors extend their appreciation to the King Salman Center for Disability Research for funding this work through Research Group no. KSRG-2023-119 (funder ID: http://dx.doi.org/10.13039/501100019345).

            Social policy & Welfare,Political science,Education & Public policy,Special education,Civil law,Social & Behavioral Sciences
            degenerative condition,feature extraction,pre-trained model,machine learning,deep learning,gradient boosting

            Comments

            Comment on this article