Diagnosing Osteoporosis in Postmenopausal Females Using Machine Learning and AdaBoostM1 Algorithm Based on Bone Mineral Density

Jabbar, Sohail; Ahmad, Awais; Tariq, Saba

doi:10.57197/JDR-2024-0055

INTRODUCTION

Osteoporosis is a common condition becoming more prevalent as the world’s population ages. This disorder leads to low bone mineral density (BMD) and a degradation of the bone’s microstructure, increasing the risk of fractures ( Friedman, 2006). BMD measures the amount of calcium and phosphorus in bone, used to diagnose osteoporosis, assess treatment effectiveness, and predict fracture risk, with low density. Osteoporosis-related hip, spine, and wrist fractures can cause diseases that reduce the patient’s quality of life and, in severe cases, increase the risk of mortality ( Dempster, 2011). Due to the rising global population and life expectancy, osteoporosis is becoming a more serious issue. Over 200 million people worldwide are believed to suffer from osteoporosis, and recent data from the International Osteoporosis Foundation indicate that one in three women and one in five men over the age of 50 are likely to experience osteoporotic fractures ( Shevroja et al., 2023). This disease progresses silently, often beginning with a low-energy fracture of a long bone or vertebrae. Early signs are often overlooked, leaving the disease undiagnosed and untreated, which is expected to continue given the asymptomatic early stages of the disease. To determine BMD, dual-energy x-ray absorptiometry is commonly used, which provides a T-score to indicate the BMD values ( Long et al., 2023). However, despite being the most reliable method, it is not often used in community settings due to its high cost and operational complexity.

Recent research has shown that dental panoramic radiographs (DPRs) can be a cost-effective option for digital imaging-based osteoporosis screenings. This is particularly important as women have a high-risk ratio for the lifetime probability of fracture, as Table 1 indicates ( Long et al., 2023). Several studies have demonstrated the feasibility of using DPRs for BMD assessment and osteoporosis screening. Additionally, panoramic radiation is frequently used in dental treatments for elderly patients ( Wong et al., 2022). Traditional methods for identifying osteoporosis have relied on manually classified feature indexes, but this approach has limitations due to the low-order representation of the heterogeneous patterns in radiographic images.

Table 1:

Remaining lifetime probability of fracture (%) in men and women risk ratio.

		At 50 years			At 80 years
Type of fracture	Women	Men	Risk ratio	Women	Men	Risk ratio
Hip	10.7	22.9	2.1	19.3	9.1	2.1
Forearm	4.6	20.8	4.5	8.9	1.6	5.6
Proximal humerus	4.1	12.9	3.1	7.7	2.5	3.1
Spine	8.3	15.1	1.8	8.7	4.7	1.9
Any of these	22.4	46.4	2.1	31.7	15.3	2.1

In the field of image classification, traditional machine learning (ML) algorithms such as support vector machine (SVM) and fuzzy classifiers require a lot of preprocessing tasks such as image normalization and area of interest (ROI) segmentation ( Caffarelli et al., 2022). This creates doubts about the repeatability of the classification process. Artificial intelligence (AI) is a vast field built on a foundation of mathematical and scientific disciplines. ML is a notable subset of AI that aims to enable machines to learn through feedback, experience, and input datasets. This subset further divides into deep learning (DL) and neural networks, with the ultimate goal of developing systems that can accurately process new and untouched datasets ( Ullah et al., 2022).

The use of ML in analyzing medical images has emerged as a promising and rapidly growing subset of AI. This field has a wide range of applications in image processing, which helps in detecting illnesses, aiding in computer-aided diagnostics, and advancing computer vision techniques ( Santos et al., 2019). The development of new imaging technologies such as multi-slice computed tomography, positron emission tomography, tomosynthesis, magnetic resonance imaging, tomography, and diffuse optical tomography necessitates the use of sophisticated ML techniques to further the analysis of medical images.

ML works on a collection of methods that are tailored to identify patterns in data autonomously, thereby aiding in making informed decisions in ambiguous situations and anticipating future data trends. This methodology relies on a data-driven decision-making process, which significantly reduces the need for human intervention by utilizing training data analysis to make predictions on new data inputs ( Tantalaki et al., 2019). In recent years, various ML strategies have been deployed to predict and identify diseases. For instance, natural language processing is used to analyze electronic health records to extract valuable data for disease forecasting and identification ( Van Vleck et al., 2019).

In the medical field, it is important to have transparency in decision-making. Explainable AI techniques such as SHapley Additive exPlanations help in understanding the predictions given by ML models. Additionally, generative models such as generative adversarial networks can be used to create artificial medical images, which can improve the accuracy of results. These techniques can work together to address specific issues and datasets ( Ahmed et al., 2020).

AI technology plays a crucial role in the medical domain, especially in image data analysis. It helps with disease detection, prediction, diagnosis, and classification, enables informed decision-making, and optimizes treatment strategies ( Sebastian and Peter, 2022). For instance, AI can detect tumors from medical imagery and facilitate early intervention. AI-based diagnostic models can identify underdiagnosed or undertreated individuals and recognize rare diseases ( Rabaan et al., 2022).

A study by Singh et al. has shown that ML algorithms such as SVM, k-nearest neighbors, Naïve Bayes, and decision tree are useful in detecting and diagnosing diseases such as cancer, diabetes, and heart attacks ( Singh et al., 2021). Another study found that DL technology can be applied to disc disease diagnosis categorization, using five attributes to improve decision analysis in illness diagnostics ( Hussain et al., 2023). Further research has demonstrated the effectiveness of ML techniques in identifying and classifying bone fractures, achieving an accuracy rate of 85% and a 0.86 area under the curve (AUC) through various DL approaches including detection, enumeration, and localization strategies ( Sahin, 2023).

This research aims to diagnose osteoporosis in postmenopausal women using BMD. Below are the contributions of our research:

This study aims to increase awareness of osteoporosis in postmenopausal women and highlights the unique challenges and risks faced by this demographic. Diagnostic frameworks and strategies will be developed specifically for community healthcare settings to achieve this objective. These frameworks will emphasize the crucial role of readily available methods such as clinical risk assessments and fracture risk prediction models, especially in resource-strained areas.
In addition, this research will focus on innovative diagnostic techniques, encouraging the use of automated detection and refined risk assessment models to improve diagnostic capabilities. The study will also underscore the importance of patient education in raising awareness among postmenopausal women about osteoporosis, its risk factors, and the critical role of early detection and management.
Finally, this study seeks to alleviate the societal and economic burden associated with osteoporotic fractures by promoting timely diagnosis and intervention in local settings. Its findings will have significant public health implications.

The article is organized into several sections. The Materials and Methods section reviews the related works in the field and provides details on the methodology of the proposed system. The Results and Discussion and Comparative Discussion sections delve into the results and discussion based on the research findings. Finally, the Conclusion and Future Work section offers the conclusion drawn from the study and outlines potential avenues for future work.

MATERIAL AND METHODS

BMD based on ML

The high-level architecture of a model is illustrated in Figure 1. It uses ML to determine BMD. The model consists of three layers, each with three modules.

Figure 1:

Machine learning model for detection of osteoporosis based on bone mineral density (BMD).

The first layer begins with the dataset module, which contains all relevant information on bone fractures. The next module, data preprocessing, is crucial to the study as it is responsible for cleaning and annotating the data. The final module in this layer focuses on data training, where the dataset is trained and prepared for the next layer. In the second layer, we start with data testing by analyzing the dataset based on the training data. The data are then transferred to the model module where predictions are made based on previously established patterns. To ensure the model’s effectiveness and reliability, it is essential to rigorously evaluate its performance using a standardized dataset. Our analysis utilized a publicly available dataset from Kaggle specializing in bone detection ( Kaggle, 2024).

The dataset provided is divided into two folders, one containing control data and the other containing data for various circumstances. The comma separated values files in each folder record the details of each patient, including their ID, age, fracture details, weight, height, and medication history. The actigraphy records have been compiled over several years, making this dataset a rich repository of patient information. The process of creating an ML model for osteoporosis diagnosis is presented in Figure 2, in the form of a knowledge process diagram. This diagram showcases the integral phases of data preparation, model training, and model assessment, providing a conceptual roadmap for medical professionals to better understand the operational dynamics of these models in osteoporosis diagnosis. Detailed descriptions of each phase are presented in the paragraphs that follow, aiming to provide a comprehensive insight into the diagnostic potential of this architecture.

Figure 2:

Knowledge workflow diagram of the proposed model.

Stage 1—data acquisition

The first step in studying BMD and osteoporosis is to collect relevant data such as age, gender, and medical history. This can be done through clinical research, surveys, and examination of medical records. For our study, we obtained the necessary data from Kaggle, a well-known open-source dataset archive. This dataset includes 169 rows and 9 columns of various attributes, including a unique ID for each patient, age, a classification label indicating whether the patient has experienced a fracture, weight in kilograms, and height in centimeters. Other details such as medication, waiting time, and BMD values were also included, providing a comprehensive insight into each patient’s health status. We created a graphical representation of gender-wise BMD distributions in Figure 3, where blue dots represent male data points and pink dots represent female data points. The graph highlights a discernible pattern: females are more affected by BMD variation than males, particularly postmenopausal females. This underscores the serious impact of osteoporosis in this demographic.

Figure 3:

Gender-wise BMD distributions. Abbreviation: BMD, bone mineral density.

Stage 2—data preprocessing

The next step in the ML process is data preprocessing, which is a crucial step that involves cleaning the dataset by removing outliers and transforming the data into a format suitable for ML algorithms. This stage includes several techniques, such as data cleaning, data transformation, data integration, and dimensionality reduction. Data cleaning involves removing noisy and irrelevant information from the dataset to ensure more accurate analyses. Data transformation involves converting data values from one format to another to ensure consistency and compatibility with the analytical tools used. The data integration technique is applied to eliminate data duplication, ensuring a single and correct representation of each data point. Dimensionality reduction, on the other hand, simplifies the data representation by converting three-dimensional data into two-dimensional data, while maintaining the essential characteristics and reducing complexity and storage requirements. Figure 4 illustrates the workflow of the data preprocessing stage, showing the systematic progression from the raw data to a refined format ready for analysis.

Figure 4:

Data preprocessing steps.

Stage 3—data training

During this stage, the data that have been preprocessed go through training using a chosen ML algorithm. There are many ML algorithms available, each with its strengths and weaknesses. The choice of algorithm, in this case, AdaBoostM1, depends on the specifics of the dataset and the analysis objectives. This crucial step involves inputting the preprocessed data into the algorithm, allowing it to learn the complex relationships between BMD and osteoporosis indicators. The chosen attributes within the dataset play a critical role in determining the effectiveness of the ML algorithm and, consequently, the accuracy of the osteoporosis identification process. Figure 5 demonstrates the importance of selecting attributes that are directly aligned to accurately identify osteoporosis through analysis.

Figure 5:

AdaBoostM1 method tree visualization. Abbreviation: BMD, bone mineral density.

Stage 4—model testing

It is crucial to ensure the performance of an ML algorithm. This can be accomplished by conducting a thorough evaluation using a separate dataset that is not used during the training phase. This approach helps to prevent overfitting to the training data and enables the model to perform well on new and unseen data.

Stage 5—model prediction and evaluation

After the testing phase, the ML algorithm has been validated and is now ready to be deployed in a clinical setting to help identify osteoporosis. Fine-tuning the algorithm is critical, and it involves adjusting hyperparameters, which are the settings that control the algorithm’s behavior. These adjustments can significantly improve the algorithm’s performance, leading to a more accurate diagnosis of osteoporosis. The field of ML is continually evolving, and with advancements, there are improved capabilities for detecting osteoporosis. This facilitates the development of novel and more effective strategies for identifying and managing the condition.

RESULTS AND DISCUSSION

The data obtained from this process can help medical professionals make more informed decisions about the care and treatment of their patients. These data can be used as a precise tool to distinguish between individuals with fractures and those without. Ultimately, it can lead to a higher standard of medical care and better patient outcomes. In Figure 6, a graph shows the correlation between age and the likelihood of fractures. The x-axis represents age, while the y-axis shows the risk of experiencing a fracture. The graph illustrates the increasing probability of fractures with age, with the red zone indicating higher risk. According to the study, the risk of fractures increases with age. For example, a 19-year-old has a 35.81% risk, while a 76-year-old has a substantial 88.75% risk.

$Relation of candidate age versus risk of fracture$

Figure 6:

Relation of candidate age versus risk of fracture.

This is because as people age, their bones become more brittle and weaker, making them more susceptible to fractures in the case of falls and other injuries. As a result, senior citizens are at an increased risk of experiencing bone fractures.

The line graph shown in Figure 7 displays the relationship between true positive rate (TPR) and false positive rate (FPR) at various thresholds. TPR indicates the percentage of correctly identified positive cases, while FPR indicates the percentage of negative cases that are incorrectly identified. The graph reveals that increasing the threshold correlates with an increase in TPR, which improves the model’s accuracy in identifying positive cases. However, raising the threshold also increases the FPR, which means that the model is more likely to label negative instances inaccurately. Therefore, establishing the optimal threshold is crucial to achieving a balance between TPR and FPR, thereby enhancing the model’s ability to distinguish between positive and negative cases.

Figure 7:

Relationship between true positive rate (TPR) and false positive rate (FPR).

In this scenario, a threshold of 0.5 is found to be appropriate for instructing the model to classify a case as positive. This means that the model will classify a case as positive if the probability of it being positive exceeds 0.5. This mechanism is closely related to precision and recall. Precision refers to the ratio of correctly identified positive instances, while recall represents the fraction of total positive instances that have been correctly identified.

As shown in Figure 8, the cost curve represents the performance of a binary classifier in evaluating the financial implications of diagnostic errors. There are two kinds of incorrect diagnoses: false positives, where individuals are incorrectly identified as having a fracture, and false negatives, where individuals who have a fracture are erroneously declared fracture-free. Each type of error has associated costs. The analysis of the cost curve reveals that, for this model, the financial consequences of false positives are less severe than those of false negatives. This suggests that the model is more likely to inaccurately identify individuals as fracture patients rather than falsely clearing individuals who have a fracture.

Figure 8:

Cost curves used for performance evaluation.

The following analysis has revealed that the model used tends to be overly cautious in its predictions, which may result in false alerts. Improving the model’s specificity can enhance its predictive capabilities, reducing unnecessary alerts and focusing on actual fracture cases. By fine-tuning the model, accurate and cost-effective BMD assessments can be conducted for osteoporosis diagnosis.

Figure 9 displays the results of a non-parametric statistical method called the Nemenyi test, which compares the means across different groups to identify significant differences. This test is crucial for understanding the pairwise comparisons among various groups. The diagram shows a value 1 along the diagonal line, indicating an equivalence between the values represented on the x and y axes. This analytical tool was used to evaluate the BMD dataset, comparing individual columns to determine their accuracy in identifying the essential characteristics for osteoporosis diagnosis. The test provides an invaluable landscape of the efficacy of different features in osteoporosis detection, serving as a compass to identify the most potent identifiers in the dataset.

Figure 9:

Nemenyi test is a non-parametric statistical test. Abbreviation: BMD, bone mineral density.

The receiver operating characteristic (ROC) curve, shown in Figure 10, is a tool used to evaluate binary classifiers’ effectiveness in distinguishing between two categories—in this case, fractures and no fractures. The curve is based on TPR and FPR. TPR measures the proportion of positive instances that are correctly identified, while FPR represents the proportion of negative instances that are incorrectly classified as positive.

Figure 10:

Area under ROC curve—classifiers’ effectiveness of fractures and no fractures. Abbreviation: ROC, receiver operating characteristic.

Ideally, a perfect classifier would achieve a TPR of 1 without any false positives, resulting in an FPR of 0. However, this equilibrium is practically impossible due to the inherent trade-off between TPR and FPR. The AUC, which represents the model’s overall performance, is 0.855 in this case. An AUC of 1 indicates a perfect classifier, while an AUC of 0.5 represents a random classifier. An AUC of 0.855 indicates a highly capable classifier, although there is still room for improvement. This could potentially be achieved by focusing on reducing the FPR to improve the model’s specificity in identifying fractures.

Figure 11 displays a scatter plot illustrating the relationship between BMD and fracture status. Blue points represent individuals without fractures, while red points represent individuals with fractures. The decision boundary, a delineating line used by the model to determine an individual’s fracture status, is at the center of the plot. Analyzing the plot reveals that the model has effectively predicted the fracture status of most individuals. However, there are instances of misclassification: the model incorrectly attributed fracture status to 9 individuals who did not have fractures (false positives) and failed to identify fractures in 11 individuals who did have them (false negatives).

$Scatter plot of BMD versus fracture status$

Figure 11:

Scatter plot of bone mineral density (BMD) versus fracture status.

The overall accuracy rate is about 88.1657%, which means that over 88% of the population analyzed has been predicted correctly. Although the accuracy rate is high, there is still room for improvement in reducing the number of false positive and false negative rates. One can adjust the classification algorithm’s parameters to improve the model’s accuracy in identifying fractures accurately to enhance its sensitivity and specificity. Depending on the application of this model, different approaches to tuning these parameters could be more beneficial to optimize the model’s predictive accuracy.

The graph shown in Figure 12 demonstrates the relationship between the decision threshold and the predicted probability of a positive classification for a logistic regression model. In this case, the positive classification refers to patients with fractures, while the negative classification refers to those without fractures. The x-axis represents the decision threshold, while the y-axis represents the predicted probability of a positive classification. The curve in the graph was generated using a threshold of 0.4743. This means that patients were classified as having a fracture if the predicted probability equaled or exceeded this value.

Figure 12:

Relationship between the decision threshold and the predicted probability of a positive classification for a logistic regression model.

The graph shows that the model correctly identified a significant number of fracture cases. However, there were still some instances of misdiagnosis, which suggests that the decision threshold may not be optimal. The test results provide further insights into the model’s effectiveness. The classification accuracy was found to be 88.1657%, indicating that over 88% of patients were correctly classified. The data also showed that the model had a sensitivity of 63.91% and a specificity of 99.2308%. This means that the model correctly identified 63.91% of patients with fractures and 99.2308% of those without.

Although the model’s specificity is commendably high, indicating the accurate classification of negative cases, there is still considerable room for improving its sensitivity to classify more positive cases correctly. This would augment the model’s overall efficacy.

Figure 13 presents a confusion matrix that is a potent tool in medical diagnostics. The values classified under “a” represent cases without fractures, while those categorized under “b” signify cases with fractures. Specifically, we observe 109 and 10 instances under category “a,” alongside 9 and 41 instances recorded under category “b.” This detailed breakdown gives healthcare professionals critical insights for informed analysis and decision-making.

Figure 13:

Confusion matrix of the proposed model.

Figure 14 presents the results of an ML test designed to divide patients into two groups: those with fractures and those without. To determine whether a patient has a fracture, a threshold of 0.4743 was established, which was optimized to improve the cost–benefit curve. This curve balances between accurately identifying patients with fractures (true positives) and mistakenly flagging patients without fractures (false positives). The confusion matrix shows the actual versus predicted classifications for the patients in the test group. The diagonal cells indicate successful classifications, while the off-diagonal cells show classification errors. The accompanying cost matrix shows the financial consequences of each type of misclassification, indicating that misdiagnosing a patient with a fracture could be 20 times more expensive than correctly identifying them as fracture-free.

Figure 14:

Cost–benefit and threshold curve of the proposed model.

The model achieved a classification accuracy of 88.1657% in terms of performance metrics, indicating the proportion of patients classified correctly. It demonstrated a sensitivity rate of 63.91%, indicating the correctly identified fracture cases, and an impressive specificity of 99.2308%, which refers to the correct identifications of non-fracture cases. The results confirm the model’s ability to identify patients who do not have fractures, highlighting its significant aptitude in this criterion. However, there is still room for improvement. Reducing the occurrence of false positives would increase the model’s accuracy further.

Figure 15 depicts a bubble chart that shows the connections between BMD, gender, medication usage, and the risk of sustaining fractures. Each bubble in the chart represents a unique patient, with the size of the bubble indicating the number of patients falling into a particular category. The chart’s color coding uses red to indicate patients who have suffered fractures and green for those who have not, making it easy to see each patient’s fracture status.

Figure 15:

Bubble chart represented fracture versus no fracture based on the BMD, sex, and medication. Abbreviation: BMD, bone mineral density.

The chart directly correlates decreasing BMD levels and increasing fracture risks. It means that as BMD levels decrease, the likelihood of experiencing a fracture goes up. The chart also shows that women are more likely to experience fractures than men, which could be due to generally lower BMD levels in women.

Moreover, the chart highlights the significant role of medication in reducing fracture risks. Patients under medication show a reduced propensity for fractures, presumably due to the medication’s effectiveness in increasing BMD and slowing bone degradation. The insights from this chart highlight that BMD, gender, and medication are critical factors that influence fracture risks. Healthcare professionals can use these data to develop preventive strategies and identify individuals with a higher risk of fractures, leading to a more informed and proactive approach to patient care.

Table 2 provides a detailed breakdown of the accuracy by class, which is essential for evaluating the performance of the classification model. The table assesses the ability of the model to distinguish between cases labeled as “fracture” and “no fracture.” The model shows a high TPR of 0.916 and a low FPR of 0.18 for the “no fracture” classification, resulting in a precision score of 0.924 and a recall score of 0.916. Currently, the F-measure, which is also known as the precision-to-recall ratio, is at 0.92.

Table 2:

Detailed accuracy by class.

Attribute	TP rate	FP rate	Precision	Recall	F-measure	AUC
No fracture	0.916	0.18	0.924	0.916	0.92	0.87
Fracture	0.82	0.084	0.804	0.82	0.812	0.87
Weighted average	0.888	0.152	0.888	0.888	0.888	0.87

Abbreviations: AUC, area under the curve; FP, false positive; TP, true positive.

The ROC area shown in Figure 16 represents the model’s overall performance, with a score of 0.87. The weighted average across all classes indicates an accuracy rate of 0.888, demonstrating that the model accurately identifies “fracture” and “no fracture” scenarios. These metrics provide valuable insights into the model’s performance, making it an essential tool for informed decision-making in healthcare and other contexts that involve classification.

Figure 16:

Result comparisons of the proposed model. Abbreviations: AUC, area under the curve; FP, false positive; TP, true positive.

COMPARATIVE DISCUSSION

When dealing with datasets with a class imbalance, using stratified cross-validation is a practical approach in statistical analysis and ML for assessing the effectiveness of predictive models. This method helps eliminate biases during model evaluation by ensuring that the target classes are equally represented in both the training and testing subsets, which reflects the overall dataset. The results obtained from stratified cross-validation concisely represent the model’s performance metrics. In this particular case, the model accurately categorized 150 out of 169 instances, resulting in an accuracy rate of 88.7574%. However, there were 19 misclassifications, representing an error rate of 11.2426%. The kappa statistic, which measures the level of agreement beyond what would be expected by chance, recorded a value of 0.7317, indicating that the model performed exceptionally well. The mean absolute error (MAE) and root mean square error (RMSE) metrics provide further insight into the model’s prediction errors. The RMSE, which is 0.3224, represents the standard deviation of the errors and shows their dispersion, while the MAE, which is 0.1743, represents the average magnitude of the errors. The model’s predictions were found to have a relative absolute error of 41.7357% compared to the actual values, along with a root relative squared error of 70.6302%. These parameters are essential for evaluating the accuracy of the predictions relative to the true values. In conclusion, stratified cross-validation is critical for evaluating ML models, especially when dealing with data imbalance. In this case, the model demonstrated remarkable predictive ability, high precision in classification, and acceptable error metrics, making it a suitable candidate for real-world applications where strict adherence to class balance is necessary for accurate forecasting.

Table 3 presents the results of a newly proposed model through a range of evaluation metrics, demonstrating outstanding performance. The model has an AUC value of 0.855, indicating its ability to distinguish between different classes accurately. An F1 score of 0.888 also highlights the precision and recall balance, demonstrating the model’s reliability. The model has a TPR of 88%, indicating its strength in identifying relevant instances with great accuracy. The overall accuracy rate of the model is 88.75%, indicating its potential superiority over existing models in various applications. Figure 17 shows the comparative performance of the proposed AdaBoostM1 model against existing techniques for osteoporosis screening. The model uses a dataset of BMD measurements from patients with and without osteoporosis. The results show that the AdaBoostM1 model surpasses current methodologies and more complex models requiring substantial training data. Additionally, it offers the benefit of being more straightforward to train and deploy. Therefore, it is a promising tool in osteoporosis screening, offering excellent accuracy and ease of use. The AdaBoostM1 model is the advanced form of AdaBoost mode. Adaptive boosting, or the AdaBoost algorithm, is an ML ensemble strategy that makes use of boosting techniques. Adaptive boosting is named thus because each instance’s weights are redistributed, with higher weights assigned to instances that are incorrectly identified.

Figure 17:

AUC and accuracy results in comparison with existing techniques. Abbreviation: AUC, area under the curve.

Table 3:

Comparison of osteoporosis screening results with existing techniques.

Author	Technique	AUC	F-measure	TPR (%)	Accuracy (%)
Shi et al. (2021)	CNN	0.61	—	—	85
Dzierżak and Omiotek (2022)	Radial function	0.829	—	51.0	83.3
Wu et al. (2023)	AdaBoost	0.814	0.679	—	74.4
Abedi et al. (2022)	CART	0.868	—	—	86
Liu et al. (2022)	VGG16	0.74	—	—	80
Wu et al. (2023)	KNN	0.632	0.518	—	65
Jang et al. (2021)	VGG16	0.74	—	91.1	81.2
Proposed model		0.875	0.888	88	88.75

Abbreviations: AUC, area under the curve; KNN, k-nearest neighbors; TPR, true positive rate.

CONCLUSION AND FUTURE WORK

This study examines the characteristics of osteoporosis, a systemic disease that weakens bone strength and increases the risk of fractures, especially in the hips, spine, and wrists, due to the degradation of BMD. This not only triggers other illnesses but also reduces the quality of life, and in severe cases, it can even increase the mortality rate. In community settings, it is essential to address the critical healthcare challenge of detecting osteoporosis and low BMD in postmenopausal women. The proposed model aims to be a robust tool in predicting osteoporotic fractures, which will significantly positively impact an individual’s quality of life.

Innovative solutions such as telemedicine and mobile screening units can enhance accessibility to screenings. Community-based osteoporosis screening initiatives can promote greater awareness and facilitate the early identification of at-risk individuals. Additionally, developing and validating more accurate and personalized risk assessment models remains a priority. Incorporating elements beyond the conventional clinical criteria, such as genetic markers and lifestyle factors, can be a prospective pathway for advancement in early detection strategies. The proposed system can be upgraded with the Modified MixNet Model ( Ahoor et al., 2023) to create an automated classification system for osteoporosis in postmenopausal females. Pre- and postmenopausal women are also prone to ovarian cancer ( Ziyambe et al., 2023). This study project can be extended to address this issue as well.

[1] Abedi R, Costache R, Shafizadeh-Moghadam H, Pham QB. 2022. Flash-flood susceptibility mapping based on XGBoost, random forest, and boosted regression trees. Geocarto Int. Vol. 37(19):5479–5496

[2] Ahmed Z, Mohamed K, Zeeshan S, Dong X. 2020. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database. Vol. 2020:baaa010

[3] Ahoor A, Arif F, Sajid MZ, Qureshi I, Abbas F, Jabbar S, et al.. 2023. MixNet-LD: an automated classification system for multiple lung diseases using modified MixNet model. Diagnostics. Vol. 13(20):3195

[4] Caffarelli C, Tomai Pitinca MD, Al Refaie A, De Vita M, Catapano S, Gonnelli S. 2022. Could radiofrequency echographic multi spectrometry (REMS) overcome the overestimation in BMD by dual-energy x-ray absorptiometry (DXA) at the lumbar spine? BMC Musculoskelet. Disord. Vol. 23(1):469

[5] Dempster DW. 2011. Osteoporosis and the burden of osteoporosis-related fractures. Am. J. Manag. Care. Vol. 17(6):S164

[6] Dzierżak R, Omiotek Z. 2022. Application of deep convolutional neural networks in the diagnosis of osteoporosis. Sensors. Vol. 22(21):8189

[7] Friedman W. 2006. Important determinants of bone strength: beyond bone mineral density. J. Clin. Rheumatol. Vol. 12(2):70–77

[8] Hussain M, Koundal D, Manhas J. 2023. Deep learning-based diagnosis of disc degenerative diseases using MRI: a comprehensive review. Comput. Electr. Eng. Vol. 105:108524

[9] Jang R, Choi JH, Kim N, Chang JS, Yoon PW, Kim C-H. 2021. Prediction of osteoporosis from simple hip radiography using deep learning algorithm. Sci. Rep. Vol. 11(1):19997

[10] Kaggle. 2024. https://www.kaggle.com/datasets/amarsharma768/bmd-dataAccessed March 14, 2024

[11] Liu T, Siegel E, Shen D. 2022. Deep learning and medical image analysis for COVID-19 diagnosis and prediction. Annu. Rev. Biomed. Eng. Vol. 24:179–201

[12] Long G, Liu C, Liang T, Zhang Z, Qin Z, Zhan X. 2023. Predictors of osteoporotic fracture in postmenopausal women: a meta-analysis. J. Orthop. Surg. Res. Vol. 18(1):574

[13] Rabaan AA, Bakhrebah MA, AlSaihati H, Alhumaid S, Alsubki RA, Turkistani S, et al.. 2022. Artificial intelligence for clinical diagnosis and treatment of prostate cancer. Cancers. Vol. 14(22):5595

[14] Sahin ME. 2023. Image processing and machine learning-based bone fracture detection and classification using x-ray images. Int. J Imaging Syst. Technol. Vol. 33(3):853–865

[15] Santos MK, Ferreira Júnior JR, Wada DT, Tenório APM, Barbosa MHN, Marques PMA. 2019. Artificial intelligence, machine learning, computer-aided diagnosis, and radiomics: advances in imaging towards to precision medicine. Radiol. Bras. Vol. 52:387–396

[16] Sebastian AM, Peter D. 2022. Artificial intelligence in cancer research: trends, challenges, and future directions. Life. Vol. 12(12):1991

[17] Shevroja E, Reginster JY, Lamy O, Al-Daghri N, Chandran M, Demoux-Baiada AL, et al.. 2023. Update on the clinical use of trabecular bone score (TBS) in the management of osteoporosis: results of an expert group meeting organized by the European Society for Clinical and Economic Aspects of Osteoporosis, Osteoarthritis and Musculoskeletal Diseases (ESCEO), and the International Osteoporosis Foundation (IOF) under the auspices of WHO Collaborating Center for Epidemiology of Musculoskeletal Health and Aging. Osteoporos. Int. Vol. 34:1501–1529

[18] Shi F, Wang J, Shi J, Wu Z, Wang Q, Tang Z, et al.. 2021. Review of artificial intelligence techniques in imaging data acquisition, segmentation, and diagnosis for COVID-19. IEEE Rev. Biomed. Eng. Vol. 14:4–15

[19] Singh P, Singh N, Singh KK, Singh A. 2021. Diagnosing of disease using machine learningMachine Learning and the Internet of Medical Things in Healthcare. p. 89–111. Elsevier.

[20] Tantalaki N, Souravlas S, Roumeliotis M. 2019. Data-driven decision making in precision agriculture: The rise of big data in agricultural systems. J. Agric. Food Inform. Vol. 20(4):344–380

[21] Ullah Z, Usman M, Jeon M, Gwak J. 2022. Cascade multiscale residual attention CNNS with adaptive ROI for automatic brain tumor segmentation. Inform. Sci. Vol. 608:1541–1556

[22] Van Vleck TT, Chan L, Coca SG, Craven CK, Do R, Ellis SB, et al.. 2019. Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression. Int. J. Med. Inform. Vol. 129:334–341

[23] Wong RMY, Cheung WH, Chow SKH, Ng RWK, Li W, Hsu AYC, et al.. 2022. Recommendations on the post-acute management of the osteoporotic fracture - patients with “very-high” re-fracture risk. J. Orthop. Translat. Vol. 37:94–99

[24] Wu X, Zhai F, Chang A, Wei J, Guo Y, Zhang J. 2023. Application of machine learning algorithms to predict osteoporosis in postmenopausal women with type 2 diabetes mellitus. J. Endocrinol. Invest. Vol. 46:2535–2546

[25] Ziyambe B, Yahya A, Abbas Q, Babar M, Albathan M, Asim M, et al.. 2023. A deep learning framework for the prediction and diagnosis of ovarian cancer in pre- and post-menopausal women. Diagnostics. Vol. 13(10):1703

Journal of Disability Research

Diagnosing Osteoporosis in Postmenopausal Females Using Machine Learning and AdaBoostM1 Algorithm Based on Bone Mineral Density

Abstract

Main article text

INTRODUCTION

MATERIAL AND METHODS

BMD based on ML

Stage 1—data acquisition

Stage 2—data preprocessing

Stage 3—data training

Stage 4—model testing

Stage 5—model prediction and evaluation

RESULTS AND DISCUSSION

COMPARATIVE DISCUSSION

CONCLUSION AND FUTURE WORK

SOURCE/DATA AVAILABILITY

ACKNOWLEDGMENTS

REFERENCES

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Comments

Comment on this article