Comparison of Segmentation Algorithms for Detecting Myocardial Infarction Using Late Gadolinium Enhancement Magnetic Resonance Imaging

Sun, Yibo; Deng, Dongdong; Sun, Liping; He, Yi.; WANG, Hui; Dong, Jianzeng

doi:10.15212/CVIA.2019.0574

Abstract

Objective: The aim of this study was to validate the accuracy of a new automatic method for scar segmentation and compare its performance with that of two other frequently used segmentation algorithms.

Methods: Twenty-six late gadolinium enhancement cardiovascular magnetic resonance images of diseased hearts were segmented by the full width at half maximum (FWHM) method, the n standard deviations (nSD) method, and our new automatic method. The results of the three methods were compared with the consensus ground truth obtained by manual segmentation of the ventricular boundaries.

Results: Our automatic method yielded the highest Dice score and the lowest volume difference compared with the consensus ground truth segmentation. The nSD method produced large variations in the Dice score and the volume difference. The FWHM method yielded the lowest Dice score and the greatest volume difference compared with the automatic, 6SD, and 8SD methods, but resulted in less variation when different observers segmented the images.

Conclusion: The automatic method introduced in this study is highly reproducible and objective. Because it requires no manual intervention, it may be useful for processing large datasets produced in clinical applications.

Main article text

Introduction

Ventricular tachycardia is a life-threatening heart disease that occurs frequently in patients with myocardial infarction (MI) [1]. The presence of MI has an important prognostic and therapeutic value for predicting ventricle remodeling and cardiac dysfunction [2]. Late gadolinium enhancement (LGE) magnetic resonance imaging (MRI) is the standard imaging technique used for MI detection [3, 4]. It has many advantages compared with computed tomography and echocardiography [4], and has become the first choice for detecting myocardial fibrosis [5, 6].

Although LGE MRI is widely used in a clinical setting to detect the location, transmurality, and composition of an infarct [7], there is no recognized optimal method of LGE quantification [8]. Numerous methods for segmenting infarct tissues in the left ventricle have been devised. Of these, the most frequently used are threshold based, such as the full width at half maximum (FWHM) method [9] and the n standard deviations (nSD) method [10]. Although they are widely used in the clinic and in research, they require manual intervention, which is subjective and inefficient. Several semiautomatic and automatic methods have been proposed [11, 12], but most still require manual input or postprocessing, which hinders efficient image processing for a large number of patients.

The aim of this study was to validate the accuracy of a new automatic segmentation method and compare its performance against that of the nSD and FWHM methods for scar segmentation.

Methods

Image Acquisition

Cardiovascular magnetic resonance (CMR) images from 26 diseased hearts were collected from Beijing Anzhen Hospital. Basic information on the patients is given in Table 1. This study was approved by the Institutional Review Board of Beijing Anzhen Hospital, and informed consent was obtained from all participants. The detailed image acquisition protocol was published previously [13]. Briefly, CMR scans were performed with a 1.5 T scanner (Sonata, Siemens, Erlangen, Germany) with chest electrocardiogram gating and breath-hold techniques. The contrast agent was injected via an ulnar vein under high pressure, and late imaging was performed 15 min later. The scanning layer thickness was 8 mm or 10 mm with a field of view between 320×320 mm² and 340×340 mm². The final in-plane image resolution was between 1.4 mm and 1.75 mm.

Table 1

Characteristics of the Study Population.

	Frequency (%)	Mean±SD
Men	96.2 (n=25)	–
Women	3.8 (n=1)	–
Age (years)	–	57.58±10.58
EF (%)	–	40.37±14.83
LVEDV (mL)	–	132.22±51.52
Scar mass (mm³)	–	30,436.48±23,095.58
Total scar (%)	–	23.25±13.86

EF, ejection fraction; LVEDV, left ventricular end-diastolic volume; SD, standard deviation.

Image Processing

All analyses and measurements were performed with custom software developed in MATLAB (The MathWorks, USA). For all LGE CMR images, the epicardial and endocardial boundaries in every two-dimensional slice were manually segmented by one experienced expert. The papillary muscles were excluded from the endocardium. Then pixels between the epicardial and endocardial boundaries were segmented by nSD thresholding, FWHM thresholding, and our new automatic method (see the third paragraph of this section).

For the nSD method, observers with 1, 3, and 8 years of image segmentation experience selected a region of interest (ROI) far from the enhanced area. Then the mean and the SD of the selected ROI were calculated, and pixels with an intensity greater than nSD above the mean were defined as infarct tissue. The value of n was selected as 2, 4, 6, 8, or 10. For the FWHM method, the same three people selected one enhanced area, the maximal intensity of the selected enhanced area was calculated, and the pixels with intensity greater than half the maximal value were segmented as infarct tissue. For both the nSD method and the FWHM method, further manual improvements to the segmented images were made, including the removal of pixels not connected to the infarct tissue.

The automatic algorithm developed by our group uses a classification method based on the Gaussian mixture model to segment the tissue inside the epicardial and endocardial boundaries. The Gaussian mixture model assumes a Gaussian distribution of the image intensity of each fitted class (in our case noninfarct tissue and infarct tissue) [14, 15], each class having its own mean intensity μ and variance σ ². The maximal intensity (I _max) and minimal intensity (I _min) of the pixels between the epicardium and endocardium were calculated, and those pixels with intensity greater than 0.3 (I _max−I _min) were designated as containing infarct tissue. Next, the regions in each image layer with more than 15 pixels were retained to remove small clusters of pixels affected by noise or blood vessels. The maximal component in each layer and the components with pixel intensities greater than 15% of the maximal components were then designated as the final infarct tissue to remove numerous pixels containing fat tissue or artifacts.

Evaluation Metrics

After analysis of the entire stack of ventricular images, the Dice and volume difference metrics were used to evaluate the segmentations. The Dice score represents the overlap between the ground truth (the segmentation of scar tissue performed by two experts with more than 10 years of CMR image segmentation experience) and the segmentation produced by the other algorithms described earlier. The volume difference measures the difference between the infarct volumes measured with the ground truth and the other segmentation algorithms.

Results

The segmentations obtained with the nSD method, the FWHM method, and the automatic method were compared with the consensus ground truth and their accuracy was measured by the Dice score. Without any manual intervention, our automatic segmentation method produced the highest Dice score (0.8) and the smallest variation in the Dice score for each image dataset (Figure 1A). The 6SD and 8SD methods yielded a mean Dice score of 0.72, and the variation of the Dice score for each image dataset was much greater than that for the automatic method. The FWHM method had a mean Dice score of 0.64, and the variation of the Dice score for each image dataset was comparable with that for the automatic method.

Figure 1:

Segmentation accuracy of the automatic method and of individual observers using other segmentation methods. Dice scores and volume differences were calculated for every region of scar tissue found in the consensus segmentation. Panel (A) is the dice score without any manual intervention; Panel (B) is the dice score with manual removal of pixels that were not connected to the infarct tissue; Panel (C) is the volume difference without any manual intervention; Panel (D) is the volume difference with manual removal of pixels that were not connected to the infarct tissue. Volume differences are reported in milliliters. SD, Standard Deviations.

After manual removal of pixels that were not connected to the infarct tissue, the Dice scores produced with the nSD and FWHM methods were increased (Figure 1B). The 6SD and 8SD methods produced mean Dice scores of 0.79 and 0.77, respectively, which were close to the value obtained with the automatic method. The Dice score obtained with the FWHM method increased minimally from 0.64 to 0.65.

To further evaluate the accuracy of the infarct tissue segmentation, we also compared the volume difference between the consensus ground truth and the three segmentation methods (Figure 1C and D). Our automatic method produced the smallest volume differences, with a median value of 0.70 mL. Before manual modification of the segmented images (Figure 1C), the median values of the volume differences produced by the 6SD and 8SD methods were 1.31 mL and 1.09 mL, respectively. The FWHM method yielded a median volume difference of 1.47 mL. After manual modification of the segmented images (Figure 1D), the median values of the volume differences obtained with the 6SD and 8SD methods decreased to 0.81 mL and 0.71 mL, respectively. Manual intervention increased the median volume difference obtained with the FWHM method to 1.68 mL.

For different observers, large variations in the Dice score and the volume difference were obtained, especially before manual adjustments were made. The automatic method did not require any manual intervention for scar segmentation and thus had the highest reproducibility. The nSD method produced large variations in both the highest Dice scores and the lowest volume differences. For observers 1 and 3, the highest Dice score was obtained with the 6SD method, whereas the 10SD method yielded the highest Dice score for observer 2. The Dice score obtained with the FWHM method also varied for each observer. After adjustment of the segmented images, the 6SD method yielded the highest Dice score for all three observers. Before manual intervention, the 8SD method produced the lowest volume difference for observers 1 and 3, whereas the 10SD method resulted in the lowest value for observer 2. After manual adjustment of the image segmentation, the 8SD method produced the lowest volume difference for observers 1 and 2, while the 6SD method yielded the lowest value for observer 3.

Figure 2 shows different observers’ segmentations of one patient’s CMR images. The “manual” panel shows the manual segmentation of one slice, which was treated as the ground truth segmentation. The “automatic” panel shows the segmentation of scar tissue obtained with our automatic method. The result was very close to the consensus ground truth, which can be attributed to the efficient removal of artifacts by the automatic method. The first row in each observer’s segmentation in Figure 2 shows the scar segmentation produced with the nSD method (n=2, 4, 6, 8, and 10) and the FWHM method. The second row shows the modified segmentation obtained after manual removal of pixels disconnected from the infarct tissue. For all observers’ segmentations, the 2SD, 4SD, 6SD, and 8SD methods needed manual intervention to remove artifacts associated with fatty tissue on the epicardium. The 10SD method sometimes omitted some scar tissue from the segmentation (observer 3), whereas the FWHM method excluded some scar tissue in all observers’ segmentations.

Figure 2:

FWHM Full Width at Half Maximum; SD Standard Deviations.

Discussion

Comparison of the three methods for segmentation of infarct tissue in the ventricle showed that our automatic method produced the highest Dice score and the lowest volume difference compared with the consensus ground truth segmentation. The automatic method does not require any manual intervention and therefore produces highly reproducible and objective results. The nSD method resulted in large variations in the Dice score and volume difference for all values of n studied. Although the 6SD and 8SD methods yielded relatively good results, they relied on manual identification and removal of isolated pixels or artifacts. Furthermore, the segmentation of different patients’ images differed markedly between different observers, making the nSD method highly variable and unobjective. The FWHM method resulted in the lowest Dice score and highest volume difference compared with the automatic, 6SD, and 8SD methods, but was less variable when different observers segmented the images.

Both the nSD method and the FWHM method have been widely used for clinical image segmentation [10, 16]. Our results are consistent with those of earlier studies in which the 6SD method (n=2–6) produced the highest Dice scores [11] and the FWHM method yielded lower Dice scores [12, 13]. Compared with the nSD method, the FWHM method has greater reproducibility. However, both methods need manual input to select the ROIs for threshold computation and for artifact removal [11].

Several automatic or semiautomatic methods are used to segment infarct tissue [11]. The advantage of our method is that it is fully automated, which is desirable when imaging datasets from a large number of patients are being processed. However, our method is still limited by false segmentation introduced by noise, fatty tissue, or artifacts caused by an implantable cardioverter-defibrillator. In addition, although our method is fully automatic with respect to scar segmentation, it still requires manual segmentation of the endocardium and epicardium in the left ventricle.

Conclusions

We compared three methods for segmenting CMR images containing infarct tissue in the ventricle. Our automatic method proved to be highly repetitive and objective, producing the highest Dice scores and the lowest volume differences compared with the consensus ground truth segmentation. The 6SD and 8SD methods yielded relatively good metrics, but they produced considerable variation in image segmentation by different observers and therefore lacked reproducibility and objectivity. Although segmentation by the FWHM method suffered from less variation between observers, this algorithm yielded the lowest Dice scores and highest volume differences. The automatic method is therefore highly recommended for segmenting ventricular scar tissue, and may be useful for processing large imaging datasets produced in the clinic.

Acknowledgments

This work was supported by grants from the National Key Research and Development Program of China (2016YFC1301002 to Jianzeng Dong) and the National Natural Science Foundation of China (81901841 to Dongdong Deng; 81671650 and 81971569 to Yi He). Dongdong Deng also acknowledges support from Dalian University of Technology (DUT18RC(3)068). We thank Liwen Bianji, Edanz Group China (www.liwenbianji.cn/ac), for editing the English text of a draft of this manuscript.

References

MozaffarianD, BenjaminEJ, GoAS, ArnettDK, BlahaMJ, CushmanM, et al. Executive summary: heart disease and stroke statistics – 2016 update: a report from the American Heart Association. Circulation 2016;133:447–54.
AlexandreJ, SalouxE, DugueAE, LebonA, LemaitreA, RouleV, et al. Scar extent evaluated by late gadolinium enhancement CMR: a powerful predictor of long term appropriate ICD therapy in patients with coronary artery disease. J Cardiovasc Magn Reson 2013;15:12.
Ambale-VenkateshB, LimaJA. Cardiac MRI: a central prognostic tool in myocardial fibrosis. Nat Rev Cardiol 2015;12:18–29.
KaramitsosTD, ArvanitakiA, KarvounisH, NeubauerS, FerreiraVM. Myocardial tissue characterization and fibrosis by imaging. JACC Cardiovascular Imaging 2020;13:1221–34.
DisertoriM, RigoniM, PaceN, CasoloG, MaseM, GonziniL, et al. Myocardial fibrosis assessment by LGE Is a powerful predictor of ventricular tachyarrhythmias in ischemic and nonischemic LV dysfunction a meta-analysis. JACC Cardiovasc Imaging 2016;9:1046–55.
KatoS, SaitoN, KirigayaH, GyotokuD, IinumaN, KusakawaY, et al. Prognostic significance of quantitative assessment of focal myocardial fibrosis in patients with heart failure with preserved ejection fraction. Int J Cardiol 2015;191:314–9.
Fernandez-ArmentaJ, BerruezoA, AndreuD, CamaraO, SilvaE, SerraL, et al. Three-dimensional architecture of scar and conducting channels based on high resolution ce-CMR: insights for ventricular tachycardia ablation. Circ Arrhythm Electrophysiol 2013;6:528–37.
Schulz-MengerJ, BluemkeDA, BremerichJ, FlammSD, FogelMA, FriedrichMG, et al. Standardized image interpretation and post processing in cardiovascular magnetic resonance: Society for Cardiovascular Magnetic Resonance (SCMR) Board of Trustees Task Force on Standardized Post Processing. J Cardiovasc Magn Reson 2013;15:35.
AmadoLC, GerberBL, GuptaSN, SzarfG, SchockR, NasirK, et al. Accurate and objective infarct sizing by contrast-enhanced magnetic resonance imaging in a canine myocardial infarction model. J Am Coll Cardiol 2004;44:2383–9.
FlettAS, HasletonJ, CookC, HausenloyD, QuartaG, AritiC, et al. Evaluation of techniques for the quantification of myocardial scar of differing etiology using cardiac magnetic resonance. JACC Cardiovasc Imaging 2011;4:150–6.
KarimR, BhagirathP, ClausP, HousdenRJ, ChenZ, KarimaghalooZ, et al. Evaluation of state-of-the-art segmentation algorithms for left ventricle infarct from late gadolinium enhancement MR images. Med Image Anal 2016;30:95–107.
CarminatiMC, BoniottiC, FusiniL, AndreiniD, PontoneG, PepiM, et al. Comparison of image processing techniques for nonviable tissue quantification in late gadolinium enhancement cardiac magnetic resonance images. J Thorac Imaging 2016;31:168–76.
LiuD, MaX, LiuJ, ZhaoL, ChenH, XuL, et al. Quantitative analysis of late gadolinium enhancement in hypertrophic cardiomyopathy: comparison of diagnostic performance in myocardial fibrosis between gadobutrol and gadopentetate dimeglumine. Int J Cardiovasc Imaging 2017;33:1191–200.
PopM, GhugreNR, RamananV, MorikawaL, StaniszG, DickAJ, et al. Quantification of fibrosis in infarcted swine hearts by ex vivo late gadolinium-enhancement and diffusion-weighted MRI methods. Phys Med Biol 2013;58:5009–28.
HennemuthA, FrimanO, HuellebrandM, PeitgenH-O. Mixture-model-based segmentation of myocardial delayed enhancement MRI. In: CamaraO, MansiT, PopM, RhodeK, SermesantM, YoungA , editors. Statistical atlases and computational models of the heart. Imaging and modelling challenges. STACOM 2012. Lecture notes in computer science, vol 7746. Berlin: Springer; 2013. pp. 87–96.
RajchlM, StirratJ, GoubranM, YuJ, SchollD, PetersTM, et al. Comparison of semi-automated scar quantification techniques using high-resolution, 3-dimensional late-gadolinium-enhancement magnetic resonance imaging. Int J Cardiovasc Imaging 2015;31:349–57.

Author and article information

Journal

Journal ID (publisher-id): CVIA

Title: Cardiovascular Innovations and Applications

Abbreviated Title: CVIA

Publisher: Compuscript (Ireland )

ISSN (Electronic): 2009-8782

ISSN (Print): 2009-8618

Publication date (Print): November 2020

Publication date (Electronic): November 2020

Volume: 5

Issue: 2

Pages: 89-95

Affiliations

[1] ¹Department of Cardiology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China

[2] ²School of Biomedical Engineering, Dalian University of Technology, Dalian, Liaoning, China

[3] ³Department of Cardiology, Beijing Anzhen Hospital, Capital Medical University and National Clinical Research Center for Cardiovascular Diseases, Beijing, China

[4] ⁴Department of Cardiology, Beijing Friendship Hospital, Capital Medical University, Beijing, China

Author notes

Correspondence: Jianzeng Dong, PhD, MD, No. 1 Dragon Lake Central Road, Golden Water District, Zhengzhou, 450046 Henan, China, Tel.: (+86) 18612596084, E-mail: jz_dong@ 123456126.com

^aThese authors contributed equally.

Article

Publisher ID: cvia.2019.0574

DOI: 10.15212/CVIA.2019.0574

SO-VID: 5f5109ce-3d09-47a0-8cae-8257c69d8ede

License:

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 Unported License (CC BY-NC 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See https://creativecommons.org/licenses/by-nc/4.0/.

History

Date received : 17 December 2019

Date revision received : 03 March 2020

Date accepted : 01 May 2020

Comments

[1] MozaffarianD, BenjaminEJ, GoAS, ArnettDK, BlahaMJ, CushmanM, et al. Executive summary: heart disease and stroke statistics – 2016 update: a report from the American Heart Association. Circulation 2016;133:447–54.

[2] AlexandreJ, SalouxE, DugueAE, LebonA, LemaitreA, RouleV, et al. Scar extent evaluated by late gadolinium enhancement CMR: a powerful predictor of long term appropriate ICD therapy in patients with coronary artery disease. J Cardiovasc Magn Reson 2013;15:12.

[3] Ambale-VenkateshB, LimaJA. Cardiac MRI: a central prognostic tool in myocardial fibrosis. Nat Rev Cardiol 2015;12:18–29.

[4] KaramitsosTD, ArvanitakiA, KarvounisH, NeubauerS, FerreiraVM. Myocardial tissue characterization and fibrosis by imaging. JACC Cardiovascular Imaging 2020;13:1221–34.

[5] DisertoriM, RigoniM, PaceN, CasoloG, MaseM, GonziniL, et al. Myocardial fibrosis assessment by LGE Is a powerful predictor of ventricular tachyarrhythmias in ischemic and nonischemic LV dysfunction a meta-analysis. JACC Cardiovasc Imaging 2016;9:1046–55.

[6] KatoS, SaitoN, KirigayaH, GyotokuD, IinumaN, KusakawaY, et al. Prognostic significance of quantitative assessment of focal myocardial fibrosis in patients with heart failure with preserved ejection fraction. Int J Cardiol 2015;191:314–9.

[7] Fernandez-ArmentaJ, BerruezoA, AndreuD, CamaraO, SilvaE, SerraL, et al. Three-dimensional architecture of scar and conducting channels based on high resolution ce-CMR: insights for ventricular tachycardia ablation. Circ Arrhythm Electrophysiol 2013;6:528–37.

[8] Schulz-MengerJ, BluemkeDA, BremerichJ, FlammSD, FogelMA, FriedrichMG, et al. Standardized image interpretation and post processing in cardiovascular magnetic resonance: Society for Cardiovascular Magnetic Resonance (SCMR) Board of Trustees Task Force on Standardized Post Processing. J Cardiovasc Magn Reson 2013;15:35.

[9] AmadoLC, GerberBL, GuptaSN, SzarfG, SchockR, NasirK, et al. Accurate and objective infarct sizing by contrast-enhanced magnetic resonance imaging in a canine myocardial infarction model. J Am Coll Cardiol 2004;44:2383–9.

[10] FlettAS, HasletonJ, CookC, HausenloyD, QuartaG, AritiC, et al. Evaluation of techniques for the quantification of myocardial scar of differing etiology using cardiac magnetic resonance. JACC Cardiovasc Imaging 2011;4:150–6.

[11] KarimR, BhagirathP, ClausP, HousdenRJ, ChenZ, KarimaghalooZ, et al. Evaluation of state-of-the-art segmentation algorithms for left ventricle infarct from late gadolinium enhancement MR images. Med Image Anal 2016;30:95–107.

[12] CarminatiMC, BoniottiC, FusiniL, AndreiniD, PontoneG, PepiM, et al. Comparison of image processing techniques for nonviable tissue quantification in late gadolinium enhancement cardiac magnetic resonance images. J Thorac Imaging 2016;31:168–76.

[13] LiuD, MaX, LiuJ, ZhaoL, ChenH, XuL, et al. Quantitative analysis of late gadolinium enhancement in hypertrophic cardiomyopathy: comparison of diagnostic performance in myocardial fibrosis between gadobutrol and gadopentetate dimeglumine. Int J Cardiovasc Imaging 2017;33:1191–200.

[14] PopM, GhugreNR, RamananV, MorikawaL, StaniszG, DickAJ, et al. Quantification of fibrosis in infarcted swine hearts by ex vivo late gadolinium-enhancement and diffusion-weighted MRI methods. Phys Med Biol 2013;58:5009–28.

[15] HennemuthA, FrimanO, HuellebrandM, PeitgenH-O. Mixture-model-based segmentation of myocardial delayed enhancement MRI. In: CamaraO, MansiT, PopM, RhodeK, SermesantM, YoungA , editors. Statistical atlases and computational models of the heart. Imaging and modelling challenges. STACOM 2012. Lecture notes in computer science, vol 7746. Berlin: Springer; 2013. pp. 87–96.

[16] RajchlM, StirratJ, GoubranM, YuJ, SchollD, PetersTM, et al. Comparison of semi-automated scar quantification techniques using high-resolution, 3-dimensional late-gadolinium-enhancement magnetic resonance imaging. Int J Cardiovasc Imaging 2015;31:349–57.

Cardiovascular Innovations and Applications