+1 Recommend
3 collections
      • Record: found
      • Abstract: found
      • Conference Proceedings: found
      Is Open Access

      Multiomics integration-based molecular characterizations in COPD and post-COVID

      1 , 2
      25-26 October 2023
      Multi-omics, Consensus clustering algorithm, COPD, COVID-19, Respiratory disease, Smoking


            Pulmonologists and patients alike are currently confined to battling obstructive lung diseases such as Asthma, COPD and post-COVID with unrefined tools that lump together these spectrum disorders under a single umbrella. We have previously shown that integration of a multiple omics datasets can drastically improve the statistical power to detects subgroups in small clinical cohorts in COPD [1]. Integration of 9 omics data blocks collected from multiple molecular levels (mRNA, microRNA, proteomes, and metabolomes) and multiple anatomical locations (airway epithelium, lung resident immune cells, airway exudates, exosomes, and serum) clearly demonstrated that multi-omics integration using Similarity Network Fusion (SNF) drastically improve the accuracy of group classification. Specifically, the mean accuracy of classification improved from 0.27 for single-omics platforms, to 0.90 for 7-tuple omics, reducing the n needed for the desired 95% accuracy from n=30 for single omics datasets, to n=6 for 7-tuple omics datasets. For the best performing network combinations integration of 5 or more platforms provided 100% correct classification, with sub-groups of n=6-7. The fact that multi-omics integration facilitated correct classification of mild-moderate COPD in the presence of a strong confounder such as current smoking in an unsupervised fashion is unprecedented, as smoking causes alterations of up to 50% of the bio- molecules in the lung. However, missing data represents a major limiting factor in such studies. Missingness of specific data modalities for different subjects will result in vastly different numbers omics combinations available for each pair of subject, as well as variations in sample sizes (n) between the respective omics combinations. To address this issue, we have developed an extension of the Consensus clustering algorithm for integration of multi-omics data, ccml ( cran.rstudio.com/web/packages/ccml) [2], to facilitate inclusion of omics data sets with unequal numbers of missing labels in multi-omics predictions in clinical cohorts. Evaluation of the ccml algorithm using the Karolinska COSMIC cohort demonstrate that ccml effectively predicts molecularly distinct subgroups from the integration of 9 omics datasets, allowing for as much as 60% data missingness in individual data modalities when 5 or more data platforms are integrated. Ccml is a downstream tool for multi-omics integration analysis that mitigates the limitations posed by missing data, a prevalent issue in human cohort studies involving multiple data modalities. In this presentation, we will discuss the application of the above described integration tools in our Karolinska COSMIC cohort ( clinicaltrials.gov/ct2/show/NCT02627872) investigating sex differences in COPD, as well as out HemCOV cohrt (clinicaltrials.gov/study/NCT05894616) investigating post-COVID with respiratory manifestations.


            Author and article information

            23 October 2023
            [1 ] Respiratory Medicine Unit, Dept of Medicine Solna & Centre for Molecular Medicine, Karolinska Institutet (Sweden) ( https://ror.org/056d84691)
            [2 ] Department of Respiratory Medicine and Allergy, Karolinska University Hospital, Stockholm (Sweden) ( https://ror.org/00m8d6786)
            Author information

            Published under Creative Commons Attribution 4.0 International ( CC BY 4.0). Users are allowed to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material for any purpose, even commercially), as long as the authors and the publisher are explicitly identified and properly acknowledged as the original source.

            Stockholm, Sweden
            25-26 October 2023



            Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
            Pharmacology & Pharmaceutical medicine
            Multi-omics,Consensus clustering algorithm,COPD,COVID-19,Respiratory disease,Smoking


            1. Li Chuan-Xing, Wheelock Craig E., Sköld C. Magnus, Wheelock Åsa M.. Integration of multi-omics datasets enables molecular classification of COPD. European Respiratory Journal. Vol. 51(5)2018. European Respiratory Society (ERS). [Cross Ref]


            Comment on this article