Proteomic meta-study harmonization, mechanotyping and drug repurposing candidate prediction with ProHarMeD

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Proteomics technologies, which include a diverse range of approaches such as mass spectrometry-based, array-based, and others, are key technologies for the identification of biomarkers and disease mechanisms, referred to as mechanotyping. Despite over 15,000 published studies in 2022 alone, leveraging publicly available proteomics data for biomarker identification, mechanotyping and drug target identification is not readily possible. Proteomic data addressing similar biological/biomedical questions are made available by multiple research groups in different locations using different model organisms. Furthermore, not only various organisms are employed but different assay systems, such as in vitro and in vivo systems, are used. Finally, even though proteomics data are deposited in public databases, such as ProteomeXchange, they are provided at different levels of detail. Thus, data integration is hampered by non-harmonized usage of identifiers when reviewing the literature or performing meta-analyses to consolidate existing publications into a joint picture. To address this problem, we present ProHarMeD, a tool for harmonizing and comparing proteomics data gathered in multiple studies and for the extraction of disease mechanisms and putative drug repurposing candidates. It is available as a website, Python library and R package. ProHarMeD facilitates ID and name conversions between protein and gene levels, or organisms via ortholog mapping, and provides detailed logs on the loss and gain of IDs after each step. The web tool further determines IDs shared by different studies, proposes potential disease mechanisms as well as drug repurposing candidates automatically, and visualizes these results interactively. We apply ProHarMeD to a set of four studies on bone regeneration. First, we demonstrate the benefit of ID harmonization which increases the number of shared genes between studies by 50%. Second, we identify a potential disease mechanism, with five corresponding drug targets, and the top 20 putative drug repurposing candidates, of which Fondaparinux, the candidate with the highest score, and multiple others are known to have an impact on bone regeneration. Hence, ProHarMeD allows users to harmonize multi-centric proteomics research data in meta-analyses, evaluates the success of the ID conversions and remappings, and finally, it closes the gaps between proteomics, disease mechanism mining and drug repurposing. It is publicly available at https://apps.cosy.bio/proharmed/.

Related collections

Most cited references 52

Record: found
Abstract: not found
Article: not found

Random Forests

Leo Breiman (2001)

0 comments Cited 9386 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

The FAIR Guiding Principles for scientific data management and stewardship

Mark D Wilkinson, Michel Dumontier, IJsbrand Jan Aalbersberg … (2017)

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.

0 comments Cited 3804 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Jürgen Cox, Matthias Mann (2008)

Efficient analysis of very large amounts of raw data for peptide identification and protein quantification is a principal challenge in mass spectrometry (MS)-based proteomics. Here we describe MaxQuant, an integrated suite of algorithms specifically developed for high-resolution, quantitative MS data. Using correlation analysis and graph theory, MaxQuant detects peaks, isotope clusters and stable amino acid isotope-labeled (SILAC) peptide pairs as three-dimensional objects in m/z, elution time and signal intensity space. By integrating multiple mass measurements and correcting for linear and nonlinear mass offsets, we achieve mass accuracy in the p.p.b. range, a sixfold increase over standard techniques. We increase the proportion of identified fragmentation spectra to 73% for SILAC peptide pairs via unambiguous assignment of isotope and missed-cleavage state and individual mass precision. MaxQuant automatically quantifies several hundred thousand peptides per SILAC-proteome experiment and allows statistically robust identification and quantification of >4,000 proteins in mammalian cell lysates.

0 comments Cited 3674 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Tanja Laske:

ORCID: http://orcid.org/0000-0002-7922-7595

tanja.laske@uni-hamburg.de

Journal

Journal ID (nlm-ta): NPJ Syst Biol Appl

Journal ID (iso-abbrev): NPJ Syst Biol Appl

Title: NPJ Systems Biology and Applications

Publisher: Nature Publishing Group UK (London )

ISSN (Electronic): 2056-7189

Publication date (Electronic): 10 October 2023

Publication date PMC-release: 10 October 2023

Publication date Collection: 2023

Volume: 9

Electronic Location Identifier: 49

Affiliations

[1 ]Institute for Computational Systems Biology, University of Hamburg, ( https://ror.org/00g30e956) Hamburg, 22607 Germany

[2 ]Department of Preclinical Development and Validation, Fraunhofer Institute for Cell Therapy and Immunology IZI, ( https://ror.org/04x45f476) Leipzig, Germany

[3 ]Chair of Proteomics and Bioanalytics, Technical University of Munich, ( https://ror.org/02kkvpp62) Freising, Germany

[4 ]Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, ( https://ror.org/02kkvpp62) Freising, Germany

[5 ]Department of Mathematics and Computer Science, University of Southern Denmark, ( https://ror.org/03yrrjy16) Odense, 5230 Denmark

Author information

Klaudia Adamowicz http://orcid.org/0000-0002-9418-4386

Lis Arend http://orcid.org/0000-0001-7990-8385

Andreas Maier http://orcid.org/0000-0003-4408-0068

Johannes R. Schmidt http://orcid.org/0000-0002-2026-9715

Bernhard Kuster http://orcid.org/0000-0002-9094-1677

Jan Baumbach http://orcid.org/0000-0002-0282-0462

Tanja Laske http://orcid.org/0000-0002-7922-7595

Article

Publisher ID: 311

DOI: 10.1038/s41540-023-00311-7

PMC ID: 10564802

PubMed ID: 37816770

SO-VID: bc0dbd07-ffbe-44ae-80d1-5a9249442b2c

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received : 20 June 2023

Date accepted : 25 September 2023

Funding

Funded by: FundRef https://doi.org/10.13039/501100002347, Bundesministerium für Bildung und Forschung (Federal Ministry of Education and Research);

Award ID: 01ZX2210D

Award ID: 01ZX1910D

Award ID: 01ZX1910B

Award ID: 161L0214A

Award Recipient : Andreas Maier Johannes R. Schmidt Bernhard Kuster Olga Zolotareva Tanja Laske

Funded by: FundRef https://doi.org/10.13039/100010661, EC | Horizon 2020 Framework Programme (EU Framework Programme for Research and Innovation H2020);

Award ID: 777111

Award ID: 101057619

Award Recipient : Andreas Maier Jan Baumbach

Funded by: FundRef https://doi.org/10.13039/100008398, Villum Fonden (Villum Foundation);

Award ID: 13154

Award Recipient : Jan Baumbach

Funded by: FundRef https://doi.org/10.13039/501100007352, Federal Department of Economic Affairs, Education and Research, Switzerland | Staatssekretariat für Bildung, Forschung und Innovation (State Secretariat for Education, Research and Innovation);

Award ID: 22.00115

Award Recipient : Jan Baumbach

Custom metadata

Keywords: computational biology and bioinformatics,systems biology

Data availability:

Keywords: computational biology and bioinformatics, systems biology

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Proteomic meta-study harmonization, mechanotyping and drug repurposing candidate prediction with ProHarMeD

Read this article at

Abstract

Related collections

Drug Repurposing Research Collection

Most cited references 52

Random Forests

The FAIR Guiding Principles for scientific data management and stewardship

MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Author and article information

Contributors

Journal

Affiliations

Author information

Article

History

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 214