Disease modules provide unique insights into the mechanisms of complex diseases and lay the foundation for mechanistic drug repurposing. Algorithms for their identification leverage biological networks to extend an initial set of disease-associated genes (seeds) into subnetworks reflecting biological processes likely to be integral components of the investigated disease. These subnetworks can unveil causal pathways and provide drug repurposing efforts with promising new targets for therapeutics. Various computational methods have been developed for disease module identification. Since these methods differ in their modeling assumptions and techniques, evaluating various tools across different parameters to optimize for a specific use case is advisable. However, this can be tedious since the individual tools require specific installation and input preparation procedures. Moreover, identifying the best modules is not straightforward and requires topological and biological validation strategies.To mitigate this, we developed a comprehensive pipeline for disease module identification and validation utilizing the workflow software Nextflow. Our pipeline automatically deploys software dependencies using Docker, making installation easy. It prepares the inputs for and runs five popular module detection tools. The generated outputs are annotated with drug-repurposing relevant information, converted into a unified BioPAX format, and extensively validated. This includes assessing the biological relevance based on overrepresentation analysis and the dedicated software DIGEST, as well as robustness and consistency analyses.With our contribution, we allow the community to systematically compare different approaches for disease module discovery, thus contributing to robustness and reproducibility in systems and network medicine.
Levi Hagai, Elkon Ran, Shamir Ron. DOMINO: a network‐based active module identification algorithm with reduced rate of false calls. Molecular Systems Biology. Vol. 17(1)2021. Springer Science and Business Media LLC. [Cross Ref]
Ghiassian Susan Dina, Menche Jörg, Barabási Albert-László. A DIseAse MOdule Detection (DIAMOnD) Algorithm Derived from a Systematic Analysis of Connectivity Patterns of Disease Proteins in the Human Interactome. PLOS Computational Biology. Vol. 11(4)2015. Public Library of Science (PLoS). [Cross Ref]
Bernett Judith, Krupke Dominik, Sadegh Sepideh, Baumbach Jan, Fekete Sándor P, Kacprowski Tim, List Markus, Blumenthal David B. Robust disease module mining via enumeration of diverse prize-collecting Steiner trees. Bioinformatics. Vol. 38(6):1600–1606. 2022. Oxford University Press (OUP). [Cross Ref]
Di Tommaso Paolo, Chatzou Maria, Floden Evan W, Barja Pablo Prieto, Palumbo Emilio, Notredame Cedric. Nextflow enables reproducible computational workflows. Nature Biotechnology. Vol. 35(4):316–319. 2017. Springer Science and Business Media LLC. [Cross Ref]