99
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Clique-based data mining for related genes in a biomedical database

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comprehensively search from biomedical databases for related genes, such as genes in the same families and genes encoding components of the same pathways. Here we address the extraction of related genes by searching for densely-connected subgraphs, which are modeled as cliques, in a biomedical relational graph.

          Results

          We constructed a graph whose nodes were gene or disease pages, and edges were the hyperlink connections between those pages in the Online Mendelian Inheritance in Man (OMIM) database. We obtained over 20,000 sets of related genes (called 'gene modules') by enumerating cliques computationally. The modules included genes in the same family, genes for proteins that form a complex, and genes for components of the same signaling pathway. The results of experiments using 'metabolic syndrome'-related gene modules show that the gene modules can be used to get a coherent holistic picture helpful for interpreting relations among genes.

          Conclusion

          We presented a data mining approach extracting related genes by enumerating cliques. The extracted gene sets provide a holistic picture useful for comprehending complex disease mechanisms.

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: not found

          Network-based prediction of protein function

          Functional annotation of proteins is a fundamental problem in the post-genomic era. The recent availability of protein interaction networks for many model species has spurred on the development of computational methods for interpreting such data in order to elucidate protein function. In this review, we describe the current computational approaches for the task, including direct methods, which propagate functional information through the network, and module-assisted methods, which infer functional modules within the network and use those for the annotation task. Although a broad variety of interesting approaches has been developed, further progress in the field will depend on systematic evaluation of the methods and their dissemination in the biological community.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Detecting community structure in networks

            M. Newman (2004)
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A comprehensive pathway map of epidermal growth factor receptor signaling

              The epidermal growth factor receptor (EGFR) signaling pathway is one of the most important pathways that regulate growth, survival, proliferation, and differentiation in mammalian cells. Reflecting this importance, it is one of the best-investigated signaling systems, both experimentally and computationally, and several computational models have been developed for dynamic analysis. A map of molecular interactions of the EGFR signaling system is a valuable resource for research in this area. In this paper, we present a comprehensive pathway map of EGFR signaling and other related pathways. The map reveals that the overall architecture of the pathway is a bow-tie (or hourglass) structure with several feedback loops. The map is created using CellDesigner software that enables us to graphically represent interactions using a well-defined and consistent graphical notation, and to store it in Systems Biology Markup Language (SBML).
                Bookmark

                Author and article information

                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central
                1471-2105
                2009
                1 July 2009
                : 10
                : 205
                Affiliations
                [1 ]Research and Development Headquarters, NTT DATA Corporation, Tokyo, 135-8671, Japan
                [2 ]The Advanced Algorithms Research Laboratory, The University of Electro-Communications, Tokyo, 182-8585, Japan
                [3 ]Research and Development Initiative, Chuo University, Tokyo, 112-8551, Japan
                [4 ]Medical Research Institute, Tokyo Medical and Dental University, Tokyo, 101-0062, Japan
                [5 ]Research Institute, HuBit Genomix Inc, Tokyo, 102-0092, Japan
                Article
                1471-2105-10-205
                10.1186/1471-2105-10-205
                2721841
                19566964
                88da4611-8971-47e3-9bec-89fa71599523
                Copyright © 2009 Matsunaga et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 17 December 2008
                : 1 July 2009
                Categories
                Research Article

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article