Integrating human omics data to prioritize candidate genes

Yong Chen, Xuebing Wu, Rui Jiang

Research output: Contribution to journalArticlepeer-review

32 Scopus citations

Abstract

Background: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. Methods. With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. Results: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. Conclusion: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.

Original languageEnglish (US)
Article number57
JournalBMC Medical Genomics
Volume6
Issue number1
DOIs
StatePublished - Dec 18 2013
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Genetics
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Integrating human omics data to prioritize candidate genes'. Together they form a unique fingerprint.

Cite this