Reconstructing directed gene regulatory network by only gene expression data

BMC Genomics. 2016 Aug 18;17 Suppl 4(Suppl 4):430. doi: 10.1186/s12864-016-2791-2.

Abstract

Background: Accurately identifying gene regulatory networks is an important task in understanding in vivo biological activities. Such networks are often inferred from gene expression data. Many methods have been developed to evaluate gene expression dependencies between a transcription factor and its target genes, and some methods also eliminate transitive interactions. However, the regulatory (or edge) direction is undetermined if the target gene is also a transcription factor. Some methods predict the regulatory directions in gene regulatory networks by locating eQTL single nucleotide polymorphisms, or by observing the gene expression changes when knocking out or knocking down the candidate transcription factors; regrettably, these additional data are usually unavailable, especially for samples derived from human tissues.

Results: In this study, we propose the Context Based Dependency Network (CBDN), a method that infers gene regulatory networks, including regulatory directions, from gene expression data alone. To determine the regulatory direction, CBDN computes the influence of a source gene on a target gene by evaluating how much the expression dependencies between the target gene and the other genes change when conditioning on the source gene. CBDN extends the data processing inequality by incorporating the dependency direction to distinguish between direct and transitive relationships between genes. We also define two types of important regulators, which can influence a majority of the genes in the network directly or indirectly. CBDN can detect both types by averaging the influence functions of a candidate regulator on the other genes. In our experiments with simulated and real data, even with the regulatory direction taken into account, CBDN outperforms the state-of-the-art approaches for inferring gene regulatory networks. CBDN identifies the important regulators in the predicted networks: 1. TYROBP influences a group of genes that are related to Alzheimer's disease; 2. ZNF329 and RB1 significantly regulate the 'mesenchymal' gene expression signature genes for brain tumors.
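
The influence computation described in the Results can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that "expression dependency" is measured by Pearson correlation and that "conditioning on the source gene" means taking a first-order partial correlation, neither of which is stated in the abstract. The names partial_corr, influence_matrix and regulator_score are hypothetical, and the four-gene data set is simulated purely for illustration.

```python
import numpy as np


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y given z."""
    # Guard against division by zero when a correlation is exactly +/-1.
    denom = np.sqrt(max((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2), 1e-12))
    return (r_xy - r_xz * r_yz) / denom


def influence_matrix(expr):
    """Return D where D[s, t] is the assumed influence of source s on target t.

    expr is a (samples x genes) array with at least three genes.
    D[s, t] averages, over every other gene k, the absolute change in the
    dependency between t and k once the source s is conditioned on.
    """
    corr = np.corrcoef(expr, rowvar=False)
    n = corr.shape[0]
    D = np.zeros((n, n))
    for s in range(n):
        for t in range(n):
            if s == t:
                continue
            changes = [
                abs(corr[t, k] - partial_corr(corr[t, k], corr[t, s], corr[k, s]))
                for k in range(n)
                if k not in (s, t)
            ]
            D[s, t] = np.mean(changes)
    return D


# Simulated toy data (an assumption, not from the paper): gene 0 is a hub
# driving genes 1 and 2, and gene 3 is unrelated noise.
rng = np.random.default_rng(0)
hub = rng.normal(size=500)
expr = np.column_stack([
    hub,
    hub + rng.normal(size=500),
    hub + rng.normal(size=500),
    rng.normal(size=500),
])

D = influence_matrix(expr)

# Average influence of each gene on all other genes; a high value marks a
# candidate "important regulator" in the sense sketched here.
regulator_score = D.sum(axis=1) / (D.shape[0] - 1)
```

In this sketch, comparing D[s, t] with D[t, s] is one possible way to orient an edge between two genes, and ranking genes by regulator_score mirrors the idea of averaging a candidate regulator's influence on the other genes. The actual CBDN scoring and thresholds may differ.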

Conclusion: Using gene expression data alone, CBDN can efficiently infer the existence of gene-gene interactions as well as their regulatory directions. The constructed networks are helpful in identifying important regulators for complex diseases.

Keywords: Gene expression; Gene regulatory network; Important regulators; Regulatory direction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptor Proteins, Signal Transducing / genetics*
  • Algorithms
  • Alzheimer Disease / genetics*
  • Brain / metabolism
  • Brain / pathology
  • Computational Biology
  • Computer Simulation
  • DNA-Binding Proteins / genetics*
  • Gene Expression Regulation / genetics
  • Gene Regulatory Networks / genetics
  • Humans
  • Membrane Proteins / genetics*
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci / genetics
  • Retinoblastoma Binding Proteins / genetics*
  • Transcription Factors / genetics
  • Transcriptome
  • Ubiquitin-Protein Ligases / genetics*

Substances

  • Adaptor Proteins, Signal Transducing
  • DNA-Binding Proteins
  • Membrane Proteins
  • RB1 protein, human
  • Retinoblastoma Binding Proteins
  • TYROBP protein, human
  • Transcription Factors
  • ZNF330 protein, human
  • Ubiquitin-Protein Ligases