Accurate Prediction of Transcriptional Activity of Single Missense Variants in HIV Tat with Deep Learning

Int J Mol Sci. 2023 Mar 24;24(7):6138. doi: 10.3390/ijms24076138.

Abstract

Tat is an essential gene for increasing the transcription of all HIV genes, and affects HIV replication, HIV exit from latency, and AIDS progression. The Tat gene frequently mutates in vivo and produces variants with diverse activities, contributing to HIV viral heterogeneity as well as drug-resistant clones. Thus, identifying the transcriptional activities of Tat variants will help to better understand AIDS pathology and treatment. We recently reported the missense mutation landscape of all single amino acid Tat variants. In these experiments, a fraction of double missense alleles exhibited intragenic epistasis. However, it is too time-consuming and costly to determine the effect of the variants for all double mutant alleles through experiments. Therefore, we propose a combined GigaAssay/deep learning approach. As a first step to determine activity landscapes for complex variants, we evaluated a deep learning framework using previously reported GigaAssay experiments to predict how transcription activity is affected by Tat variants with single missense substitutions. Our approach achieved a 0.94 Pearson correlation coefficient when comparing the predicted to experimental activities. This hybrid approach can be extensible to more complex Tat alleles for a better understanding of the genetic control of HIV genome transcription.

Keywords: HIV; deep learning; tat protein; variant effect.

MeSH terms

  • Acquired Immunodeficiency Syndrome*
  • Deep Learning*
  • Humans
  • Mutation, Missense
  • Transcription, Genetic
  • Transcriptional Activation
  • tat Gene Products, Human Immunodeficiency Virus / genetics
  • tat Gene Products, Human Immunodeficiency Virus / metabolism

Substances

  • tat Gene Products, Human Immunodeficiency Virus