Predicting viral exposure response from modeling the changes of co-expression networks using time series gene expression

  • PDF / 1,682,714 Bytes
  • 18 Pages / 595 x 794 pts Page_size
  • 40 Downloads / 180 Views

DOWNLOAD

REPORT


RESEARCH ARTICLE

Open Access

Predicting viral exposure response from modeling the changes of co-expression networks using time series gene expression data Fangli Dong1,2 , Yong He3 , Tao Wang2,4 , Dong Han1 , Hui Lu2,4 and Hongyu Zhao2,5* *Correspondence: [email protected] 2 SJTU-Yale Joint Center of Biostatistics and Data Science, Shanghai Jiao Tong University, Dongchuan Road, 200240 Shanghai, China 5 Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven CT 06520, USA Full list of author information is available at the end of the article

Abstract Background: Deciphering the relationship between clinical responses and gene expression profiles may shed light on the mechanisms underlying diseases. Most existing literature has focused on exploring such relationship from cross-sectional gene expression data. It is likely that the dynamic nature of time-series gene expression data is more informative in predicting clinical response and revealing the physiological process of disease development. However, it remains challenging to extract useful dynamic information from time-series gene expression data. Results: We propose a statistical framework built on considering co-expression network changes across time from time series gene expression data. It first detects change point for co-expression networks and then employs a Bayesian multiple kernel learning method to predict exposure response. There are two main novelties in our method: the use of change point detection to characterize the co-expression network dynamics, and the use of kernel function to measure the similarity between subjects. Our algorithm allows exposure response prediction using dynamic network information across a collection of informative gene sets. Through parameter estimations, our model has clear biological interpretations. The performance of our method on the simulated data under different scenarios demonstrates that the proposed algorithm has better explanatory power and classification accuracy than commonly used machine learning algorithms. The application of our method to time series gene expression profiles measured in peripheral blood from a group of subjects with respiratory viral exposure shows that our method can predict exposure response at early stage (within 24 h) and the informative gene sets are enriched for pathways related to respiratory and influenza virus infection. (Continued on next page)

© The Author(s). 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Co