Copula Approach for Developing a Biomarker Panel for Prediction of Dengue Hemorrhagic Fever

  • PDF / 1,208,637 Bytes
  • 16 Pages / 439.37 x 666.142 pts Page_size
  • 81 Downloads / 191 Views

DOWNLOAD

REPORT


Copula Approach for Developing a Biomarker Panel for Prediction of Dengue Hemorrhagic Fever Jong‑Min Kim1 · Hyunsu Ju2 · Yoonsung Jung3  Received: 12 August 2019 / Revised: 9 May 2020 / Accepted: 21 May 2020 © Springer-Verlag GmbH Germany, part of Springer Nature 2020

Abstract The choice of variable-selection methods to identify important variables for binary classification modeling is critical for producing stable statistical models that are interpretable, that generate accurate predictions, and have minimal bias. This work is motivated by the availability of data on clinical and laboratory features of dengue fever infections obtained from 51 individuals enrolled in a prospective observational study of acute human dengue infections. Our paper uses objective Bayesian method to identify important variables for dengue hemorrhagic fever (DHF) over the dengue data set. With the selected important variables by objective Bayesian method, we employ a Gaussian copula marginal regression model considering correlation error structure and a general method of semi-parametric Bayesian inference for Gaussian copula model to estimate, separately, the marginal distribution and dependence structure. We also carry out a receiver operating characteristic (ROC) analysis for the predictive model for DHF and compare our proposed model with the other models of Ju and Brasier (Variable selection methods for developing a biomarker panel for prediction of dengue hemorrhagic fever. BMC Res Notes 6:365, 2013) tested on the basis of the ROC analysis. Our results extend the previous models of DHF by suggesting that IL-10, Days Fever, Sex and Lymphocytes are the major features for predicting DHF on the basis of blood chemistries and cytokine measurements. In addition, the dependence structure of these Days Fever, Lymphocytes, IL-10 and Sex protein profiles associated with disease outcomes was discovered by the semi-parametric Bayesian Gaussian copula model and Gaussian partial correlation method.

* Yoonsung Jung [email protected] 1

Statistics Discipline, Division of Science and Mathematics, University of Minnesota-Morris, Morris, MN 56267, USA

2

U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA

3

Cooperative Agricultural Research Center, College of Agriculture and Human Sciences, Prairie View A&M University, Prairie View, TX 77446, USA



13

Vol.:(0123456789)



Annals of Data Science

Keywords  Copula · Variable selection · ROC · Biomarker · Dependence

1 Introduction Acute dengue viral infections, hyperendemic to the tropics, are major cause of morbidity in tropical countries. There is no drug therapy or vaccine treatment now. Dengue hemorrhagic fever (DHF) is a late-stage complication of acute dengue infection primarily associated with capillary leakage, hemorrhage, circulatory shock, and representing life-threatening complications. Early detection of DHF may help identify patients that would benefit from more intensive therapy. Ahmad [2] proposed hyperbolic Sine Rayleigh distribution capable