Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform

  • PDF / 2,604,530 Bytes
  • 10 Pages / 595.276 x 790.866 pts Page_size
  • 103 Downloads / 171 Views

DOWNLOAD

REPORT


IMAGING INFORMATICS AND ARTIFICIAL INTELLIGENCE

Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform Isabella Fornacon-Wood 1 & Hitesh Mistry 1 & Christoph J. Ackermann 2 & Fiona Blackhall 1,3 & Andrew McPartlin 4 & Corinne Faivre-Finn 1,4 & Gareth J. Price 1 & James P. B. O’Connor 1,5 Received: 26 February 2020 / Revised: 28 March 2020 / Accepted: 14 May 2020 # The Author(s) 2020

Abstract Objective To investigate the effects of Image Biomarker Standardisation Initiative (IBSI) compliance, harmonisation of calculation settings and platform version on the statistical reliability of radiomic features and their corresponding ability to predict clinical outcome. Methods The statistical reliability of radiomic features was assessed retrospectively in three clinical datasets (patient numbers: 108 head and neck cancer, 37 small-cell lung cancer, 47 non-small-cell lung cancer). Features were calculated using four platforms (PyRadiomics, LIFEx, CERR and IBEX). PyRadiomics, LIFEx and CERR are IBSI-compliant, whereas IBEX is not. The effects of IBSI compliance, user-defined calculation settings and platform version were assessed by calculating intraclass correlation coefficients and confidence intervals. The influence of platform choice on the relationship between radiomic biomarkers and survival was evaluated using univariable cox regression in the largest dataset. Results The reliability of radiomic features calculated by the different software platforms was only excellent (ICC > 0.9) for 4/17 radiomic features when comparing all four platforms. Reliability improved to ICC > 0.9 for 15/17 radiomic features when analysis was restricted to the three IBSI-compliant platforms. Failure to harmonise calculation settings resulted in poor reliability, even across the IBSI-compliant platforms. Software platform version also had a marked effect on feature reliability in CERR and LIFEx. Features identified as having significant relationship to survival varied between platforms, as did the direction of hazard ratios. Conclusion IBSI compliance, user-defined calculation settings and choice of platform version all influence the statistical reliability and corresponding performance of prognostic models in radiomics. Key Points • Reliability of radiomic features varies between feature calculation platforms and with choice of software version. • Image Biomarker Standardisation Initiative (IBSI) compliance improves reliability of radiomic features across platforms, but only when calculation settings are harmonised. • IBSI compliance, user-defined calculation settings and choice of platform version collectively affect the prognostic value of features. Keywords Reliability of results . Biomarkers . Prognosis . Tomography, x-ray computed . Translation

Gareth J. Price and James P. B. O’Connor contributed equally to this work. Electronic supplementary material The online version of this article (https://doi.org/10.1007/s00330-020-06957-9) contains supplementary material, which is a