Grouping Total Variation and Sparsity: Statistical Learning with Segmenting Penalties

Prediction from medical images is a valuable aid to diagnosis. For instance, anatomical MR images can reveal certain disease conditions, while their functional counterparts can predict neuropsychiatric phenotypes. However, a physician will not rely on pre

PDF / 799,734 Bytes
9 Pages / 439.363 x 666.131 pts Page_size
24 Downloads / 216 Views

DOWNLOAD

REPORT

Abstract. Prediction from medical images is a valuable aid to diagnosis. For instance, anatomical MR images can reveal certain disease conditions, while their functional counterparts can predict neuropsychiatric phenotypes. However, a physician will not rely on predictions by black-box models: understanding the anatomical or functional features that underpin decision is critical. Generally, the weight vectors of classiﬁers are not easily amenable to such an examination: Often there is no apparent structure. Indeed, this is not only a prediction task, but also an inverse problem that calls for adequate regularization. We address this challenge by introducing a convex region-selecting penalty. Our penalty combines total-variation regularization, enforcing spatial contiguity, and 1 regularization, enforcing sparsity, into one group: Voxels are either active with non-zero spatial derivative or zero with inactive spatial derivative. This leads to segmenting contiguous spatial regions (inside which the signal can vary freely) against a background of zeros. Such segmentation of medical images in a target-informed manner is an important analysis tool. On several prediction problems from brain MRI, the penalty shows good segmentation. Given the size of medical images, computational eﬃciency is key. Keeping this in mind, we contribute an eﬃcient optimization scheme that brings signiﬁcant computational gains.

1

Introduction

For certain pathologies, medical images carry weak indicators of external phenotype. For instance, in Magnetic Resonance images, a pattern of brain atrophy centered on the thalamus predicts the evolution in Alzheimer’s disease [19]. Functional Magnetic Resonance Imaging (fMRI) can be used to infer subjects’ behavioral state from their brain activity [11]. Machine learning methods can identify these biomarkers. With linear predictors, the weight vectors form spatial maps in the image domain. However, minimizing a prediction error gives little control on the corresponding maps. Indeed, the prediction problem is often an ill-posed inverse problem in the sense that there are less samples than features available: many diﬀerent weight maps can generate exactly the same predictions. A choice among these candidates is implicitly taken by the estimator employed. In the empirical risk minimization framework, this choice is imposed via a penalty which favors maps according to certain criteria, interpretable as a “prior”. Sparsity for c Springer International Publishing Switzerland 2015 N. Navab et al. (Eds.): MICCAI 2015, Part I, LNCS 9349, pp. 685–693, 2015. DOI: 10.1007/978-3-319-24553-9_84

686

M. Eickenberg et al.

instance, imposable in convex optimization via the 1 norm, is very useful as it selects a small number of voxels for the prediction. It has been widely used in medical imaging, from fMRI [21] to regularizing diﬀeomorphic registration [8]. However, imposing sparsity can often lead to less stable weight maps. Indeed, for images with high spatial correlations, adjacent voxels contain similar infor

Data Loading...

Grouping Total Variation and Sparsity: Statistical Learning with Segmenting Penalties

Recommend Documents

The application of joint sparsity and total variation minimization algorithms to a real-life art restoration problem

Prediction in Total Variation: Characterizations

Communication-efficient distributed multi-task learning with matrix sparsity regularization

Segmenting Continuous but Sparsely-Labeled Structures in Super-Resolution Microscopy Using Perceptual Grouping

Top-down grouping affects adjacent dependency learning

Statistical Relational Learning with Soft Quantifiers

Statistical Learning and Kernel Methods

Statistical Learning and Its Consequences

Neural Networks and Statistical Learning

Texture Smoothing Based on Adaptive Total Variation

Grouping

Characterizing HPC Performance Variation with Monitoring and Unsupervised Learning