Leveraging Mid-Level Semantic Boundary Cues for Automated Lymph Node Detection

Histograms of oriented gradients (HOG) are widely employed image descriptors in modern computer-aided diagnosis systems. Built upon a set of local, robust statistics of low-level image gradients, HOG features are usually computed on raw intensity images.

PDF / 1,582,905 Bytes
9 Pages / 439.363 x 666.131 pts Page_size
80 Downloads / 162 Views

DOWNLOAD

REPORT

Abstract. Histograms of oriented gradients (HOG) are widely employed image descriptors in modern computer-aided diagnosis systems. Built upon a set of local, robust statistics of low-level image gradients, HOG features are usually computed on raw intensity images. In this paper, we explore a learned image transformation scheme for producing higher-level inputs to HOG. Leveraging semantic object boundary cues, our methods compute data-driven image feature maps via a supervised boundary detector. Compared with the raw image map, boundary cues oﬀer mid-level, more object-speciﬁc visual responses that can be suited for subsequent HOG encoding. We validate integrations of several image transformation maps with an application of computer-aided detection of lymph nodes on thoracoabdominal CT images. Our experiments demonstrate that semantic boundary cues based HOG descriptors complement and enrich the raw intensity alone. We observe an overall system with substantially improved results (∼ 78% versus 60% recall at 3 FP/volume for two target regions). The proposed system also moderately outperforms the state-of-the-art deep convolutional neural network (CNN) system in the mediastinum region, without relying on data augmentation and requiring signiﬁcantly fewer training samples.

1

Introduction

Quantitative assessment of lymph nodes (LNs) is routine in the daily radiological workﬂow. When measuring greater than 10 mm in short-axis diameter on an axial computed tomography (CT) slice, LNs are generally considered clinically relevant or actionable [13], indicative of diseases such as lung cancer, lymphoma, or inﬂammation. Manual detection of enlarged LNs, critical to determining disease progression and treatment response, is a time-consuming and error-prone process. Thus, there has been active research in recent years to develop accurate computer-aided lymph node detection (CADe) systems. A challenging object class for recognition, LNs exhibit substantial variation in appearance/location/pose as well as low contrast with surrounding anatomy on

The rights of this work are transferred to the extent transferable according to title 17 § 105 U.S.C.

c Springer International Publishing Switzerland 2015 (outside the US) N. Navab et al. (Eds.): MICCAI 2015, Part II, LNCS 9350, pp. 53–61, 2015. DOI: 10.1007/978-3-319-24571-3_7

54

A. Seﬀ et al.

CT scans. Recent work on LN CADe has varied according to the feature types and learning algorithms used for training. [1, 8] utilize direct 3D information from CT scans, performing boosting-based feature selection over a pool of 50–60 thousand 3D Haar wavelet features. Due to the curse of dimensionality (analyzed in [14]), such approaches can result in systems with limited sensitivity (e.g. 60.9% at 6.1 FP/scan for mediastinal LNs in [8]). Circumventing 3D feature computation during LN classiﬁcation, [14] implements a shallow hierarchy of linear models operating on 2D slices or views of LN candidate volumes of interest (VOIs) with histograms of oriented gradients (HOG) [3] features. Also

Data Loading...

Leveraging Mid-Level Semantic Boundary Cues for Automated Lymph Node Detection

Recommend Documents

Epitrochlear Lymph Node Dissection

Lymph Node Lesions

Lymph Node Classification

Salvage Lymph Node Dissection

Popliteal Lymph Node Dissection

Lymph Node Cytopathology

Sentinel lymph node biopsy for lung cancer

Lymph node dissection for esophageal cancer

Free Vascularized Lymph Node Transfer

Structure-preserving image smoothing with semantic cues

Weakly-Supervised Semantic Segmentation Using Motion Cues

An automated framework for advertisement detection and removal from sports videos using audio-visual cues