Benchmarking the Robustness of Semantic Segmentation Models with Respect to Common Corruptions
Christoph Kamann1 · Carsten Rother1

Received: 14 April 2020 / Accepted: 7 September 2020
© The Author(s) 2020
Abstract

When designing a semantic segmentation model for a real-world application, such as autonomous driving, it is crucial to understand the robustness of the network with respect to a wide range of image corruptions. While there are recent robustness studies for full-image classification, we are the first to present an exhaustive study for semantic segmentation, based on many established neural network architectures. We utilize almost 400,000 images generated from the Cityscapes dataset, PASCAL VOC 2012, and ADE20K. Based on the benchmark study, we gain several new insights. Firstly, many networks perform well with respect to real-world image corruptions, such as a realistic PSF blur. Secondly, some architecture properties significantly affect robustness, such as a Dense Prediction Cell, which is designed to maximize performance on clean data only. Thirdly, the generalization capability of semantic segmentation models depends strongly on the type of image corruption: models generalize well for image noise and image blur, but not for digitally corrupted data or weather corruptions.

Keywords Semantic segmentation · Corruption robustness · Common image corruptions · Realistic image corruptions
1 Introduction

In recent years, deep convolutional neural networks (DCNNs) have set the state of the art on a broad range of computer vision tasks (Krizhevsky et al. 2012; He et al. 2016; Simonyan and Zisserman 2015; Szegedy et al. 2015; LeCun et al. 1998; Redmon et al. 2016; Chen et al. 2015; Goodfellow et al. 2016). The performance of CNN models is generally measured using benchmarks on publicly available datasets, which often consist of clean and post-processed images (Cordts et al. 2016; Everingham et al. 2010). However, it has been shown that model performance is prone to image corruptions (Zhou et al. 2017; Vasiljevic et al. 2016; Hendrycks and Dietterich 2019; Geirhos et al. 2018; Dodge and Karam 2016; Gilmer et al. 2019; Azulay and Weiss 2019; Kamann and Rother 2020); image noise, in particular, decreases performance significantly.

Communicated by Daniel Scharstein.
Christoph Kamann: [email protected]
Carsten Rother: [email protected]

1 Visual Learning Lab, HCI/IWR, Heidelberg University, Heidelberg, Germany
Image quality depends on environmental factors, such as illumination and weather conditions, ambient temperature, and camera motion, since they directly affect the optical and electrical properties of a camera. Image quality is also affected by optical aberrations of the camera lenses, causing, e.g., image blur. Thus, in safety-critical applications, such as autonomous driving, models must be robust to such inherently present image corruptions (Hasirlioglu et al. 2016; Kamann et al. 2017; Janai et al. 2020). In this work, we present an extensive evaluation of the robustness of semantic segmentation models with respect to a broad range of image corruptions.
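To make the notion of a "common corruption" concrete, the following minimal sketch shows how benchmark-style corruptions of the kind discussed here, additive sensor noise and lens blur, can be applied to an image. It uses only NumPy; the function names, a simple box kernel as a stand-in for a realistic PSF, and the severity parameters are illustrative assumptions, not the corruption pipeline used in the paper.

```python
import numpy as np

def gaussian_noise(image, sigma=0.08, seed=0):
    """Additive Gaussian noise on a float image with values in [0, 1].

    sigma plays the role of a 'severity' parameter (assumed, not the
    paper's exact parameterization).
    """
    rng = np.random.default_rng(seed)
    noisy = image + rng.normal(0.0, sigma, image.shape)
    return np.clip(noisy, 0.0, 1.0)

def box_blur(image, k=5):
    """k x k box blur on a 2-D grayscale image.

    A crude stand-in for lens (PSF) blur; a realistic PSF would be a
    measured, spatially varying kernel rather than a uniform box.
    """
    pad = k // 2
    padded = np.pad(image, pad, mode="edge")
    out = np.zeros_like(image)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + image.shape[0], dx:dx + image.shape[1]]
    return out / (k * k)

# Corrupt a synthetic grayscale ramp image at one severity level.
img = np.linspace(0.0, 1.0, 64 * 64).reshape(64, 64)
noisy = gaussian_noise(img)
blurred = box_blur(img)
```

A benchmark then evaluates a fixed, clean-trained segmentation model on such corrupted copies of the validation set, sweeping corruption type and severity, and reports the degradation relative to the clean score.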