Learning region-guided scale-aware feature selection for object detection

PDF / 3,096,236 Bytes
15 Pages / 595.276 x 790.866 pts Page_size
1 Downloads / 255 Views

(0123456789().,-volV)(0123456789().,-volV)

ORIGINAL ARTICLE

Learning region-guided scale-aware feature selection for object detection Liu Liu1,2 • Rujing Wang2 • Chengjun Xie2 • Rui Li2 • Fangyuan Wang1,2 • Man Zhou1,2 Yue Teng1,2

•

Received: 8 May 2020 / Accepted: 24 September 2020 Springer-Verlag London Ltd., part of Springer Nature 2020

Abstract Scale variation is one of the major challenges in object detection task. Modern region-based object detection architectures often adopt Feature Pyramid Network (FPN) as feature extraction neck to achieve multi-scale feature representation in solving scale variation problem. However, due to the rough feature selection strategy in Region of Interest (RoI) feature extraction step, these methods might not perform well on object detection under strong scale variation. In this work, we are motivated by the limitations of current FPN-based two-stage object detectors and then present a novel module, namely scale-aware feature selective (SAFS) module, that flexibly and adaptively selects feature levels in two-stage object detectors. Specifically, we firstly build the RoI Pyramid in standard FPN structure to extract RoI features from various scale levels. Next, in order to achieve scale-aware mechanism for solving scale variation issue, we develop a novel weighting gate function containing one set of trainable parameters to automatically learn the fusion weight for each RoI feature level, which relieves the limitation of hard feature selection strategy guided by online instance size. Outputs from the RoI features with the learned weights are fused for classification and bounding box regression. Furthermore, we design a multilevel SAFS architecture to obtain different types of RoI feature combinations that ensures our method is more robust to various instance scales. Experimental results show that our SAFS module is very compatible with most of two-stage object detectors and could achieve state-of-the-art results with Average Precision of 48.3 on COCO test-dev and other popular object detection benchmarks. Our code will be made publicly available. Keywords Scale variation Object detection RoI Pyramid Scale-aware feature selective

1 Introduction In modern computer vision field, Convolutional Neural Network (CNN) has shown the high efficiency on automatically extracting powerful features on various visual tasks guided by supervised learning [1–4]. The past decade has witnessed the superior performance when CNN is employed in object detection architectures [5–7]. Among these methods, two-stage approaches are fast becoming a key instrument in generic object detection task due to its & Rujing Wang [email protected] 1

University of Science and Technology of China, Hefei 230026, China

2

Institute of Intelligent Machines, Chinese Academy of Sciences, Hefei 230031, China

high-quality candidate boxes outputted from Region Proposal Network (RPN). Furthermore, Feature Pyramid Network (FPN) [8], introduced as model neck component, can play a vital role in addressing the issue o

Data Loading...

Learning region-guided scale-aware feature selection for object detection

Recommend Documents

Machine Learning and Feature Selection Based Ransomware Detection Using Hexacodes

Single shot object detection with refined feature

Pyramid context learning for object detection

Learning Data Augmentation Strategies for Object Detection

Interactive Feature Growing for Accurate Object Detection in Megapixel Images

Glowing Window-Based Feature Extraction Technique for Object Detection

Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation

Statistical edge-based feature selection for counterfeit coin detection

Machine Learning for Web Intrusion Detection: A Comparative Analysis of Feature Selection Methods mRMR and PFI

Review on Deep Learning in Feature Selection

Feature Selection for Clustering

Automatic Feature Learning for Glaucoma Detection Based on Deep Learning