Logo detection using weakly supervised saliency map

PDF / 6,567,058 Bytes
25 Pages / 439.642 x 666.49 pts Page_size
86 Downloads / 344 Views

Logo detection using weakly supervised saliency map Gautam Kumar1 · Prateek Keserwani1 · Partha Pratim Roy1 · Debi Prosad Dogra2 Received: 17 July 2018 / Revised: 11 August 2020 / Accepted: 2 September 2020 / © Springer Science+Business Media, LLC, part of Springer Nature 2020

Abstract Box level annotation of a large number of logo images for training purpose of typical deep learning architecture is highly challenging. Thus, a method that can detect the logo with the help of training to remove box-level annotations can be helpful. In this paper, we present a method of logo detection that utilizes weakly supervised learning of Convolutional Neural Network (CNN) to generate a deep saliency map. The saliency map is generated from the back-propagated response of the CNN trained with the classification task. The saliency map produces responses for the regions of logos. GrabCut segmentation method has been applied then to obtain the bounding box corresponding to the logo class predicted by the CNN for a given image. AlexNet, CaffeNet, and VGGNet deep architectures has been finetuned for the classification purpose. The framework is further utilized for detection through a back-propagated saliency map. The performance of the proposed methodology has been validated on the FlickrLogos-32 logo benchmark dataset. The proposed method outperforms the state-of-the-art baseline fully supervised methods with mean average precision (mAP) of 75.83%. Keywords Convolutional neural network · Saliency map · GrabCut · Logo detection · Weakly supervised

Gautam Kumar

[email protected] Prateek Keserwani [email protected] Partha Pratim Roy [email protected] Debi Prosad Dogra [email protected] 1

Department of CSE, Indian Institute of Technology Roorkee, Roorkee, India

2

School of Electrical Sciences, IIT Bhubaneswar, Bhubaneswar, India

Multimedia Tools and Applications

1 Introduction The online visual data repository is increasing rapidly due to the fast expansions of social networking sites. This massive increment in visual data needs robust and efficient solution for object detection and recognition [18, 20, 50] to support content-based retrieval. A Logo is a special class of the object, which is extremely important to find an item’s belonging to a specific brand. Generally, the design of the logo consists of symbols and texts. It often emphasizes the product or company name and shows specific meaning for the object. The logo represents the brand and represents the quality and services of the product. Logos are the key feature for readers to describe the ownership of visual documents. Logo detection and recognition systems have gained importance in the research community because of its wide range of applications. Some of its uses are namely, vehicle logo recognition, traffic management, product brand management in social media. Logo detection tries to identify the locations of the logos within an image. Detected regions can further be assigned to the specific class of the logo, known as logo recognition. It can b

Data Loading...

Logo detection using weakly supervised saliency map

Recommend Documents

Region Proposals for Saliency Map Refinement for Weakly-Supervised Disease Localisation and Classification

Distinct Class-Specific Saliency Maps for Weakly Supervised Semantic Segmentation

Reliable Saliency Maps for Weakly-Supervised Localization of Disease Patterns

Weakly supervised multi-scale recurrent convolutional neural network for co-saliency detection and co-segmentation

Spatial division networks for weakly supervised detection

Weakly-Supervised Semantic Segmentation Using Motion Cues

Weakly Supervised Object Localization Using Size Estimates

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos

Hierarchical saliency mapping for weakly supervised object localization based on class activation mapping

Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer

Weakly Supervised Short Text Categorization Using World Knowledge

Weakly Supervised One-Stage Vision and Language Disease Detection Using Large Scale Pneumonia and Pneumothorax Studies