Scene Text Detection with Adaptive Line Clustering

We propose a scene text detection system which can maintain a high recall while achieving a fair precision. In our method, no character candidate is eliminated based on character-level features. A weighted directed graph is constructed and the minimum ave

PDF / 1,082,861 Bytes
14 Pages / 439.37 x 666.142 pts Page_size
13 Downloads / 285 Views

DOWNLOAD

REPORT

Abstract. We propose a scene text detection system which can maintain a high recall while achieving a fair precision. In our method, no character candidate is eliminated based on character-level features. A weighted directed graph is constructed and the minimum average cost path algorithm is adopted to extract line candidates. After assigning three line-level probability values to each line, the ﬁnal decisions are made according to the line candidate clustering of the current image. The proposed system has been evaluated on the ICDAR 2013 dataset. Compared with other published methods, it has achieved better performances. Keywords: Line-level features · Weighted directed graph model imum average cost path algorithm · Line candidate clustering

1

· Min-

Introduction

Text detection and recognition in natural scenes is a key technology to potential applications such as autonomous navigation, multilingual translation, and assistance to the visually impaired people. While there have been signiﬁcant advances in recent years, text detection still remains challenging due to the diversity of scene text and the complexity of background. For comprehensive surveys, refer to [8,20,24]. A typical text detection system usually consists of four components, namely, character candidate extraction, false candidate elimination, text line generation, and text line veriﬁcation. Existing methods for scene text detection can be largely categorized into three groups: methods using the sliding window [7,18,23], methods using the connected components [2,5,15,21], and hybrid methods [6]. Zhang et al. [23] adopted the sliding window which is based on the symmetry property of character groups. In the work [7] text saliency maps were computed by evaluating the character/background convolutional neural network classiﬁer in a sliding window. Tian et al. [18] detected character candidates by combining the sliding window scheme with a fast cascade boosting algorithm that exhibited a high performance. Compared with the methods using the sliding window, methods based on connected components have become popular during the last ten years, since c Springer International Publishing Switzerland 2016 G. Hua and H. J´ egou (Eds.): ECCV 2016 Workshops, Part I, LNCS 9913, pp. 364–377, 2016. DOI: 10.1007/978-3-319-46604-0 27

Scene Text Detection with Adaptive Line Clustering

365

these methods are usually more eﬃcient and relatively insensitive to variations in scale, orientation, font, and language type. SWT [4] and MSER [11] are two representative component-based methods for scene text detection, which constitute the basis of a lot of subsequent works [2,5,15,21]. The method of Neumann and Matas [12] exploited the MSER detector and then classiﬁed the detected regions as either a character, a multi-character or the background. In [3] a fast stroke detector was proposed based on an eﬃcient pixel intensity comparison to surrounding pixels. However, the character candidate extraction brings in too many non-text components especially in component-based met

Data Loading...

Scene Text Detection with Adaptive Line Clustering

Recommend Documents

Sequential Deformation for Accurate Scene Text Detection

Interactive Scene Text Detection on Mobile Devices

Class-Balanced Loss for Scene Text Detection

Automated Text Detection and Text-Line Construction in Natural Images

Fast RT-LoG operator for scene text detection

Scene Text Detection and Recognition: The Deep Learning Era

Path Aggregation and Dual Supervision Network for Scene Text Detection

Anchor-free multi-orientation text detection in natural scene images

Text Clustering

Downtown Osaka Scene Text Dataset

An adaptive text-line extraction algorithm for printed Arabic documents with diacritics

Scene Change Detection