Interpreting the Ratio Criterion for Matching SIFT Descriptors

Matching keypoints by minimizing the Euclidean distance between their SIFT descriptors is an effective and extremely popular technique. Using the ratio between distances, as suggested by Lowe, is even more effective and leads to excellent matching accurac

PDF / 533,743 Bytes
16 Pages / 439.37 x 666.142 pts Page_size
4 Downloads / 195 Views

DOWNLOAD

REPORT

Abstract. Matching keypoints by minimizing the Euclidean distance between their SIFT descriptors is an eﬀective and extremely popular technique. Using the ratio between distances, as suggested by Lowe, is even more eﬀective and leads to excellent matching accuracy. Probabilistic approaches that model the distribution of the distances were found eﬀective as well. This work focuses, for the ﬁrst time, on analyzing Lowe’s ratio criterion using a probabilistic approach. We provide two alternative interpretations of this criterion, which show that it is not only an eﬀective heuristic but can also be formally justiﬁed. The ﬁrst interpretation shows that Lowe’s ratio corresponds to a conditional probability that the match is incorrect. The second shows that the ratio corresponds to the Markov bound on this probability. The interpretations make it possible to slightly increase the eﬀectiveness of the ratio criterion, and to obtain matching performance that exceeds all previous (non-learning based) results.

Keywords: SIFT

1

· Matching · a contrario

Introduction

Matching objects in diﬀerent images is a fundamental task in computer vision, with applications in object recognition, panorama stitching, and many more. The common practice is to extract a set of distinctive keypoints from each image, compute a descriptor for each keypoint, and then match the keypoints using a similarity measure between the descriptors and possibly also geometric constraints. Many methods for detecting the keypoints and computing their descriptors have been proposed. See the reviews [11,23,25]. The scale invariant feature transform (SIFT) suggested by Lowe [8,9] is arguably the dominant algorithm for both keypoint detection and keypoint description. It speciﬁes feature points and corresponding neighborhoods as maxima in the scale space of the DoG operator. The descriptor itself is a set of histograms of gradient directions calculated in several (16) regions in this neighborhood, concatenated into a 128-dimensional vector. Various normalizations and several ﬁltering stages help to optimize the descriptor. The combination of scale space, gradient direction, and histograms makes the SIFT descriptor robust to scale, rotation, and illumination changes, and yet discriminative. Keypoints are c Springer International Publishing AG 2016 B. Leibe et al. (Eds.): ECCV 2016, Part V, LNCS 9909, pp. 697–712, 2016. DOI: 10.1007/978-3-319-46454-1 42

698

A. Kaplan et al.

matched by minimizing the Euclidean distance between their SIFT descriptors. However, to rank the matches, it is much more eﬀective to use the distance ratios: ratio(ai , bj(i) ) =

ai − bj(i) 2 ai − bj (i) 2

(1)

and not the distances themselves [8,9]. Here, ai denotes a descriptor in one image, and bj(i) , bj (i) correspond to the closest and the second-closest descriptors in the other image. SIFT has been challenged by many competing descriptors. The variations try to achieve faster runtime (e.g. SURF [2]), robustness to aﬃne transformation (ASIFT [14]), compatibility with colo

Data Loading...

Interpreting the Ratio Criterion for Matching SIFT Descriptors

Recommend Documents

Reduced-reference image quality assessment through SIFT intensity ratio

Image matching based on the adaptive redundant keypoint elimination method in the SIFT algorithm

The Remote Sensing Image Matching Algorithm Based on the Normalized Cross-Correlation and SIFT

Other Descriptors

A Digital Video Stabilization System Based on Reliable SIFT Feature Matching and Adaptive Low-Pass Filtering

Shape Descriptors

LandscapeAR: Large Scale Outdoor Augmented Reality by Matching Photographs with Terrain Models Using Learned Descriptors

Attribute Access and Descriptors

A New Approach for Fingerprint Verification Based on Wide Baseline Matching Using Local Interest Points and Descriptors

Local Descriptors

Online Invariance Selection for Local Feature Descriptors

Integration of Basic Descriptors for Image Retrieval