Optimizing ranking for response prediction via triplet-wise learning from historical feedback

  • PDF / 1,922,524 Bytes
  • 17 Pages / 595.276 x 790.866 pts Page_size
  • 86 Downloads / 219 Views

DOWNLOAD

REPORT


ORIGINAL ARTICLE

Optimizing ranking for response prediction via triplet-wise learning from historical feedback Lili Shan1 • Lei Lin1 • Chengjie Sun1 • Xiaolong Wang1 • Bingquan Liu1

Received: 3 September 2015 / Accepted: 14 June 2016  Springer-Verlag Berlin Heidelberg 2016

Abstract In the real-time bidding (RTB) display advertising ecosystem, when receiving a bid request, Demand-side platform (DSP) needs to predict user response on each ad impression and determines whether to bid and calculates the bid price according to its prediction. When given a fixed advertising budget, in order to maximize the return on investment (ROI), DSP aims to buy in more conversions and then more clicks than non-clicks. In this paper, we consider response prediction problem as a ranking problem for impression chances and propose a triplet-wise comparison based learning optimization which derived from Bayesian personalized ranking (BPR) based on pairwise learning to learn model parameters. Pairwise learning can only employ one type of historical click and conversion information through optimizing the correct order of random pair of a positive and a negative example for binary classification. While triplet-wise learning combines these two kinds of historical response information into the same model through taking into consideration the correct order of the pair of conversion and click-only as well as the pair of click-only and non-click. Since our method accomplishes the click and & Lili Shan [email protected] Lei Lin [email protected] Chengjie Sun [email protected] Xiaolong Wang [email protected] Bingquan Liu [email protected] 1

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

conversion prediction tasks in the same predicting procedure, our algorithm is good at ranking click impressions higher than non-click ones and conversion impressions higher than click-only ones. In this way, under a fixed budget, biding algorithm would preferentially buy in more conversions than others and then more clicks than non-clicks. Our experiments demonstrate that the improved method not only outperforms both pairwise and MSE schemes on three classes ranking in terms of multi-AUC, NDCG etc., but also, outperforms others on binary classification for click and non-click on the targeted real-world bidding log data owing to the introduction of historical conversion information. Keywords Response prediction  Triplet-wise learning  Rank optimization  Real time bidding  Demand-side platform

1 Introduction With the emergence and development of spot markets, real time bidding (RTB) advertising has become an increasingly important way for the publisher to sell its ad inventory. In the RTB-enabled display advertising ecosystem, there are three major players, publishers, ad exchanges and demand-side platforms (DSPs). The publisher supplies ad impression chances in which the ad will be exposed. The ad exchange aggregates these ad impressions from multiple publishers and sells them among sever

Data Loading...