Optimizing ranking for response prediction via triplet-wise learning from historical feedback

PDF / 1,922,524 Bytes
17 Pages / 595.276 x 790.866 pts Page_size
86 Downloads / 265 Views

ORIGINAL ARTICLE

Optimizing ranking for response prediction via triplet-wise learning from historical feedback Lili Shan1 • Lei Lin1 • Chengjie Sun1 • Xiaolong Wang1 • Bingquan Liu1

Received: 3 September 2015 / Accepted: 14 June 2016 Springer-Verlag Berlin Heidelberg 2016

Abstract In the real-time bidding (RTB) display advertising ecosystem, when receiving a bid request, Demand-side platform (DSP) needs to predict user response on each ad impression and determines whether to bid and calculates the bid price according to its prediction. When given a fixed advertising budget, in order to maximize the return on investment (ROI), DSP aims to buy in more conversions and then more clicks than non-clicks. In this paper, we consider response prediction problem as a ranking problem for impression chances and propose a triplet-wise comparison based learning optimization which derived from Bayesian personalized ranking (BPR) based on pairwise learning to learn model parameters. Pairwise learning can only employ one type of historical click and conversion information through optimizing the correct order of random pair of a positive and a negative example for binary classification. While triplet-wise learning combines these two kinds of historical response information into the same model through taking into consideration the correct order of the pair of conversion and click-only as well as the pair of click-only and non-click. Since our method accomplishes the click and & Lili Shan [email protected] Lei Lin [email protected] Chengjie Sun [email protected] Xiaolong Wang [email protected] Bingquan Liu [email protected] 1

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

conversion prediction tasks in the same predicting procedure, our algorithm is good at ranking click impressions higher than non-click ones and conversion impressions higher than click-only ones. In this way, under a fixed budget, biding algorithm would preferentially buy in more conversions than others and then more clicks than non-clicks. Our experiments demonstrate that the improved method not only outperforms both pairwise and MSE schemes on three classes ranking in terms of multi-AUC, NDCG etc., but also, outperforms others on binary classification for click and non-click on the targeted real-world bidding log data owing to the introduction of historical conversion information. Keywords Response prediction Triplet-wise learning Rank optimization Real time bidding Demand-side platform

1 Introduction With the emergence and development of spot markets, real time bidding (RTB) advertising has become an increasingly important way for the publisher to sell its ad inventory. In the RTB-enabled display advertising ecosystem, there are three major players, publishers, ad exchanges and demand-side platforms (DSPs). The publisher supplies ad impression chances in which the ad will be exposed. The ad exchange aggregates these ad impressions from multiple publishers and sells them among sever

Data Loading...

Optimizing ranking for response prediction via triplet-wise learning from historical feedback

Recommend Documents

Iterative sure independent ranking and screening for drug response prediction

Robot Learning from Feedback

Interactive task learning via embodied corrective feedback

Decoy selection for protein structure prediction via extreme gradient boosting and ranking

A novel approach for personalized response model: deep learning with individual dropout feature ranking

Learning from Heterogeneous Student Behaviors for Multiple Prediction Tasks

SimAug: Learning Robust Representations from Simulation for Trajectory Prediction

Challenge-Enabled Machine Learning to Drug-Response Prediction

Short-Term Traffic Speed Prediction via Machine Learning

Dynamical User Intention Prediction via Multi-modal Learning

Analytical Model of Early HARQ Feedback Prediction

aDFR: An Attention-Based Deep Learning Model for Flight Ranking