Multi-dueling bandits and their application to online ranker evaluation

Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed

New ranking algorithms are continually being developed and refined, necessitating the development of efficient methods for evaluating these rankers. Online ranker evaluation focuses on the challenge of efficiently determining, from implicit user feedback, which ranker out of a finite set of rankers is the best. Online ranker evaluation can be modeled by dueling bandits, a mathematical model for online learning under limited feedback from pairwise comparisons. Pairs of rankers are compared by interleaving their result sets and examining which documents users click on. The dueling bandits model addresses the key issue of which pair of rankers to compare at each iteration, thereby providing a solution to the exploration-exploitation trade-off. Recently, methods for simultaneously comparing more than two rankers have been developed. However, the question of which rankers to compare at each iteration was left open. We address this question by proposing a generalization of the dueling bandits model that uses simultaneous comparisons of an unrestricted number of rankers. We evaluate our algorithm on synthetic data and several standard large-scale online ranker evaluation datasets. Our experimental results show that the algorithm yields orders of magnitude improvement in performance compared to state-of-the-art dueling bandit algorithms.
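The pairwise feedback loop that the abstract describes can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper: it picks pairs uniformly at random instead of adaptively, and it replaces interleaving and click feedback with a simulated preference matrix. The names `simulate_duels`, `estimated_best`, and `pref`, and all probability values, are assumptions made for illustration only.

```python
import random


def simulate_duels(pref, rounds=2000, seed=0):
    """Toy dueling-bandit loop: compare one pair of rankers per round.

    pref[i][j] is an assumed probability that ranker i beats ranker j in a
    single comparison (hypothetical values standing in for click feedback).
    """
    rng = random.Random(seed)
    k = len(pref)
    wins = [[0] * k for _ in range(k)]  # wins[i][j]: times i beat j
    for _ in range(rounds):
        # Naive exploration: a uniformly random pair. A real dueling bandit
        # algorithm would choose the pair adaptively from past outcomes.
        i, j = rng.sample(range(k), 2)
        if rng.random() < pref[i][j]:
            wins[i][j] += 1
        else:
            wins[j][i] += 1
    return wins


def estimated_best(wins):
    """Return the ranker that beats the most opponents by empirical majority."""
    k = len(wins)

    def score(i):
        return sum(1 for j in range(k) if j != i and wins[i][j] > wins[j][i])

    return max(range(k), key=score)


# Hypothetical preference matrix in which ranker 0 beats every other ranker.
pref = [
    [0.5, 0.8, 0.9],
    [0.2, 0.5, 0.7],
    [0.1, 0.3, 0.5],
]
wins = simulate_duels(pref)
best = estimated_best(wins)
```

Under these assumed preferences, `best` should identify ranker 0. The uniform pair selection spends many comparisons on clearly inferior rankers, which is the inefficiency that adaptive selection of rankers is meant to reduce.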

Original language	English
Title	Proceedings of the 25th ACM International Conference on Information and Knowledge Management
Number of pages	6
Publisher	Association for Computing Machinery
Publication date	2016
Pages	2161-2166
ISBN (Electronic)	978-1-4503-4073-1
DOI	https://doi.org/10.1145/2983323.2983659
Status	Published - 2016
Event	25th ACM International Conference on Information and Knowledge Management - Indianapolis, USA Duration: 24 Oct 2016 → 28 Oct 2016 Conference number: 25

Conference

Conference	25th ACM International Conference on Information and Knowledge Management
Number	25
Country	USA
City	Indianapolis
Period	24/10/2016 → 28/10/2016

Name	ACM International Conference on Information and Knowledge Management

Research areas

cs.IR, cs.LG, stat.ML

Department of Computer Science (Datalogisk Institut)