AWARE - Staff

AWARE: Exploiting evaluation measures to combine multiple assessors

Research output: Contribution to journal › Journal article › Research › peer-review

Marco Ferrante
Nicola Ferro
Maistro, Maria

We propose the Assessor-drivenWeighted Averages for Retrieval Evaluation (AWARE) probabilistic framework, a novel methodology for dealing with multiple crowd assessors that may be contradictory and/or noisy. By modeling relevance judgements and crowd assessors as sources of uncertainty, AWARE takes the expectation of a generic performance measure, like Average Precision, composed with these random variables. In this way, it approaches the problem of aggregating different crowd assessors from a new perspective, that is, directly combining the performance measures computed on the ground truth generated by the crowd assessors instead of adopting some classification technique to merge the labels produced by them. We propose several unsupervised estimators that instantiate the AWARE framework and we compare them with state-of-theart approaches, that is,Majoriity Vote and Expectation Maximization, on TREC collections. We found that AWARE approaches improve in terms of their capability of correctly ranking systems and predicting their actual performance scores.

Original language	English
Article number	20
Journal	ACM Transactions on Information Systems
Volume	36
Issue number	2
ISSN	1046-8188
DOIs	https://doi.org/10.1145/3110217
Publication status	Published - 1 Aug 2017
Externally published	Yes

Research areas

AWARE, Crowdsourcing, Performance measure, Unsupervised estimators, Weighted average

ID: 216517365

Department of Computer Science

AWARE: Exploiting evaluation measures to combine multiple assessors

Research areas