How to Measure the Reproducibility of System-oriented IR Experiments

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • Fulltext

    Accepteret manuskript, 592 KB, PDF-dokument

  • Timo Breuer
  • Nicola Ferro
  • Norbert Fuhr
  • Maistro, Maria
  • Tetsuya Sakai
  • Philipp Schaer
  • Ian Soboroff

Replicability and reproducibility of experimental results are primary concerns in all the areas of science and IR is not an exception. Besides the problem of moving the field towards more reproducible experimental practices and protocols, we also face a severe methodological issue: we do not have any means to assess when reproduced is reproduced. Moreover, we lack any reproducibility-oriented dataset, which would allow us to develop such methods. To address these issues, we compare several measures to objectively quantify to what extent we have replicated or reproduced a system-oriented IR experiment. These measures operate at different levels of granularity, from the fine-grained comparison of ranked lists, to the more general comparison of the obtained effects and significant differences. Moreover, we also develop a reproducibility-oriented dataset, which allows us to validate our measures and which can also be used to develop future measures.

OriginalsprogEngelsk
TitelSIGIR '20 : Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
Antal sider10
ForlagAssociation for Computing Machinery
Publikationsdato2020
Sider349-358
ISBN (Elektronisk)978-1-4503-8016-4
DOI
StatusUdgivet - 2020
Begivenhed43rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020 - Virtual, Online, Kina
Varighed: 25 jul. 202030 jul. 2020

Konference

Konference43rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020
LandKina
ByVirtual, Online
Periode25/07/202030/07/2020
SponsorACM Special Interest Group on Information Retrieval (SIGIR)

Bibliografisk note

Funding Information:
Acknowledgments. This paper is partially supported by AMAOS (Advanced Machine Learning for Automatic Omni-Channel Support), funded by Innovationsfonden, Denmark, and by DFG (German Research Foundation, project no. 407518790).

Publisher Copyright:
© 2020 ACM.

ID: 269912561