QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning

Publication: Contribution to book/anthology/report › Article in proceedings › Research › peer-reviewed

Documents

QLEVR
Publisher's published version, 16.4 MB, PDF document

Zechen Li
Anders Søgaard

Synthetic datasets have successfully been used to probe visual question-answering models for their reasoning abilities. CLEVR (Johnson et al., 2017), for example, tests a range of visual reasoning abilities. The questions in CLEVR focus on comparisons of shapes, colors, and sizes, numerical reasoning, and existence claims. This paper introduces a minimally biased, diagnostic visual question-answering dataset, QLEVR, that goes beyond existential and numerical quantification and focuses on more complex quantifiers and their combinations, e.g., asking whether there are more than two red balls that are smaller than at least three blue balls in an image. We describe how the dataset was created and present a first evaluation of state-of-the-art visual question-answering models, showing that QLEVR presents a formidable challenge to our current models. Code and dataset are available at https://github.com/zechenli03/QLEVR.
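
To make the nested-quantifier example in the abstract concrete, the following is a minimal sketch of how such a question could be evaluated against a toy scene. It is not taken from the QLEVR code base: the `Ball` class, its scalar `size` field, and the function name are hypothetical, and the sketch assumes one particular reading of the question (each counted red ball must be smaller than at least three blue balls).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ball:
    """Hypothetical scene object; QLEVR's real scene format may differ."""
    color: str
    size: float  # assumed scalar size, larger means bigger


def more_than_two_red_smaller_than_at_least_three_blue(scene):
    """Evaluate: 'Are there more than two red balls that are smaller
    than at least three blue balls?' under the assumed reading."""
    reds = [b for b in scene if b.color == "red"]
    blues = [b for b in scene if b.color == "blue"]
    # Inner quantifier ("at least three"): a red ball qualifies if it is
    # smaller than three or more blue balls.
    qualifying = [
        r for r in reds
        if sum(r.size < b.size for b in blues) >= 3
    ]
    # Outer quantifier ("more than two"): count the qualifying red balls.
    return len(qualifying) > 2


# Toy scene: three small red balls, one large red ball, three blue balls.
scene = [
    Ball("red", 1.0), Ball("red", 1.2), Ball("red", 1.5), Ball("red", 3.0),
    Ball("blue", 2.0), Ball("blue", 2.5), Ball("blue", 2.8),
]
answer = more_than_two_red_smaller_than_at_least_three_blue(scene)
```

In this toy scene three red balls are each smaller than all three blue balls, so the question holds; removing any one blue ball makes the inner quantifier fail for every red ball.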

Original language	English
Title	Findings of the Association for Computational Linguistics: NAACL 2022 - Findings
Publisher	Association for Computational Linguistics (ACL)
Publication date	2022
Pages	980-996
ISBN (Electronic)	9781955917766
DOI	https://doi.org/10.18653/v1/2022.findings-naacl.73
Status	Published - 2022
Event	2022 Findings of the Association for Computational Linguistics: NAACL 2022 - Seattle, USA Duration: 10 Jul 2022 → 15 Jul 2022

Conference

Conference	2022 Findings of the Association for Computational Linguistics: NAACL 2022
Country	USA
City	Seattle
Period	10/07/2022 → 15/07/2022

Bibliographical note

ID: 341493689

Datalogisk Institut
