A Multilingual Benchmark for Probing Negation-Awareness with Minimal Pairs

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

Dokumenter

A Multilingual Benchmark for Probing Negation-Awareness
Forlagets udgivne version, 1,07 MB, PDF-dokument

Negation is one of the most fundamental concepts in human cognition and language, and several natural language inference (NLI) probes have been designed to investigate pretrained language models’ ability to detect and reason with negation. However, the existing probing datasets are limited to English only, and do not enable controlled probing of performance in the absence or presence of negation. In response, we present a multilingual (English, Bulgarian, German, French and Chinese) benchmark collection of NLI examples that are grammatical and correctly labeled, as a result of manual inspection and reformulation. We use the benchmark to probe the negation-awareness of multilingual language models and find that models that correctly predict examples with negation cues, often fail to correctly predict their counter-examples without negation cues, even when the cues are irrelevant for semantic inference.

Originalsprog	Engelsk
Titel	Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Forlag	Association for Computational Linguistics
Publikationsdato	2021
Sider	244–257
DOI	https://doi.org/10.18653/v1/2021.conll-1.19
Status	Udgivet - 2021
Begivenhed	2021 Conference on Empirical Methods in Natural Language Processing - Varighed: 7 nov. 2021 → 11 nov. 2021

Konference

Konference	2021 Conference on Empirical Methods in Natural Language Processing
Periode	07/11/2021 → 11/11/2021

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk

Ingen data tilgængelig

ID: 299825199

Datalogisk Institut