Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 281 KB, PDF-dokument

The prototypical NLP experiment trains a standard architecture on labeled English data and optimizes for accuracy, without accounting for other dimensions such as fairness, interpretability, or computational efficiency. We show through a manual classification of recent NLP research papers that this is indeed the case and refer to it as the square one experimental setup. We observe that NLP research often goes beyond the square one setup, e.g, focusing not only on accuracy, but also on fairness or interpretability, but typically only along a single dimension. Most work targeting multilinguality, for example, considers only accuracy; most work on fairness or interpretability considers only English; and so on. Such one-dimensionality of most research means we are only exploring a fraction of the NLP research search space. We provide historical and recent examples of how the square one bias has led researchers to draw false conclusions or make unwise choices, point to promising yet unexplored directions on the research manifold, and make practical recommendations to enable more multi-dimensional research. We open-source the results of our annotations to enable further analysis.

OriginalsprogEngelsk
TitelACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022
RedaktørerSmaranda Muresan, Preslav Nakov, Aline Villavicencio
ForlagAssociation for Computational Linguistics (ACL)
Publikationsdato2022
Sider2340-2354
ISBN (Elektronisk)9781955917254
DOI
StatusUdgivet - 2022
Begivenhed60th Annual Meeting of the Association for Computational Linguistics, ACL 2022 - Dublin, Irland
Varighed: 22 maj 202227 maj 2022

Konference

Konference60th Annual Meeting of the Association for Computational Linguistics, ACL 2022
LandIrland
ByDublin
Periode22/05/202227/05/2022
SponsorAmazon Science, Bloomberg Engineering, et al., Google Research, Liveperson, Meta

Bibliografisk note

Publisher Copyright:
© 2022 Association for Computational Linguistics.

ID: 341486380