Do Language Models Know the Way to Rome?

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

Dokumenter

Fulltext
Forlagets udgivne version, 428 KB, application/octet-stream

Bastien Nathan Liétard
Mostafa Abdou
Søgaard, Anders

The global geometry of language models is important for a range of applications, but language model probes tend to evaluate rather local relations, for which ground truths are easily obtained. In this paper we exploit the fact that in geography, ground truths are available beyond local relations. In a series of experiments, we evaluate the extent to which language model representations of city and country names are isomorphic to real-world geography, e.g., if you tell a language model where Paris and Berlin are, does it know the way to Rome? We find that language models generally encode limited geographic information, but with larger models performing the best, suggesting that geographic knowledge can be induced from higher-order co-occurrence statistics.

Originalsprog	Engelsk
Titel	Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
Forlag	Association for Computational Linguistics
Publikationsdato	2021
Sider	510–517
DOI	https://doi.org/10.18653/v1/2021.blackboxnlp-1.40
Status	Udgivet - 2021
Begivenhed	Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP - Online Varighed: 11 nov. 2021 → 11 nov. 2021

Konference

Konference	Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
By	Online
Periode	11/11/2021 → 11/11/2021

ID: 300078921

Datalogisk Institut