Zero-Shot Cross-Lingual Transfer with Meta Learning

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

Learning what to share between tasks has become a topic of great importance, as strategic sharing of knowledge has been shown to improve downstream task performance. This is particularly important for multilingual applications, as most languages in the world are under-resourced. Here, we consider the setting of training models on multiple different languages at the same time, when little or no data is available for languages other than English. We show that this challenging setup can be approached using meta-learning: in addition to training a source language model, another model learns to select which training instances are the most beneficial to the first. We experiment using standard supervised, zero-shot cross-lingual, as well as few-shot cross-lingual settings for different natural language understanding tasks (natural language inference, question answering). Our extensive experimental setup demonstrates the consistent effectiveness of meta-learning for a total of 15 languages. We improve upon the state-of-the-art for zero-shot and few-shot NLI (on MultiNLI and XNLI) and QA (on the MLQA dataset). A comprehensive error analysis indicates that the correlation of typological features between languages can partly explain when parameter sharing learned via meta-learning is beneficial.
OriginalsprogEngelsk
TitelProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
ForlagAssociation for Computational Linguistics
Publikationsdato2020
Sider4547-4562
DOI
StatusUdgivet - 2020
BegivenhedThe 2020 Conference on Empirical Methods in Natural Language Processing - online
Varighed: 16 nov. 202020 nov. 2020
http://2020.emnlp.org

Konference

KonferenceThe 2020 Conference on Empirical Methods in Natural Language Processing
Lokationonline
Periode16/11/202020/11/2020
Internetadresse

Antal downloads er baseret på statistik fra Google Scholar og www.ku.dk


Ingen data tilgængelig

ID: 254992325