Moses and the Character-Based Random Babbling Baseline: CoAStaL at AmericasNLP 2021 Shared Task

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 193 KB, PDF-dokument

We evaluated a range of neural machine translation techniques developed specifically for low-resource scenarios. Unsuccessfully. In the end, we submitted two runs: (i) a standard phrase-based model, and (ii) a random babbling baseline using character trigrams. We found that it was surprisingly hard to beat (i), in spite of this model being, in theory, a bad fit for polysynthetic languages; and more interestingly, that (ii) was better than several of the submitted systems, highlighting how difficult low-resource machine translation for polysynthetic languages is.

OriginalsprogEngelsk
TitelProceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021
RedaktørerManuel Mager, Arturo Oncevay, Annette Rios, Ivan Vladimir Meza Ruiz, Alexis Palmer, Graham Neubig, Katharina Kann
ForlagAssociation for Computational Linguistics
Publikationsdato2021
Sider248-254
ISBN (Elektronisk)9781954085442
DOI
StatusUdgivet - 2021
Begivenhed1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021 - Virtual, Online
Varighed: 11 jun. 2021 → …

Konference

Konference1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021
ByVirtual, Online
Periode11/06/2021 → …

ID: 291814762