Grammatical Error Correction through Round-Trip Machine Translation

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 183 KB, PDF-dokument

Machine translation (MT) operates on the premise of an interlingua which abstracts away from surface form while preserving meaning. A decade ago the idea of using round-trip MT to guide grammatical error correction was proposed as a way to abstract away from potential errors in surface forms (Madnani et al., 2012). At the time, it did not pan out due to the low quality of MT systems of the day. Today much stronger MT systems are available so we re-evaluate this idea across five languages and models of various sizes. We find that for extra large models input augmentation through round-trip MT has little to no effect. For more ‘workable’ model sizes, however, it yields consistent improvements, sometimes bringing the performance of a base or large model up to that of a large or xl model, respectively. The round-trip translation comes at a computational cost though, so one would have to determine whether to opt for a larger model or for input augmentation on a case-by-case basis.
OriginalsprogEngelsk
TitelFindings of the Association for Computational Linguistics: EACL 2023
ForlagAssociation for Computational Linguistics (ACL)
Publikationsdato2023
Sider2208-2215
ISBN (Elektronisk)978-1-959429-47-0
DOI
StatusUdgivet - 2023
Begivenhed17th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2023 - Findings of EACL 2023 - Dubrovnik, Kroatien
Varighed: 2 maj 20236 maj 2023

Konference

Konference17th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2023 - Findings of EACL 2023
LandKroatien
ByDubrovnik
Periode02/05/202306/05/2023
SponsorAdobe, Babelscape, Bloomberg Engineering, Duolingo, Liveperson

ID: 381561609