Learning attention for historical text normalization by learning to pronounce

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

Marcel Bollmann
Joachim Bingel
Søgaard, Anders

Automated processing of historical texts often relies on pre-normalization to modern word forms. Training encoder-decoder architectures to solve such problems typically requires a lot of training data, which is not available for the named task. We address this problem by using several novel encoder-decoder architectures, including a multi-task learning (MTL) architecture using a grapheme-to-phoneme dictionary as auxiliary data, pushing the state-of-the-art by an absolute 2% increase in performance. We analyze the induced models across 44 different texts from Early New High German. Interestingly, we observe that, as previously conjectured, multi-task learning can learn to focus attention during decoding, in ways remarkably similar to recently proposed attention mechanisms. This, we believe, is an important step toward understanding how MTL works.

Originalsprog	Engelsk
Titel	ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers)
Antal sider	13
Forlag	Association for Computational Linguistics
Publikationsdato	1 jan. 2017
Sider	332-344
ISBN (Elektronisk)	9781945626753
DOI	https://doi.org/10.18653/v1/P17-1031
Status	Udgivet - 1 jan. 2017
Begivenhed	55th Annual Meeting of the Association for Computational Linguistics, ACL 2017 - Vancouver, Canada Varighed: 30 jul. 2017 → 4 aug. 2017

Konference

Konference	55th Annual Meeting of the Association for Computational Linguistics, ACL 2017
Land	Canada
By	Vancouver
Periode	30/07/2017 → 04/08/2017
Sponsor	Amazon, Apple, Baidu, et al, Google, Tencent

ID: 194945376

Datalogisk Institut

Learning attention for historical text normalization by learning to pronounce

Konference