The Role of Syntactic Planning in Compositional Image Captioning

Publikation: Bidrag til bog/antologi/rapport › Konferencebidrag i proceedings › Forskning › fagfællebedømt

Dokumenter

The Role of Syntactic Planning in Compositional Image Captioning
Forlagets udgivne version, 1,2 MB, PDF-dokument

Image captioning has focused on generalizing to images drawn from the same distribution as the training set, and not to the more challenging problem of generalizing to different distributions of images. Recently, Nikolaus et al. (2019) introduced a dataset to assess compositional generalization in image captioning, where models are evaluated on their ability to describe images with unseen adjective–noun and noun–verb compositions. In this work, we investigate different methods to improve compositional generalization by planning the syntactic structure of a caption. Our experiments show that jointly modeling tokens and syntactic tags enhances generalization in both RNN- and Transformer-based models, while also improving performance on standard metrics.

Originalsprog	Engelsk
Titel	Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
Udgivelsessted	Online
Forlag	Association for Computational Linguistics
Publikationsdato	apr. 2021
Sider	593–607
DOI	https://doi.org/10.18653/v1/2021.eacl-main.48
Status	Udgivet - apr. 2021
Begivenhed	The 16th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2021 - Varighed: 21 apr. 2021 → 23 apr. 2021 Konferencens nummer: 16 https://2021.eacl.org/

Konference

Konference	The 16th Conference of the European Chapter of the Association for Computational Linguistics
Nummer	16
Periode	21/04/2021 → 23/04/2021
Internetadresse	https://2021.eacl.org/

ID: 275339891

Datalogisk Institut