Brain-Supervised Image Editing

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

Dokumenter

  • Fulltext

    Accepteret manuskript, 7,74 MB, PDF-dokument

Despite recent advances in deep neural models for semantic image editing, present approaches are dependent on explicit human input. Previous work assumes the availability of manually curated datasets for supervised learning, while for unsupervised approaches the human inspection of discovered components is required to identify those which modify worthwhile semantic features. Here, we present a novel alternative: the utilization of brain responses as a supervision signal for learning semantic feature representations. Participants $(N=30)$ in a neurophysiological experiment were shown artificially generated faces and instructed to look for a particular semantic feature, such as 'old' or 'smiling', while their brain responses were recorded via electroencephalography (EEG). Using supervision signals inferred from these responses, semantic features within the latent space of a generative adversarial network (GAN) were learned and then used to edit semantic features of new images. We show that implicit brain supervision achieves comparable semantic image editing performance to explicit manual labeling. This work demonstrates the feasibility of utilizing implicit human reactions recorded via brain-computer interfaces for semantic image editing and interpretation.

OriginalsprogEngelsk
TitelProceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
ForlagIEEE Computer Society Press
Publikationsdato2022
Sider18459-18468
ISBN (Elektronisk)9781665469463
DOI
StatusUdgivet - 2022
Begivenhed2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, USA
Varighed: 19 jun. 202224 jun. 2022

Konference

Konference2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
LandUSA
ByNew Orleans
Periode19/06/202224/06/2022
NavnProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Vol/bind2022-June
ISSN1063-6919

Bibliografisk note

Funding Information:
This research was partially funded by the Academy of Finland. Computing resources were provided by the Finnish Grid and Cloud Infrastructure (urn:nbn:fi:researchinfras- 2016072533)

Funding Information:
This research was partially funded by the Academy of Finland. Computing resources were provided by the Finnish Grid and Cloud Infrastructure (urn:nbn:fi:research-infras-2016072533). We thank Michiel Spapé for his contributions to the neurophysiological experimentation and advice.

Publisher Copyright:
© 2022 IEEE.

ID: 339144873