Learning single-view 3D reconstruction with limited pose supervision

Publikation: Bidrag til tidsskrift › Konferenceartikel › Forskning › fagfællebedømt

Guandao Yang
Yin Cui
Belongie, Serge
Bharath Hariharan

It is expensive to label images with 3D structure or precise camera pose. Yet, this is precisely the kind of annotation required to train single-view 3D reconstruction models. In contrast, unlabeled images or images with just category labels are easy to acquire, but few current models can use this weak supervision. We present a unified framework that can combine both types of supervision: a small amount of camera pose annotations are used to enforce pose-invariance and view-point consistency, and unlabeled images combined with an adversarial loss are used to enforce the realism of rendered, generated models. We use this unified framework to measure the impact of each form of supervision in three paradigms: semi-supervised, multi-task, and transfer learning. We show that with a combination of these ideas, we can train single-view reconstruction models that improve up to 7 points in performance (AP) when using only 1% pose annotated training data.

Originalsprog	Engelsk
Tidsskrift	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Sider (fra-til)	90-105
Antal sider	16
ISSN	0302-9743
DOI	https://doi.org/10.1007/978-3-030-01267-0_6
Status	Udgivet - 2018
Eksternt udgivet	Ja
Begivenhed	15th European Conference on Computer Vision, ECCV 2018 - Munich, Tyskland Varighed: 8 sep. 2018 → 14 sep. 2018

Konference

Konference	15th European Conference on Computer Vision, ECCV 2018
Land	Tyskland
By	Munich
Periode	08/09/2018 → 14/09/2018

Bibliografisk note

ID: 301825834

Datalogisk Institut