On Label Granularity and Object Localization

Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review

Elijah Cole
Kimberly Wilber
Grant Van Horn
Xuan Yang
Marco Fornoni
Pietro Perona
Belongie, Serge
Andrew Howard
Oisin Mac Aodha

Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation we introduce iNatLoc500, a new large-scale fine-grained benchmark dataset for WSOL. Surprisingly, we find that choosing the right training label granularity provides a much larger performance boost than choosing the best WSOL algorithm. We also show that changing the label granularity can significantly improve data efficiency.

Original language	English
Title of host publication	Computer Vision – ECCV 2022 : 17th European Conference, Proceedings
Editors	Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Number of pages	17
Publisher	Springer
Publication date	2022
Pages	604-620
ISBN (Print)	9783031200793
DOIs	https://doi.org/10.1007/978-3-031-20080-9_35
Publication status	Published - 2022
Event	17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel Duration: 23 Oct 2022 → 27 Oct 2022

Conference

Conference	17th European Conference on Computer Vision, ECCV 2022
Land	Israel
By	Tel Aviv
Periode	23/10/2022 → 27/10/2022

Series	Lecture Notes in Computer Science
Volume	13670 LNCS
ISSN	0302-9743

Department of Computer Science

On Label Granularity and Object Localization

Conference

Bibliographical note

Links