On Label Granularity and Object Localization

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  • Elijah Cole
  • Kimberly Wilber
  • Grant Van Horn
  • Xuan Yang
  • Marco Fornoni
  • Pietro Perona
  • Belongie, Serge
  • Andrew Howard
  • Oisin Mac Aodha

Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels. However, many objects can be labeled at different levels of granularity. Is it an animal, a bird, or a great horned owl? Which image-level labels should we use? In this paper we study the role of label granularity in WSOL. To facilitate this investigation we introduce iNatLoc500, a new large-scale fine-grained benchmark dataset for WSOL. Surprisingly, we find that choosing the right training label granularity provides a much larger performance boost than choosing the best WSOL algorithm. We also show that changing the label granularity can significantly improve data efficiency.

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2022 : 17th European Conference, Proceedings
EditorsShai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Number of pages17
PublisherSpringer
Publication date2022
Pages604-620
ISBN (Print)9783031200793
DOIs
Publication statusPublished - 2022
Event17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel
Duration: 23 Oct 202227 Oct 2022

Conference

Conference17th European Conference on Computer Vision, ECCV 2022
LandIsrael
ByTel Aviv
Periode23/10/202227/10/2022
SeriesLecture Notes in Computer Science
Volume13670 LNCS
ISSN0302-9743

Bibliographical note

Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Links

ID: 342672283