Multiclass recognition and part localization with humans in the loop

Publikation: Bidrag til tidsskriftKonferenceartikelForskningfagfællebedømt

We propose a visual recognition system that is designed for fine-grained visual categorization. The system is composed of a machine and a human user. The user, who is unable to carry out the recognition task by himself, is interactively asked to provide two heterogeneous forms of information: clicking on object parts and answering binary questions. The machine intelligently selects the most informative question to pose to the user in order to identify the object's class as quickly as possible. By leveraging computer vision and analyzing the user responses, the overall amount of human effort required, measured in seconds, is minimized. We demonstrate promising results on a challenging dataset of uncropped images, achieving a significant average reduction in human effort over previous methods.

OriginalsprogEngelsk
TidsskriftProceedings of the IEEE International Conference on Computer Vision
Sider (fra-til)2524-2531
Antal sider8
DOI
StatusUdgivet - 2011
Eksternt udgivetJa
Begivenhed2011 IEEE International Conference on Computer Vision, ICCV 2011 - Barcelona, Spanien
Varighed: 6 nov. 201113 nov. 2011

Konference

Konference2011 IEEE International Conference on Computer Vision, ICCV 2011
LandSpanien
ByBarcelona
Periode06/11/201113/11/2011
SponsorTOYOTA, Google, Microsoft Research, Siemens, technicolor

ID: 301830771