Microsoft COCO: Common objects in context

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  • Tsung Yi Lin
  • Michael Maire
  • Belongie, Serge
  • James Hays
  • Pietro Perona
  • Deva Ramanan
  • Piotr Dollár
  • C. Lawrence Zitnick

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 objects types that would be easily recognizable by a 4 year old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science, Springer
Number of pages16
Volume8693 LNCS
Publication date2014
EditionPART 5
Pages740-755
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event13th European Conference on Computer Vision, ECCV 2014 - Zurich, Switzerland
Duration: 6 Sep 201412 Sep 2014

Conference

Conference13th European Conference on Computer Vision, ECCV 2014
LandSwitzerland
ByZurich
Periode06/09/201412/09/2014
SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN0302-9743

ID: 302817706