Finding pictures of objects in large collections of images

Research output: Contribution to journal › Conference article › Research › peer-review

Standard

Finding pictures of objects in large collections of images. / Forsyth, David A.; Malik, Jitendra; Fleck, Margaret M.; Greenspan, Hayit; Leung, Thomas; Belongie, Serge; Carson, Chad; Bregler, Chris.

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 1996, p. 335-360.

Research output: Contribution to journal › Conference article › Research › peer-review

Harvard

Forsyth, DA, Malik, J, Fleck, MM, Greenspan, H, Leung, T, Belongie, S, Carson, C & Bregler, C 1996, 'Finding pictures of objects in large collections of images', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), pp. 335-360. https://doi.org/10.1007/3-540-61750-7_36

APA

Forsyth, D. A., Malik, J., Fleck, M. M., Greenspan, H., Leung, T., Belongie, S., Carson, C., & Bregler, C. (1996). Finding pictures of objects in large collections of images. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 335-360. https://doi.org/10.1007/3-540-61750-7_36

Vancouver

Forsyth DA, Malik J, Fleck MM, Greenspan H, Leung T, Belongie S et al. Finding pictures of objects in large collections of images. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1996;335-360. https://doi.org/10.1007/3-540-61750-7_36

Author

Forsyth, David A. ; Malik, Jitendra ; Fleck, Margaret M. ; Greenspan, Hayit ; Leung, Thomas ; Belongie, Serge ; Carson, Chad ; Bregler, Chris. / Finding pictures of objects in large collections of images. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 1996 ; pp. 335-360.

Bibtex

@inproceedings{1a289ac66f0e45dc81f3bc97e5045465,

title = "Finding pictures of objects in large collections of images",

abstract = "Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.",

author = "Forsyth, {David A.} and Jitendra Malik and Fleck, {Margaret M.} and Hayit Greenspan and Thomas Leung and Serge Belongie and Chad Carson and Chris Bregler",

note = "Funding Information: We would like to thank R. Blasi and K. Murphy who collaborated with S. Belongie in the work on learning decision trees for visual concept classification. We thank Joe Mundy for suggesting that the response of a grouper may indicate the presence of an object. Aspects of this research were supported by the National Science Foundation under grants IRI-9209728, IRI-9420716, IRI-9501493, an NSF Young Investigator award, an NSF Digital Library award IRI-9411334, an instrumentation award CDA-9121985, and by a Berkeley Fellowship. Publisher Copyright: {\textcopyright} 1996, Springer Verlag. All rights reserved.; International Workshop on Object Representation in Computer Vision II, ECCV 1996 ; Conference date: 13-04-1996 Through 14-04-1996",

year = "1996",

doi = "10.1007/3-540-61750-7_36",

language = "English",

pages = "335--360",

journal = "Lecture Notes in Computer Science",

issn = "0302-9743",

publisher = "Springer Verlag",

}

RIS

TY - GEN

T1 - Finding pictures of objects in large collections of images

AU - Forsyth, David A.

AU - Malik, Jitendra

AU - Fleck, Margaret M.

AU - Greenspan, Hayit

AU - Leung, Thomas

AU - Belongie, Serge

AU - Carson, Chad

AU - Bregler, Chris

N1 - Funding Information: We would like to thank R. Blasi and K. Murphy who collaborated with S. Belongie in the work on learning decision trees for visual concept classification. We thank Joe Mundy for suggesting that the response of a grouper may indicate the presence of an object. Aspects of this research were supported by the National Science Foundation under grants IRI-9209728, IRI-9420716, IRI-9501493, an NSF Young Investigator award, an NSF Digital Library award IRI-9411334, an instrumentation award CDA-9121985, and by a Berkeley Fellowship. Publisher Copyright: © 1996, Springer Verlag. All rights reserved.

PY - 1996

Y1 - 1996

N2 - Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.

AB - Retrieving images from very large collections, using image content as a key, is becoming an important problem. Users prefer to ask for pictures using notions of content that are strongly oriented to the presence of abstractly defined objects. Computer programs that implement these queries automatically are desirable, but are hard to build because conventional object recognition techniques from computer vision cannot recognize very general objects in very general contexts. This paper describes our approach to object recognition, which is structured around a sequence of increasingly specialized grouping activities that assemble coherent regions of image that can be shown to satisfy increasingly stringent constraints. The constraints that are satisfied provide a form of object classification in quite general contexts. This view of recognition is distinguished by: far richer involvement of early visual primitives, including color and texture; hierarchical grouping and learning strategies in the classification process; the ability to deal with rather general objects in uncontrolled configurations and contexts. We illustrate these properties with four case-studies: one demonstrating the use of color and texture descriptors; one showing how trees can be described by fusing texture and geometric properties; one learning scenery concepts using grouped features; and one showing how this view of recognition yields a program that can tell, quite accurately, whether a picture contains naked people or not.

UR - http://www.scopus.com/inward/record.url?scp=84979074103&partnerID=8YFLogxK

U2 - 10.1007/3-540-61750-7_36

DO - 10.1007/3-540-61750-7_36

M3 - Conference article

AN - SCOPUS:84979074103

SP - 335

EP - 360

JO - Lecture Notes in Computer Science

JF - Lecture Notes in Computer Science

SN - 0302-9743

T2 - International Workshop on Object Representation in Computer Vision II, ECCV 1996

Y2 - 13 April 1996 through 14 April 1996

ER -

ID: 302163653

Datalogisk Institut