Fine-Grained Image Analysis with Deep Learning: A Survey

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Dokumenter

  • Fulltext

    Forlagets udgivne version, 4,7 MB, PDF-dokument

  • Xiu Shen Wei
  • Yi Zhe Song
  • Oisin Mac Aodha
  • Jianxin Wu
  • Yuxin Peng
  • Jinhui Tang
  • Jian Yang
  • Belongie, Serge

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, which underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas -- fine-grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community.

OriginalsprogEngelsk
TidsskriftIEEE Transactions on Pattern Analysis and Machine Intelligence
Vol/bind44
Udgave nummer12
Sider (fra-til)8927-8948
ISSN0162-8828
DOI
StatusUdgivet - 2022

Bibliografisk note

Publisher Copyright:
IEEE

ID: 301817074