Fine-Grained Image Analysis with Deep Learning: A Survey

Research output: Contribution to journalJournal articleResearchpeer-review

Documents

  • Fulltext

    Final published version, 4.7 MB, PDF document

  • Xiu Shen Wei
  • Yi Zhe Song
  • Oisin Mac Aodha
  • Jianxin Wu
  • Yuxin Peng
  • Jinhui Tang
  • Jian Yang
  • Belongie, Serge

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, which underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas -- fine-grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community.

Original languageEnglish
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume44
Issue number12
Pages (from-to)8927-8948
ISSN0162-8828
DOIs
Publication statusPublished - 2022

Bibliographical note

Publisher Copyright:
IEEE

    Research areas

  • Birds, Deep Learning, Deep learning, Fine-Grained Image Recognition, Fine-Grained Image Retrieval, Fine-Grained Images Analysis, Image analysis, Image recognition, Image retrieval, Task analysis, Visualization

ID: 301817074