An algorithm competition for automatic species identification from herbarium specimens

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

An algorithm competition for automatic species identification from herbarium specimens. / Little, Damon P.; Tulig, Melissa; Tan, Kiat Chuan; Liu, Yulong; Belongie, Serge; Kaeser-Chen, Christine; Michelangeli, Fabián A.; Panesar, Kiran; Guha, R. V.; Ambrose, Barbara A.

In: Applications in Plant Sciences, Vol. 8, No. 6, e11365, 01.06.2020.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Little, DP, Tulig, M, Tan, KC, Liu, Y, Belongie, S, Kaeser-Chen, C, Michelangeli, FA, Panesar, K, Guha, RV & Ambrose, BA 2020, 'An algorithm competition for automatic species identification from herbarium specimens', Applications in Plant Sciences, vol. 8, no. 6, e11365. https://doi.org/10.1002/aps3.11365

APA

Little, D. P., Tulig, M., Tan, K. C., Liu, Y., Belongie, S., Kaeser-Chen, C., Michelangeli, F. A., Panesar, K., Guha, R. V., & Ambrose, B. A. (2020). An algorithm competition for automatic species identification from herbarium specimens. Applications in Plant Sciences, 8(6), [e11365]. https://doi.org/10.1002/aps3.11365

Vancouver

Little DP, Tulig M, Tan KC, Liu Y, Belongie S, Kaeser-Chen C et al. An algorithm competition for automatic species identification from herbarium specimens. Applications in Plant Sciences. 2020 Jun 1;8(6). e11365. https://doi.org/10.1002/aps3.11365

Author

Little, Damon P. ; Tulig, Melissa ; Tan, Kiat Chuan ; Liu, Yulong ; Belongie, Serge ; Kaeser-Chen, Christine ; Michelangeli, Fabián A. ; Panesar, Kiran ; Guha, R. V. ; Ambrose, Barbara A. / An algorithm competition for automatic species identification from herbarium specimens. In: Applications in Plant Sciences. 2020 ; Vol. 8, No. 6.

Bibtex

@article{addf60086f1541d19a9ff8e3f80f85c6,
title = "An algorithm competition for automatic species identification from herbarium specimens",
abstract = "Premise: Plant biodiversity is threatened, yet many species remain undescribed. It is estimated that >50% of undescribed species have already been collected and are awaiting discovery in herbaria. Robust automatic species identification algorithms using machine learning could accelerate species discovery. Methods: To encourage the development of an automatic species identification algorithm, we submitted our Herbarium 2019 data set to the Fine-Grained Visual Categorization sub-competition (FGVC6) hosted on the Kaggle platform. We chose to focus on the flowering plant family Melastomataceae because we have a large collection of imaged herbarium specimens (46,469 specimens representing 683 species) and taxonomic expertise in the family. As is common for herbarium collections, some species in this data set are represented by few specimens and others by many. Results: In less than three months, the FGVC6 Herbarium 2019 Challenge drew 22 teams who entered 254 models for Melastomataceae species identification. The four best algorithms identified species with >88% accuracy. Discussion: The FGVC competitions provide a unique opportunity for computer vision and machine learning experts to address difficult species-recognition problems. The Herbarium 2019 Challenge brought together a novel combination of collections resources, taxonomic expertise, and collaboration between botanists and computer scientists.",
keywords = "artificial intelligence, computer vision, FGVC, herbarium specimen, Kaggle, machine learning, Melastomataceae",
author = "Little, {Damon P.} and Melissa Tulig and Tan, {Kiat Chuan} and Yulong Liu and Serge Belongie and Christine Kaeser-Chen and Michelangeli, {Fabi{\'a}n A.} and Kiran Panesar and Guha, {R. V.} and Ambrose, {Barbara A.}",
note = "Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA‐1444192, DEB‐1343612 and DEB‐0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA-1444192, DEB-1343612 and DEB-0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Publisher Copyright: {\textcopyright} 2020 Little et al. Applications in Plant Sciences is published by Wiley Periodicals, LLC on behalf of the Botanical Society of America",
year = "2020",
month = jun,
day = "1",
doi = "10.1002/aps3.11365",
language = "English",
volume = "8",
journal = "Applications in Plant Sciences",
issn = "2168-0450",
publisher = "Botanical Society of America",
number = "6",

}

RIS

TY - JOUR

T1 - An algorithm competition for automatic species identification from herbarium specimens

AU - Little, Damon P.

AU - Tulig, Melissa

AU - Tan, Kiat Chuan

AU - Liu, Yulong

AU - Belongie, Serge

AU - Kaeser-Chen, Christine

AU - Michelangeli, Fabián A.

AU - Panesar, Kiran

AU - Guha, R. V.

AU - Ambrose, Barbara A.

N1 - Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA‐1444192, DEB‐1343612 and DEB‐0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Funding Information: We thank the New York Botanical Garden for support and funding from the National Science Foundation (IAA-1444192, DEB-1343612 and DEB-0818399 to F.A.M.). Special thanks to the staff of the New York Botanical Garden, particularly Kim Watson and Nichole Tiernan for all the specimen digitization work. We also thank the organizers of FGVC, the Kaggle platform, and all the Herbarium 2019 competitors for taking on the challenge of this data set. Publisher Copyright: © 2020 Little et al. Applications in Plant Sciences is published by Wiley Periodicals, LLC on behalf of the Botanical Society of America

PY - 2020/6/1

Y1 - 2020/6/1

N2 - Premise: Plant biodiversity is threatened, yet many species remain undescribed. It is estimated that >50% of undescribed species have already been collected and are awaiting discovery in herbaria. Robust automatic species identification algorithms using machine learning could accelerate species discovery. Methods: To encourage the development of an automatic species identification algorithm, we submitted our Herbarium 2019 data set to the Fine-Grained Visual Categorization sub-competition (FGVC6) hosted on the Kaggle platform. We chose to focus on the flowering plant family Melastomataceae because we have a large collection of imaged herbarium specimens (46,469 specimens representing 683 species) and taxonomic expertise in the family. As is common for herbarium collections, some species in this data set are represented by few specimens and others by many. Results: In less than three months, the FGVC6 Herbarium 2019 Challenge drew 22 teams who entered 254 models for Melastomataceae species identification. The four best algorithms identified species with >88% accuracy. Discussion: The FGVC competitions provide a unique opportunity for computer vision and machine learning experts to address difficult species-recognition problems. The Herbarium 2019 Challenge brought together a novel combination of collections resources, taxonomic expertise, and collaboration between botanists and computer scientists.

AB - Premise: Plant biodiversity is threatened, yet many species remain undescribed. It is estimated that >50% of undescribed species have already been collected and are awaiting discovery in herbaria. Robust automatic species identification algorithms using machine learning could accelerate species discovery. Methods: To encourage the development of an automatic species identification algorithm, we submitted our Herbarium 2019 data set to the Fine-Grained Visual Categorization sub-competition (FGVC6) hosted on the Kaggle platform. We chose to focus on the flowering plant family Melastomataceae because we have a large collection of imaged herbarium specimens (46,469 specimens representing 683 species) and taxonomic expertise in the family. As is common for herbarium collections, some species in this data set are represented by few specimens and others by many. Results: In less than three months, the FGVC6 Herbarium 2019 Challenge drew 22 teams who entered 254 models for Melastomataceae species identification. The four best algorithms identified species with >88% accuracy. Discussion: The FGVC competitions provide a unique opportunity for computer vision and machine learning experts to address difficult species-recognition problems. The Herbarium 2019 Challenge brought together a novel combination of collections resources, taxonomic expertise, and collaboration between botanists and computer scientists.

KW - artificial intelligence

KW - computer vision

KW - FGVC

KW - herbarium specimen

KW - Kaggle

KW - machine learning

KW - Melastomataceae

UR - http://www.scopus.com/inward/record.url?scp=85087175472&partnerID=8YFLogxK

U2 - 10.1002/aps3.11365

DO - 10.1002/aps3.11365

M3 - Journal article

AN - SCOPUS:85087175472

VL - 8

JO - Applications in Plant Sciences

JF - Applications in Plant Sciences

SN - 2168-0450

IS - 6

M1 - e11365

ER -

ID: 301822923