The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset

Publikation: Bidrag til tidsskriftTidsskriftartikelForskning

Standard

The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge : A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset. / Desai, Arjun D.; Caliva, Francesco; Iriondo, Claudia; Khosravan, Naji; Mortazi, Aliasghar; Jambawalikar, Sachin; Torigian, Drew; Ellerman, Jutta; Akcakaya, Mehmet; Bagci, Ulas; Tibrewala, Radhika; Flament, Io; O`Brien, Matthew; Majumdar, Sharmila; Perslev, Mathias; Pai, Akshay; Igel, Christian; Dam, Erik B.; Gaj, Sibaji; Yang, Mingrui; Nakamura, Kunio; Li, Xiaojuan; Deniz, Cem M.; Juras, Vladimir; Regatte, Ravinder; Gold, Garry E.; Hargreaves, Brian A.; Pedoia, Valentina; Chaudhari, Akshay S.

I: arXiv, 29.04.2020.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskning

Harvard

Desai, AD, Caliva, F, Iriondo, C, Khosravan, N, Mortazi, A, Jambawalikar, S, Torigian, D, Ellerman, J, Akcakaya, M, Bagci, U, Tibrewala, R, Flament, I, O`Brien, M, Majumdar, S, Perslev, M, Pai, A, Igel, C, Dam, EB, Gaj, S, Yang, M, Nakamura, K, Li, X, Deniz, CM, Juras, V, Regatte, R, Gold, GE, Hargreaves, BA, Pedoia, V & Chaudhari, AS 2020, 'The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset', arXiv.

APA

Desai, A. D., Caliva, F., Iriondo, C., Khosravan, N., Mortazi, A., Jambawalikar, S., ... Chaudhari, A. S. (2020). The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset. arXiv.

Vancouver

Desai AD, Caliva F, Iriondo C, Khosravan N, Mortazi A, Jambawalikar S o.a. The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset. arXiv. 2020 apr 29.

Author

Desai, Arjun D. ; Caliva, Francesco ; Iriondo, Claudia ; Khosravan, Naji ; Mortazi, Aliasghar ; Jambawalikar, Sachin ; Torigian, Drew ; Ellerman, Jutta ; Akcakaya, Mehmet ; Bagci, Ulas ; Tibrewala, Radhika ; Flament, Io ; O`Brien, Matthew ; Majumdar, Sharmila ; Perslev, Mathias ; Pai, Akshay ; Igel, Christian ; Dam, Erik B. ; Gaj, Sibaji ; Yang, Mingrui ; Nakamura, Kunio ; Li, Xiaojuan ; Deniz, Cem M. ; Juras, Vladimir ; Regatte, Ravinder ; Gold, Garry E. ; Hargreaves, Brian A. ; Pedoia, Valentina ; Chaudhari, Akshay S. / The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge : A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset. I: arXiv. 2020.

Bibtex

@article{0ce17fed11e24feda26d0b1a264f7030,
title = "The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset",
abstract = "Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Challenge submissions and a majority-vote ensemble were evaluated using Dice score, average symmetric surface distance, volumetric overlap error, and coefficient of variation on a hold-out test set. Similarities in network segmentations were evaluated using pairwise Dice correlations. Articular cartilage thickness was computed per-scan and longitudinally. Correlation between thickness error and segmentation metrics was measured using Pearson's coefficient. Two empirical upper bounds for ensemble performance were computed using combinations of model outputs that consolidated true positives and true negatives. Results: Six teams (T1-T6) submitted entries for the challenge. No significant differences were observed across all segmentation metrics for all tissues (p=1.0) among the four top-performing networks (T2, T3, T4, T6). Dice correlations between network pairs were high (>0.85). Per-scan thickness errors were negligible among T1-T4 (p=0.99) and longitudinal changes showed minimal bias (",
keywords = "eess.IV, cs.CV",
author = "Desai, {Arjun D.} and Francesco Caliva and Claudia Iriondo and Naji Khosravan and Aliasghar Mortazi and Sachin Jambawalikar and Drew Torigian and Jutta Ellerman and Mehmet Akcakaya and Ulas Bagci and Radhika Tibrewala and Io Flament and Matthew O`Brien and Sharmila Majumdar and Mathias Perslev and Akshay Pai and Christian Igel and Dam, {Erik B.} and Sibaji Gaj and Mingrui Yang and Kunio Nakamura and Xiaojuan Li and Deniz, {Cem M.} and Vladimir Juras and Ravinder Regatte and Gold, {Garry E.} and Hargreaves, {Brian A.} and Valentina Pedoia and Chaudhari, {Akshay S.}",
note = "Submitted to Radiology: Artificial Intelligence",
year = "2020",
month = "4",
day = "29",
language = "English",
journal = "arXiv",

}

RIS

TY - JOUR

T1 - The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge

T2 - A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset

AU - Desai, Arjun D.

AU - Caliva, Francesco

AU - Iriondo, Claudia

AU - Khosravan, Naji

AU - Mortazi, Aliasghar

AU - Jambawalikar, Sachin

AU - Torigian, Drew

AU - Ellerman, Jutta

AU - Akcakaya, Mehmet

AU - Bagci, Ulas

AU - Tibrewala, Radhika

AU - Flament, Io

AU - O`Brien, Matthew

AU - Majumdar, Sharmila

AU - Perslev, Mathias

AU - Pai, Akshay

AU - Igel, Christian

AU - Dam, Erik B.

AU - Gaj, Sibaji

AU - Yang, Mingrui

AU - Nakamura, Kunio

AU - Li, Xiaojuan

AU - Deniz, Cem M.

AU - Juras, Vladimir

AU - Regatte, Ravinder

AU - Gold, Garry E.

AU - Hargreaves, Brian A.

AU - Pedoia, Valentina

AU - Chaudhari, Akshay S.

N1 - Submitted to Radiology: Artificial Intelligence

PY - 2020/4/29

Y1 - 2020/4/29

N2 - Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Challenge submissions and a majority-vote ensemble were evaluated using Dice score, average symmetric surface distance, volumetric overlap error, and coefficient of variation on a hold-out test set. Similarities in network segmentations were evaluated using pairwise Dice correlations. Articular cartilage thickness was computed per-scan and longitudinally. Correlation between thickness error and segmentation metrics was measured using Pearson's coefficient. Two empirical upper bounds for ensemble performance were computed using combinations of model outputs that consolidated true positives and true negatives. Results: Six teams (T1-T6) submitted entries for the challenge. No significant differences were observed across all segmentation metrics for all tissues (p=1.0) among the four top-performing networks (T2, T3, T4, T6). Dice correlations between network pairs were high (>0.85). Per-scan thickness errors were negligible among T1-T4 (p=0.99) and longitudinal changes showed minimal bias (

AB - Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Challenge submissions and a majority-vote ensemble were evaluated using Dice score, average symmetric surface distance, volumetric overlap error, and coefficient of variation on a hold-out test set. Similarities in network segmentations were evaluated using pairwise Dice correlations. Articular cartilage thickness was computed per-scan and longitudinally. Correlation between thickness error and segmentation metrics was measured using Pearson's coefficient. Two empirical upper bounds for ensemble performance were computed using combinations of model outputs that consolidated true positives and true negatives. Results: Six teams (T1-T6) submitted entries for the challenge. No significant differences were observed across all segmentation metrics for all tissues (p=1.0) among the four top-performing networks (T2, T3, T4, T6). Dice correlations between network pairs were high (>0.85). Per-scan thickness errors were negligible among T1-T4 (p=0.99) and longitudinal changes showed minimal bias (

KW - eess.IV

KW - cs.CV

M3 - Journal article

JO - arXiv

JF - arXiv

ER -

ID: 241415557