The ISMRM Open Science Initiative for Perfusion Imaging (OSIPI): Results from the OSIPI–Dynamic Contrast-Enhanced challenge

link to paper

The ISMRM Open Science Initiative for Perfusion Imaging (OSIPI): Results from the OSIPI–Dynamic Contrast-Enhanced challenge

Eve S. Shalom, Harrison Kim, Rianne A. van der Heijden, Zaki Ahmed, Reyna Patel, David A. Hormuth II, Julie C. DiCarlo, Thomas E. Yankeelov, Nicholas J. Sisco, Richard D. Dortch, Ashley M. Stokes, Marianna Inglese, Matthew Grech-Sollars, Nicola Toschi, Prativa Sahoo, Anup Singh, Sanjay K. Verma, Divya K. Rathore, Anum S. Kazerouni, Savannah C. Partridge, Eve LoCastro, Ramesh Paudyal, Ivan A. Wolansky, Amita Shukla-Dave, Pepijn Schouten, Oliver J. Gurney-Champion, Radovan Jiřík, Ondřej Macíček, Michal Bartoš, Jiří Vitouš, Ayesha Bharadwaj Das, S. Gene Kim, Louisa Bokacheva, Artem Mikheev, Henry Rusinek, Michael Berks, Penny L. Hubbard Cristinacce, Ross A. Little, Susan Cheung, James P. B. O’Connor, Geoff J. M. Parker, Brendan Moloney, Peter S. LaViolette, Samuel Bobholz, Savannah Duenweg, John Virostko, Hendrik O. Laue, Kyunghyun Sung, Ali Nabavizadeh, Hamidreza Saligheh Rad, Leland S. Hu, Steven Sourbron, Laura C. Bell, Anahita Fathi Kazerooni

Abstract

Purpose

Ktrans has often been proposed as a quantitative imaging biomarker for diagnosis, prognosis, and treatment response assessment for various tumors. None of the many software tools for Ktrans quantification are standardized. The ISMRM Open Science Initiative for Perfusion Imaging–Dynamic Contrast-Enhanced (OSIPI-DCE) challenge was designed to benchmark methods to better help the efforts to standardize Ktrans measurement.

Methods

A framework was created to evaluate Ktrans values produced by DCE-MRI analysis pipelines to enable benchmarking. The perfusion MRI community was invited to apply their pipelines for Ktranss quantification in glioblastoma from clinical and synthetic patients. Submissions were required to include the entrants’ Ktrans values, the applied software, and a standard operating procedure. These were evaluated using the proposed OSIPIgold score defined with accuracy, repeatability, and reproducibility components.

Results

Across the 10 received submissions, the OSIPIgold score ranged from 28% to 78% with a 59% median. The accuracy, repeatability, and reproducibility scores ranged from 0.54 to 0.92, 0.64 to 0.86, and 0.65 to 1.00, respectively (0–1 = lowest–highest). Manual arterial input function selection markedly affected the reproducibility and showed greater variability in Ktrans analysis than automated methods. Furthermore, provision of a detailed standard operating procedure was critical for higher reproducibility.

Conclusions

This study reports results from the OSIPI-DCE challenge and highlights the high inter-software variability within Ktrans estimation, providing a framework for ongoing benchmarking against the scores presented. Through this challenge, the participating teams were ranked based on the performance of their software tools in the particular setting of this challenge. In a real-world clinical setting, many of these tools may perform differently with different benchmarking methodology.