Imperial College London

DR BERNHARD KAINZ

Faculty of EngineeringDepartment of Computing

Reader in Medical Image Computing
 
 
 
//

Contact

 

+44 (0)20 7594 8349b.kainz Website CV

 
 
//

Location

 

372Huxley BuildingSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@article{Maier-Hein:2024:10.1038/s41592-023-02151-z,
author = {Maier-Hein, L and Reinke, A and Godau, P and Tizabi, MD and Buettner, F and Christodoulou, E and Glocker, B and Isensee, F and Kleesiek, J and Kozubek, M and Reyes, M and Riegler, MA and Wiesenfarth, M and Kavur, AE and Sudre, CH and Baumgartner, M and Eisenmann, M and Heckmann-Nötzel, D and Rädsch, T and Acion, L and Antonelli, M and Arbel, T and Bakas, S and Benis, A and Blaschko, MB and Cardoso, MJ and Cheplygina, V and Cimini, BA and Collins, GS and Farahani, K and Ferrer, L and Galdran, A and van, Ginneken B and Haase, R and Hashimoto, DA and Hoffman, MM and Huisman, M and Jannin, P and Kahn, CE and Kainmueller, D and Kainz, B and Karargyris, A and Karthikesalingam, A and Kofler, F and Kopp-Schneider, A and Kreshuk, A and Kurc, T and Landman, BA and Litjens, G and Madani, A and Maier-Hein, K and Martel, AL and Mattson, P and Meijering, E and Menze, B and Moons, KGM and Müller, H and Nichyporuk, B and Nickel, F and Petersen, J and Rajpoot, N and Rieke, N and Saez-Rodriguez, J and S},
doi = {10.1038/s41592-023-02151-z},
journal = {Nature Methods},
pages = {195--212},
title = {Metrics reloaded: recommendations for image analysis validation},
url = {http://dx.doi.org/10.1038/s41592-023-02151-z},
volume = {21},
year = {2024}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint-a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.
AU - Maier-Hein,L
AU - Reinke,A
AU - Godau,P
AU - Tizabi,MD
AU - Buettner,F
AU - Christodoulou,E
AU - Glocker,B
AU - Isensee,F
AU - Kleesiek,J
AU - Kozubek,M
AU - Reyes,M
AU - Riegler,MA
AU - Wiesenfarth,M
AU - Kavur,AE
AU - Sudre,CH
AU - Baumgartner,M
AU - Eisenmann,M
AU - Heckmann-Nötzel,D
AU - Rädsch,T
AU - Acion,L
AU - Antonelli,M
AU - Arbel,T
AU - Bakas,S
AU - Benis,A
AU - Blaschko,MB
AU - Cardoso,MJ
AU - Cheplygina,V
AU - Cimini,BA
AU - Collins,GS
AU - Farahani,K
AU - Ferrer,L
AU - Galdran,A
AU - van,Ginneken B
AU - Haase,R
AU - Hashimoto,DA
AU - Hoffman,MM
AU - Huisman,M
AU - Jannin,P
AU - Kahn,CE
AU - Kainmueller,D
AU - Kainz,B
AU - Karargyris,A
AU - Karthikesalingam,A
AU - Kofler,F
AU - Kopp-Schneider,A
AU - Kreshuk,A
AU - Kurc,T
AU - Landman,BA
AU - Litjens,G
AU - Madani,A
AU - Maier-Hein,K
AU - Martel,AL
AU - Mattson,P
AU - Meijering,E
AU - Menze,B
AU - Moons,KGM
AU - Müller,H
AU - Nichyporuk,B
AU - Nickel,F
AU - Petersen,J
AU - Rajpoot,N
AU - Rieke,N
AU - Saez-Rodriguez,J
AU - Sánchez,CI
AU - Shetty,S
AU - van,Smeden M
AU - Summers,RM
AU - Taha,AA
AU - Tiulpin,A
AU - Tsaftaris,SA
AU - Van,Calster B
AU - Varoquaux,G
AU - Jäger,PF
DO - 10.1038/s41592-023-02151-z
EP - 212
PY - 2024///
SN - 1548-7091
SP - 195
TI - Metrics reloaded: recommendations for image analysis validation
T2 - Nature Methods
UR - http://dx.doi.org/10.1038/s41592-023-02151-z
UR - https://www.ncbi.nlm.nih.gov/pubmed/38347141
UR - https://www.nature.com/articles/s41592-023-02150-0
VL - 21
ER -