“…The most frequent evaluative metrics were: concordance [ 30 ]; precision [ 31 ], sensitivity, specificity [ 29 , 40 , 41 , 43 , 44 , 53 , 58 , 60 ]; concordance, sensitivity, specificity [ 37 ]; accuracy [ 47 , 56 ]; accuracy—area under the curve (AUC) [ 45 , 46 , 55 ], accuracy, sensitivity, specificity [ 28 , 34 , 36 , 38 , 39 , 42 , 49 , 52 , 54 ]; receiver operating characteristics curve (ROC-AUC) [ 33 ]; accuracy, sensitivity (recall), specificity, F-measure, ROC-AUC, precision [ 51 ]; precision, recall and F1-score [ 61 ]; sensitivity, specificity and ROC-AUC [ 45 , 57 ]; sensitivity, specificity and IOU (intersection over union evaluating accuracy of the ROI (region of interest)) [ 32 ]; accuracy, sensitivity, specificity and ROC-AUC [ 27 , 35 , 48 , 50 ]; accuracy, precision, recall and F-score [ 62 ]; sensitivity, specificity and positive predictive value [ 59 ]. The definitions of the terms employed are provided in Table 1 .…”