With great interest, we have read the article by Mansour et al. [1], reporting on the use of deep transfer learning to identify early signs of hypoxic-ischemic brain injury (HIBI) on head computed tomography (HCT) scans. The authors report a very high accuracy (0.94) of their model with respect to the detection of HIBI signs on HCT scans performed within hours after the return of spontaneous circulation. The authors conclude that "Deep transfer learning reliably identifies HIBI in normal appearing findings on HCT performed within 3 h after ROSC in comatose survivors of a cardiac arrest" [1]. This interpretation is likely too optimistic.Deep learning networks show poor classification results and tend to be overfitted when trained on a very small data set [2]. A medical imaging data set of 54 HCT scans is a very small training data set. Further, we think that the following methodological issues could also contribute to overfitting in this study: (1) choice of the network, (2) the training pipeline (data augmentation, early stopping), and (3) principal component analysis (PCA) and repeated data usage.No justification was given for why a VGG19 network was chosen, although it has a significantly worse accuracy in the analysis of CT data than, for instance, ResNet-50 or . At the same time, it