“…Note the large difference in the size of the training data (between 10 and more than 15,000 PSGs) as well as in the size of the test data (between 8 and close to 3000). Table 3 summarizes validation results of AI-algorithms that applied a hold-out or crossvalidation; i.e., an internal validation based on data from the same dataset that has been used for training (Supratak et al, 2017;Sors et al, 2018;Phan et al, 2019;Zhang et al, 2019;Abou Jaoude et al, 2020;Guillot et al, 2020;Korkalainen et al, 2020;Sun et al, 2020a;Alvarez-Estevez and Rijsman, 2021;Fiorillo et al, 2021Fiorillo et al, , 2023bJia et al, 2021;Nasiri and Clifford, 2021;Olesen et al, 2021;Pathak et al, 2021;Vallat and Walker, 2021;Brandmayr et al, 2022;Cho et al, 2022;Ji et al, 2022;Li C. et al, 2022;Sharma et al, 2022;Yubo et al, 2022). Table 4 summarizes the validation results of AI-algorithms which have been validated in datasets completely unseen by the model (Anderer et al, 2018(Anderer et al, , 2022bBiswal et al, 2018;Patanaik et al, 2018;Stephansen et al, 2018;Zhang et al, 2019;Abou Jaoude et al, 2020;Alvarez-Estevez and Rijsman, 2021;Cesari et al, 2021Cesari et al, , 2022Vallat and Walker, 2021;Bakker et al, 2023).…”