“…As a result, many studies have focused on large pre-clinical pharmacogenomics datasets such as cancer cell lines as a proxy to patients (Barretina et al, 2012;Iorio et al, 2016). A majority of the current computational methods are trained on cell line datasets and then tested on other cell line or patient datasets (Sharifi-Noghabi et al, 2019b;Sakellaropoulos et al, 2019;Mourragui et al, 2019;Rampášek et al, 2019;Ding et al, 2018;Geeleher et al, 2017Geeleher et al, , 2014. However, cell lines and patients data, even with the same set of genes, do not have identical distributions due to the lack of an immune system and the tumor microenvironment in cell lines, which means a model cannot be trained on cell lines and then tested on patients (Mourragui et al, 2019).…”