“…As robustly and accurately as they may perform, these networks rely on sufficient data, both in amount and quality, which tends to be the bottleneck of their applicability in certain cases such as single-cell detection. While in more industrial applications (see (Grigorescu et al, 2019)for an overview of autonomous driving) a large amount of training data can be collected relatively easily: see the cityscapes dataset (Cordts et al, 2016) (available at https://www.cityscapes-dataset.com/ ) of traffic video frames using a car and camera to record and potentially non-expert individuals to label the objects, clinical data is considerably more difficult, due to ethical constraints, and expensive to gather as expert annotation is required. Datasets available in the public domain such as BBBC (Ljosa et al, 2012) at https://data.broadinstitute.org/bbbc/ , TNBC (Naylor et al, 2017(Naylor et al, , 2019 or TCGA (Cancer Genome Atlas Research Network, 2008;Kumar et al, 2017) and detection challenges including ISBI (Coelho et al, 2009), Kaggle ( https://www.kaggle.com/ ), ImageNet (Russakovsky et al, 2015) etc.…”