“…We started by compiling a list of 80 datasets employed in 26 papers working with symbolic regression problems published at GECCO from 2013 to 2017. From these 80 datasets, we could not find the description of two synthetic datasets (Sext [12] and Nguyen-12 [12,16]), and we could not find seven real-world datasets available online: Dow chemical [28], Plasma protein binding level (PPB) [5,9,29], Tower data [14,15,29], NOX [1,2], Wind (WND) [15], Median lethal dose (toxicity/LD50) [5,9] and Human oral bioavailability [5,8,9,29].…”