“…Recent methods for learning heuristics combine or improve on existing heuristics (Arfaee, Zilles, and Holte 2011; Groshev et al. 2018; Samadi, Felner, and Schaeffer 2008; Thayer, Dionne, and Ruml 2011; Garrett, Kaelbling, and Lozano-Pérez 2016; Shen, Trevizan, and Thiébaux 2020). All of these methods use supervised learning but differ in how they encode states, proposing, for instance, images (Groshev et al. 2018; Ma et al. 2020; Katz et al. 2018) or sophisticated network models (Shen, Trevizan, and Thiébaux 2020; Toyer et al. 2018). A common approach is to perform regression on the heuristic values obtained from precomputed plans (Shen, Trevizan, and Thiébaux 2020; Toyer et al. 2018; Garrett, Kaelbling, and Lozano-Pérez 2016; Yoon, Fern, and Givan 2008), and for this reason it is the baseline we used for comparison against supervised methods.…”