“…To build a comprehensive data set of PAHs, we exanimated a large number of publications about soot particles mostly in the last ten years, but potential structures with S and N elements are excluded to simplify our analysis in the bandgap. In this work, we selected 323 PAHs ranging from 6 up to 96 carbon numbers (Lafleur et al, 1993;Elvati and Violi, 2013;Kislov et al, 2013;Lowe et al, 2015;Johansson et al, 2016;Zhang et al, 2016;Johansson et al, 2017;Adamson et al, 2018;Kholghy et al, 2018;Li et al, 2018;Commodo et al, 2019;Elvati et al, 2019;Giaccai and Miller, 2019;Kozliak et al, 2019;Schulz et al, 2019;Zhang, 2019;Frenklach and Mebel, 2020;Gavilan Marin et al, 2020;Gentile et al, 2020;Leon et al, 2020;Michelsen, 2020;Pascazio et al, 2020;Saldinger et al, 2020;Zhao et al, 2020;Chen et al, 2021;Shi et al, 2021;Wang et al, 2021). The molecular structures are all included in Supplementary Table S1 (see Supplementary Material).…”