“…Other clustering methods applied in the literature are not suitable for the co-optimization of supply and grid technologies: these include clustering based on electrical distance [70,12,23,2,10] (which we do not use because we want to optimize new grid reinforcements that alter electrical distances), spectral partitioning of the graph Laplacian matrix [34] (avoided for same reason), an adaptation of -means called -means++ combined with a max-regions algorithm applied to aggregate contiguous sites with similar wind, solar and electricity demand [65] (avoided since we want a coherent clustering of all network nodes and assets), hierarchical clustering based on a database of electricity demand, conventional generation and renewable profiles including a synthesized grid [47] (avoided for the same reason and because we do not want to alter the topology of the existing transmission grid), -means clustering based on renewable resources as well as economic, sociodemographic and geographical features [20] (avoided because we need a clustering focused on network reduction), as well as clustering based on zonal Power Transfer Distribution Factors (PTDFs) [19,53,64] (avoided because they encode electrical parameters that change with reinforcement), Available Tranfer Capacities (ATCs) [63] (avoided because they depend on predefined dispatch patterns) and locational marginal prices (LMP) [66] (again avoided because they depend on pre-defined dispatch patterns).…”