“…where $\gamma^{(i,j)} = \sum_{c \in \mathcal{E}_{x_j},\, (i,j) \in c} \beta_{x_j}^{c}$ aggregates the weight for edge $(i, j)$ from all the edge combinations that contain it, as shown in Figure 2. To bridge the optimization gap in topology search, we anneal the operation weight $\alpha$ and the edge combination weight $\beta$ with the exponential schedule defined in Eq. 5.…”

| Architecture | Test error (%) | Params (M) | Search cost (GPU-days) | Search method |
|---|---|---|---|---|
| … (Pham et al., 2018) | 2.89 | 4.6 | 0.5 | RL |
| NAONet-WS (Luo et al., 2018) | 3.53 | 3.1 | 0.4 | NAO |
| AmoebaNet-B (Real et al., 2019) | 2.55 ± 0.05 | 2.8 | 3150 | EA |
| Hierarchical Evolution (Liu et al., 2018b) | 3.75 ± 0.12 | 15.7 | 300 | EA |
| PNAS (Liu et al., 2018a) | 3.41 ± 0.09 | 3.2 | 225 | SMBO |
| DARTS | 3.00 | 3.3 | 0.4 | GD |
| SNAS | 2.85 | 2.8 | 1.5 | GD |
| GDAS (Dong & Yang, 2019) | 2.93 | 2.5 | 0.2 | GD |
| P-DARTS | 2.50 | 3.4 | 0.3 | GD |
| FairDARTS (Chu et al., 2019b) | 2.54 | 2.8 | 0.4 | GD |
| PC-DARTS | 2.57 ± 0.07 | 3.6 | 0.1 | GD |
| DropNAS (Hong et al., 2020) | 2.58 ± 0.14 | 4.1 | 0.6 | GD |
| MergeNAS | 2.73 ± 0.02 | 2.9 | 0.2 | GD |
| ASAP (Noy et al., 2020) | 2.68 ± 0.11 | 2.5 | 0.2 | GD |
| SDARTS-ADV (Chen & Hsieh, 2020) | 2.61 ± 0.02 | 3.3 | 1.3 | GD |
| DARTS- (Chu et al., 2020) | 2.59 ± 0.08 | 3.5 | 0.4 | GD |
| DOTS | 2.45 ± 0.04 | 4.2 | 0.2 | GD |
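The aggregation of edge weights from edge combinations can be illustrated with a minimal Python sketch. It rests on several assumptions that this excerpt does not confirm: each node keeps `k = 2` input edges, the combination weights β come from a temperature-scaled softmax over hypothetical logits, and the annealing temperature decays as `t0 * decay ** epoch` (Eq. 5 is not reproduced here, so this form and its constants are placeholders). All function and parameter names are illustrative.

```python
from itertools import combinations
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax (assumed parameterization of beta)."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def edge_combinations(num_inputs, k=2):
    """All k-subsets of the incoming edges of a node.

    Each edge (i, j) is identified by its predecessor index i.
    k = 2 is an assumption, not taken from the excerpt.
    """
    return list(combinations(range(num_inputs), k))


def aggregate_edge_weights(beta, combos, num_inputs):
    """gamma[i] = sum of beta[c] over every combination c that contains edge i."""
    gamma = [0.0] * num_inputs
    for weight, combo in zip(beta, combos):
        for i in combo:
            gamma[i] += weight
    return gamma


def exponential_temperature(epoch, t0=10.0, decay=0.9, t_min=0.02):
    """Hypothetical exponential annealing schedule; all constants are placeholders."""
    return max(t_min, t0 * decay ** epoch)


# Node with three candidate input edges.
combos = edge_combinations(3)            # [(0, 1), (0, 2), (1, 2)]
logits = [1.0, 0.0, 0.0]                 # hypothetical combination logits
for epoch in (0, 20, 40):
    temperature = exponential_temperature(epoch)
    beta = softmax(logits, temperature)
    gamma = aggregate_edge_weights(beta, combos, num_inputs=3)
    print(epoch, round(temperature, 3), [round(g, 3) for g in gamma])
```

Because every combination contains exactly `k` edges and β sums to one, the aggregated weights in `gamma` always sum to `k`. As the temperature falls, β concentrates on a single combination, so the edge weights move toward a discrete choice of `k` edges.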