“…All results are normalized to that of an ideal interconnect, in which we do not model any routing delay, contention, or queuing delays. We model only the wire delay over the manhattan distance between the sender and receiver node (30ps/mm [32] [4], [15], [24] to reduce traffic Benchmarks Used Splash-2 [35] barnes (ba), cholesky (ch), fft (ff), fmm (fm) lu (lu), ocean (oc), radiosity (rs), radix (rx) raytrace (ry), water-spatial (ws) Parsec [7] blackscholes (bl), fluidanimate (fl) Other em3d (em), ilink (il), jacobi (ja) mp3d (mp), shallow (sh), tsp (ts) The reason for TLLB's performance is its latency. In a medium-scale CMP like the one simulated here, the overall throughput demand seldom overwhelms the shared bus.…”