“…Segmentation of panoramic data, which is often captured through distortion-pronounced fisheye lenses [38], [39], [40] or multiple surround-view cameras [41], [42], [43], is challenging as it entails a set of hard tasks like distortion elimination, camera synchronization and calibration, as well as data fusion, resulting in higher latency and complexity. Yang et al introduce the PASS [7] and the DS-PASS [44] frameworks which naturally mitigate the effect of distortions by using a single-shot panoramic annular lens system, but come with an expensive memory-and computation cost, as it requires separating the panorama into multiple partitions for predictions, each resembling a narrow-FoV pinhole image.…”