2021
DOI: 10.1109/lra.2021.3062324
OmniDet: Surround View Cameras Based Multi-Task Visual Perception Network for Autonomous Driving

Abstract: Surround-view fisheye cameras are commonly deployed in automated driving for 360° near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentation, motion segmentation, object detection, and lens soiling detection. We demonstrate that the jointly trained model…


Cited by 64 publications (28 citation statements)
References 40 publications
“…For the next generation, a unified CNN model with high synergies would be the likely path. We have recently published an initial prototype OmniDet [114] showing joint modelling of reconstruction and recognition. Figure 16 illustrates its high-level architecture with cross-links shown across the different tasks.…”
Section: Synergies In Next Generation (mentioning)
Confidence: 99%
“…Fig. 16: Overview of our next generation unified multi-task visual perception framework. Refer to our OmniDet paper [114] for more details.…”
(mentioning)
Confidence: 99%
“…Segmentation of panoramic data, which is often captured through distortion-pronounced fisheye lenses [38], [39], [40] or multiple surround-view cameras [41], [42], [43], is challenging as it entails a set of hard tasks such as distortion elimination, camera synchronization and calibration, and data fusion, resulting in higher latency and complexity. Yang et al. introduce the PASS [7] and DS-PASS [44] frameworks, which naturally mitigate the effect of distortions by using a single-shot panoramic annular lens system, but come at a high memory and computation cost, as they require separating the panorama into multiple partitions for prediction, each resembling a narrow-FoV pinhole image.…”
Section: B. Semantic Segmentation For 360° Panoramic Images (mentioning)
Confidence: 99%
“…We start with the OmniDet [18] motion segmentation network, using a two-stream RGB-only network. The network consists of two ResNet18 streams with shared weights and a motion segmentation decoder with deconv layers for upsampling to the higher-resolution output.…”
Section: A. Baseline Architecture (mentioning)
Confidence: 99%