Multimodal Sensing: Technologies and Applications 2019
DOI: 10.1117/12.2526067
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal data fusion for object recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 18 publications
0
10
0
Order By: Relevance
“…In this section, we present the performance of the proposed YOLOrs architecture on the VEhicle Detection in Aerial Imagery (VEDAI) dataset [46]. The performance of YOLOrs is compared with that of unimodal YOLOv4 [26], EfficientDet (D0) [47], RetinaNet (with backbone ResNet 50) [43], and YOLOv3 [18] trained on RGB and IR images as well as their multimodal versions trained on concatenated RGB and IR images [45] as shown in Fig. 6.…”
Section: Experimental Studiesmentioning
confidence: 99%
See 1 more Smart Citation
“…In this section, we present the performance of the proposed YOLOrs architecture on the VEhicle Detection in Aerial Imagery (VEDAI) dataset [46]. The performance of YOLOrs is compared with that of unimodal YOLOv4 [26], EfficientDet (D0) [47], RetinaNet (with backbone ResNet 50) [43], and YOLOv3 [18] trained on RGB and IR images as well as their multimodal versions trained on concatenated RGB and IR images [45] as shown in Fig. 6.…”
Section: Experimental Studiesmentioning
confidence: 99%
“…In Fig. 10b, we plot the mAP values of multimodal approaches versus training epoch index and observe that the proposed YOLOrs outperforms YOLOv3 [45], RetinaNet, EfficientDet (D0) [47], and YOLOv4 [26] with early fusion approaches.…”
Section: ) Multimodal Datamentioning
confidence: 99%
“…To provide synchronization of collected color and thermal images the technique based on scene 3D reconstruction (Knyaz, 2019) was applied. Both color and thermal image sequences were used for scene 3D reconstruction by structure from motion technique.…”
Section: Scene 3d Model Reconstructionmentioning
confidence: 99%
“…The multimodal dataset generated and augmented using this technique was used for CNN training for the tasks of object detection and object re-identification (Knyaz, 2019), (Kniaz and Knyaz, 2019). The evaluation of CNN trained on the created multimodal dataset showed improving of the CNN performance for considered tasks.…”
Section: Figure 7 Image Orientation and Synchronizationmentioning
confidence: 99%
“…The LEART training set was used to train the modified network architecture [6]. This sample was collected using a DJI Mavic PRO UAV, equipped with an integrated visible camera, and an additional far-infrared camera (8-14 μm) MH-SM576-6 with a resolution of 640 × 480 pixels.…”
Section: Dataset Generationmentioning
confidence: 99%