MRD-YOLO: A Multispectral Object Detection Algorithm for Complex Road Scenes

Chaoyue Sun; Yajun Chen; Xiaoyang Qiu; Rongzhen Li; Longxiang You

doi:10.3390/s24103222

MRD-YOLO: A Multispectral Object Detection Algorithm for Complex Road Scenes

Sensors (Basel). 2024 May 18;24(10):3222. doi: 10.3390/s24103222.

Authors

Chaoyue Sun¹, Yajun Chen¹, Xiaoyang Qiu¹, Rongzhen Li¹, Longxiang You¹

Affiliation

¹ School of Electronic Information Engineering, China West Normal University, Nanchong 637009, China.

Abstract

Object detection is one of the core technologies for autonomous driving. Current road object detection mainly relies on visible light, which is prone to missed detections and false alarms in rainy, night-time, and foggy scenes. Multispectral object detection based on the fusion of RGB and infrared images can effectively address the challenges of complex and changing road scenes, improving the detection performance of current algorithms in complex scenarios. However, previous multispectral detection algorithms suffer from issues such as poor fusion of dual-mode information, poor detection performance for multi-scale objects, and inadequate utilization of semantic information. To address these challenges and enhance the detection performance in complex road scenes, this paper proposes a novel multispectral object detection algorithm called MRD-YOLO. In MRD-YOLO, we utilize interaction-based feature extraction to effectively fuse information and introduce the BIC-Fusion module with attention guidance to fuse different modal information. We also incorporate the SAConv module to improve the model's detection performance for multi-scale objects and utilize the AIFI structure to enhance the utilization of semantic information. Finally, we conduct experiments on two major public datasets, FLIR_Aligned and M³FD. The experimental results demonstrate that compared to other algorithms, the proposed algorithm achieves superior detection performance in complex road scenes.

Keywords: autonomous vehicle; computer vision; multi-modality fusion; object detection.

Grants and funding

463177/China West Normal University