Paper
8 June 2024 MSSF-DCNet: multi-scale selective fusion with dense connectivity network for sonar image object detection
Yu Dong, Jianlei Zhang, Chunyan Zhang
Author Affiliations +
Proceedings Volume 13171, Third International Conference on Algorithms, Microchips, and Network Applications (AMNA 2024); 131711U (2024) https://doi.org/10.1117/12.3032084
Event: 3rd International Conference on Algorithms, Microchips and Network Applications (AMNA 2024), 2024, Jinan, China
Abstract
In the field of underwater target recognition, forward-looking sonar images are widely applied in underwater rescue operations. The emergence of object detection technologies powered by deep learning has significantly enhanced the ability to recognize underwater targets. In object detection, the neck network, serving as a critical intermediary component, plays a vital role. However, traditional Feature Pyramid Networks (FPN) have two main problems: 1) During the feature fusion process, FPN does not modify the importance of features across various levels, resulting in imbalanced features at different scales and loss of scale information. 2) Lack of effective information transmission between features of different scales. In this article, we propose a novel neck network architecture, Multi Scale Selective Fusion with Dense Connectivity Network (MSSF-DCNet), which encompasses two components to tackle the previously mentioned challenges. The first one is the Multi Scale Selection Module, which effectively balances the weights of features at different levels during the feature fusion process by calculating and weighting weights for different scales, better preserving scale information. The second one is the Cross Scale Dense Connection module, which exchanges information between different feature layer levels. The model is capable of capturing global context information at every layer. thereby improving the detection capability of the neck network. By replacing the FPN with MSSF-DCNet in the Faster R-CNN framework, our model achieves an increase in Average Precision (AP) by 1.2, 4.0, and 2.6 points using MobileNet-v2, ResNet50, and SwinTransformer backbones, respectively. Furthermore, when employing ResNet50 as the backbone, MSSF-DCNet enhances the RetinaNet by 3.4 AP and ATSS by 4.1 AP. At the same time, we compared different neck networks with MSSF-DCNet on the Faster R-CNN baseline network, and MSSF-DCNet achieved the best performance in all metrics.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yu Dong, Jianlei Zhang, and Chunyan Zhang "MSSF-DCNet: multi-scale selective fusion with dense connectivity network for sonar image object detection", Proc. SPIE 13171, Third International Conference on Algorithms, Microchips, and Network Applications (AMNA 2024), 131711U (8 June 2024); https://doi.org/10.1117/12.3032084
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Neck

Feature fusion

Convolution

Design

Feature extraction

Deformation

Back to Top