MSSF-DCNet: multi-scale selective fusion with dense connectivity network for sonar image object detection

Yu Dong; Jianlei Zhang; Chunyan Zhang

doi:10.1117/12.3032084

8 June 2024

Proceedings Volume 13171, Third International Conference on Algorithms, Microchips, and Network Applications (AMNA 2024); 131711U (2024) https://doi.org/10.1117/12.3032084
Event: 3rd International Conference on Algorithms, Microchips and Network Applications (AMNA 2024), 2024, Jinan, China

Abstract

In underwater target recognition, forward-looking sonar images are widely used in underwater rescue operations. Object detection based on deep learning has significantly improved the recognition of underwater targets. Within a detector, the neck network is a critical intermediary component. However, the traditional Feature Pyramid Network (FPN) has two main problems: 1) during feature fusion, FPN does not adjust the importance of features across levels, which leaves features at different scales imbalanced and loses scale information; 2) it lacks effective information transmission between features of different scales. In this article, we propose a novel neck network architecture, Multi Scale Selective Fusion with Dense Connectivity Network (MSSF-DCNet), which comprises two components that address these problems. The first is the Multi Scale Selection Module, which computes a weight for each scale and applies it during feature fusion, balancing the features from different levels and better preserving scale information. The second is the Cross Scale Dense Connection module, which exchanges information between feature levels so that the model can capture global context information at every layer, thereby improving the detection capability of the neck network. Replacing FPN with MSSF-DCNet in the Faster R-CNN framework increases Average Precision (AP) by 1.2, 4.0, and 2.6 points with MobileNet-v2, ResNet50, and Swin Transformer backbones, respectively. Furthermore, with ResNet50 as the backbone, MSSF-DCNet improves RetinaNet by 3.4 AP and ATSS by 4.1 AP. We also compared MSSF-DCNet with other neck networks on the Faster R-CNN baseline, and MSSF-DCNet achieved the best performance on all metrics.
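The abstract does not give the formulation of the Multi Scale Selection Module, so the following is only a minimal sketch of the general idea of weighting feature levels before fusing them, not the authors' method. It assumes the levels have already been resized to a common shape, scores each level by its global average, and normalizes the scores with a softmax; the function names (`global_average`, `softmax`, `selective_fuse`) and the scoring rule are hypothetical choices made for illustration.

```python
import math


def global_average(feature_map):
    """Mean of all values in a 2D feature map given as a list of rows."""
    values = [v for row in feature_map for v in row]
    return sum(values) / len(values)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def selective_fuse(levels):
    """Fuse same-sized 2D maps using one softmax weight per level.

    Assumption for this sketch: each level's score is its global average.
    A learned module would typically predict the scores instead.
    """
    weights = softmax([global_average(fm) for fm in levels])
    rows, cols = len(levels[0]), len(levels[0][0])
    fused = [
        [sum(w * fm[r][c] for w, fm in zip(weights, levels)) for c in range(cols)]
        for r in range(rows)
    ]
    return fused, weights


# Three toy "pyramid levels", already resized to the same 2x2 shape.
levels = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[2.0, 2.0], [2.0, 2.0]],
    [[3.0, 3.0], [3.0, 3.0]],
]
fused, weights = selective_fuse(levels)
print("weights per level:", weights)
print("fused map:", fused)
```

Unlike a plain element-wise sum, the weights here sum to one and depend on the content of each level, so a level with a stronger response contributes more to the fused map.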

(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation

Yu Dong, Jianlei Zhang, and Chunyan Zhang "MSSF-DCNet: multi-scale selective fusion with dense connectivity network for sonar image object detection", Proc. SPIE 13171, Third International Conference on Algorithms, Microchips, and Network Applications (AMNA 2024), 131711U (8 June 2024); https://doi.org/10.1117/12.3032084

PROCEEDINGS
10 PAGES

KEYWORDS

Object detection

Neck

Feature fusion

Convolution

Design

Feature extraction

Deformation