Paper
13 June 2024 HSDFormer: an improved transformer for remote sensing image semantic segmentation
Yang Bai, Suixiang Gao
Author Affiliations +
Proceedings Volume 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024); 131800X (2024) https://doi.org/10.1117/12.3033799
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 2024, Guangzhou, China
Abstract
The extraction of building locations is crucial in the field of remote sensing, commonly applied in tasks such as emergency response, urban planning, and environmental monitoring. Existing methods often employ convolutional models such as ResNet and U-Net. however, these models struggle to capture long-range features of geographical objects, limiting their practical effectiveness. Meanwhile, Transformer based models face challenges due to their quadratic computational complexity. Additionally, the characteristics of remote sensing images, including dense arrangement, small scale, and class imbalance, pose difficulties for traditional patch-merging modules in downsampling, leading to the loss of significant building information. To address such limitations, we present the HSDFormer model in this paper. The model achieves a reduction in the computational complexity of self-attention calculation through a straightforward sequential reduction process. Additionally, our model has optimized the downsampling process specifically for remote sensing semantic segmentation tasks, effectively minimizing the loss of crucial information during layer-wise feature extraction. We evaluate the HSDFormer on the WHU Building Dataset and Satellite Dataset II (East Asia). The results demonstrate the model’s effectiveness in accurately segmenting building locations, surpassing performance compared to other baseline models.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yang Bai and Suixiang Gao "HSDFormer: an improved transformer for remote sensing image semantic segmentation", Proc. SPIE 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 131800X (13 June 2024); https://doi.org/10.1117/12.3033799
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Remote sensing

Semantics

Transformers

Feature extraction

Performance modeling

Data modeling

Back to Top