TY - JOUR
T1 - QTU-Net
T2 - Quaternion Transformer-based U-Net for Water Body Extraction of RGB Satellite Image
AU - Wang, Mingzhi
AU - Li, Chunshan
AU - Yang, Xiaofei
AU - Chu, Dianhui
AU - Zhou, Zhiquan
AU - Lau, Raymond Y. K.
PY - 2024
Y1 - 2024
N2 - Deep learning models have achieved great success in water body extraction (WBE) from remote sensing image. However, existing deep learning-based extraction methods exhibit limitations in their ability to fully explore the intricate interconnections inherent in RGB color satellite imagery and to enhance semantic representation across diverse regions. Furthermore, these methods often struggle with challenges posed by the uneven distribution of water bodies at different scales within the image, as well as substantial color disparities between water and land areas. In this paper, we tackle WBE task from Quaternion domain and introduce a novel approach called Quaternion Transformer-based U-Net (QTU-Net) to address these challenges. Our method specifically leverages quaternion convolution operations to capture the holistic relationships among RGB channels, thereby enhancing the semantic representation of WBE. Additionally, we propose a Quantization Initialization module (QIM) to determine optimal RGB weights and facilitate the generation of quaternion data. To further improve the accuracy of water body delineation, we incorporate an innovative Multi-Scale Similarity Aggregation Attention (MSAA) component that enhances local similarity capture across various scales. Finally, we evaluate the proposed QTU-Net based on three publicly available benchmark datasets. The experimental results demonstrate that the proposed QTU-Net outperforms state-of-the-art baseline methods. © 2024 IEEE.
AB - Deep learning models have achieved great success in water body extraction (WBE) from remote sensing image. However, existing deep learning-based extraction methods exhibit limitations in their ability to fully explore the intricate interconnections inherent in RGB color satellite imagery and to enhance semantic representation across diverse regions. Furthermore, these methods often struggle with challenges posed by the uneven distribution of water bodies at different scales within the image, as well as substantial color disparities between water and land areas. In this paper, we tackle WBE task from Quaternion domain and introduce a novel approach called Quaternion Transformer-based U-Net (QTU-Net) to address these challenges. Our method specifically leverages quaternion convolution operations to capture the holistic relationships among RGB channels, thereby enhancing the semantic representation of WBE. Additionally, we propose a Quantization Initialization module (QIM) to determine optimal RGB weights and facilitate the generation of quaternion data. To further improve the accuracy of water body delineation, we incorporate an innovative Multi-Scale Similarity Aggregation Attention (MSAA) component that enhances local similarity capture across various scales. Finally, we evaluate the proposed QTU-Net based on three publicly available benchmark datasets. The experimental results demonstrate that the proposed QTU-Net outperforms state-of-the-art baseline methods. © 2024 IEEE.
KW - Convolutional neural network (CNN)
KW - Feature extraction
KW - Image color analysis
KW - Oil insulation
KW - Quaternion convolution
KW - Quaternions
KW - Remote sensing
KW - Satellite images
KW - Transformer Network
KW - Transformers
KW - U-Net
KW - Water body extraction
UR - http://www.scopus.com/inward/record.url?scp=85198371449&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85198371449&origin=recordpage
U2 - 10.1109/TGRS.2024.3426475
DO - 10.1109/TGRS.2024.3426475
M3 - RGC 21 - Publication in refereed journal
SN - 0196-2892
VL - 62
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5634816
ER -