TY - JOUR
T1 - Improving Deep Learning on Point Cloud by Maximizing Mutual Information Across Layers
AU - Wang, Di
AU - Tang, Lulu
AU - Wang, Xu
AU - Luo, Luqing
AU - Yang, Zhi-Xin
PY - 2022/11
Y1 - 2022/11
N2 - It is a fundamental and vital task to enhance the perception capability of the point cloud learning network in 3D machine vision applications. Most existing methods utilize feature fusion and geometric transformation to improve point cloud learning without paying enough attention to mining further intrinsic information across multiple network layers. Motivated to improve consistency between hierarchical features and strengthen the perception capability of the point cloud network, we propose exploring whether maximizing the mutual information (MI) across shallow and deep layers is beneficial to improve representation learning on point clouds. A novel design of Maximizing Mutual Information (MMI) Module is proposed, which assists the training process of the main network to capture discriminative features of the input point clouds. Specifically, the MMI-based loss function is employed to constrain the differences of semantic information in two hierarchical features extracted from the shallow and deep layers of the network. Extensive experiments show that our method is generally applicable to point cloud tasks, including classification, shape retrieval, indoor scene segmentation, 3D object detection, and completion, and illustrate the efficacy of our proposed method and its advantages over existing ones. Our source code will be available at https://github.com/wendydidi/MMI.git.
AB - It is a fundamental and vital task to enhance the perception capability of the point cloud learning network in 3D machine vision applications. Most existing methods utilize feature fusion and geometric transformation to improve point cloud learning without paying enough attention to mining further intrinsic information across multiple network layers. Motivated to improve consistency between hierarchical features and strengthen the perception capability of the point cloud network, we propose exploring whether maximizing the mutual information (MI) across shallow and deep layers is beneficial to improve representation learning on point clouds. A novel design of Maximizing Mutual Information (MMI) Module is proposed, which assists the training process of the main network to capture discriminative features of the input point clouds. Specifically, the MMI-based loss function is employed to constrain the differences of semantic information in two hierarchical features extracted from the shallow and deep layers of the network. Extensive experiments show that our method is generally applicable to point cloud tasks, including classification, shape retrieval, indoor scene segmentation, 3D object detection, and completion, and illustrate the efficacy of our proposed method and its advantages over existing ones. Our source code will be available at https://github.com/wendydidi/MMI.git.
KW - Deep learning
KW - 3D vision
KW - Point clouds
KW - Mutual information
UR - http://www.scopus.com/inward/record.url?scp=85134428088&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85134428088&origin=recordpage
U2 - 10.1016/j.patcog.2022.108892
DO - 10.1016/j.patcog.2022.108892
M3 - RGC 21 - Publication in refereed journal
SN - 0031-3203
VL - 131
JO - Pattern Recognition
JF - Pattern Recognition
M1 - 108892
ER -