Improving Deep Learning on Point Cloud by Maximizing Mutual Information Across Layers

Di Wang, Lulu Tang, Xu Wang, Luqing Luo, Zhi-Xin Yang*

*Corresponding author for this work

Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review

12 Citations (Scopus)

Abstract

Enhancing the perception capability of point cloud learning networks is a fundamental task in 3D machine vision applications. Most existing methods rely on feature fusion and geometric transformation to improve point cloud learning, without paying enough attention to mining intrinsic information across multiple network layers. To improve consistency between hierarchical features and strengthen the perception capability of the point cloud network, we explore whether maximizing the mutual information (MI) across shallow and deep layers benefits representation learning on point clouds. We propose a novel Maximizing Mutual Information (MMI) module, which assists the training of the main network in capturing discriminative features of the input point clouds. Specifically, an MMI-based loss function constrains the differences in semantic information between two hierarchical features extracted from the shallow and deep layers of the network. Extensive experiments show that our method is generally applicable to point cloud tasks, including classification, shape retrieval, indoor scene segmentation, 3D object detection, and completion, and demonstrate its efficacy and advantages over existing methods. Our source code will be available at https://github.com/wendydidi/MMI.git.
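
The abstract does not say which MI estimator the MMI module uses. As a rough illustration only, the sketch below uses the InfoNCE contrastive bound, a common stand-in for maximizing MI between two feature sets, and should not be read as the authors' loss. It assumes that the shallow and deep features have already been pooled and projected to a common dimension, and that row i of each batch comes from the same point cloud. The names `info_nce_loss`, `mi_lower_bound`, and `temperature` are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(dot(v, v)) or 1.0
    return [a / norm for a in v]


def info_nce_loss(shallow, deep, temperature=0.1):
    """InfoNCE loss over a batch of paired (shallow, deep) feature vectors.

    Assumption: shallow[i] and deep[i] describe the same point cloud
    (positive pair); deep[j] with j != i act as negatives.
    """
    s = [normalize(v) for v in shallow]
    d = [normalize(v) for v in deep]
    n = len(s)
    total = 0.0
    for i in range(n):
        # Cosine similarity of sample i's shallow feature to every deep feature.
        logits = [dot(s[i], d[j]) / temperature for j in range(n)]
        # Numerically stable log-sum-exp over the batch.
        m = max(logits)
        log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
        # Cross-entropy with the matching deep feature as the target.
        total += log_sum - logits[i]
    return total / n


def mi_lower_bound(shallow, deep, temperature=0.1):
    """InfoNCE bound: I(shallow; deep) >= log(batch size) - loss."""
    return math.log(len(shallow)) - info_nce_loss(shallow, deep, temperature)


# Toy batch: three samples with 2-D features (illustrative values only).
shallow_feats = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
deep_aligned = [[2.0, 0.0], [0.0, 3.0], [-1.5, 0.0]]   # consistent with shallow
deep_shuffled = [[0.0, 3.0], [-1.5, 0.0], [2.0, 0.0]]  # pairs mismatched
```

In a training setup of this kind, such a term would typically be added to the task loss with a weighting factor, so that minimizing it pushes paired shallow and deep features toward agreement. On the toy batch, the aligned pairs yield a much smaller loss than the shuffled ones.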
Original language: English
Article number: 108892
Journal: Pattern Recognition
Volume: 131
Online published: 8 Jul 2022
Publication status: Published - Nov 2022

Research Keywords

  • Deep learning
  • 3D vision
  • Point clouds
  • Mutual information
