Multimodal Image Classification by Multiview Latent Pattern Extraction, Selection, and Correlation

Jianghong Ma, Weixuan Kou, Mingquan Lin, Carmen C. M. Cho, Bernard Chiu*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

4 Citations (Scopus)

Abstract

The large amount of data available in the modern big data era opens new opportunities to expand our knowledge by integrating information from heterogeneous sources. Multiview learning has recently achieved tremendous success in deriving complementary information from multiple data modalities. This article proposes a framework called multiview latent space projection (MVLSP) to integrate features extracted from multiple sources in a discriminative way to facilitate binary and multiclass classifications. Our approach is associated with three innovations. First, most existing multiview learning algorithms promote pairwise consistency between two views and do not have a natural extension to applications with more than two views. MVLSP finds optimum mappings from a common latent space to match the feature space in each of the views. As the matching is performed on a view-by-view basis, the framework can be readily extended to multiview applications. Second, feature selection in the common latent space can be readily achieved by adding a class view, which matches the latent space representations of training samples with their corresponding labels. Then, high-order view correlations are extracted by considering feature-label correlations. Third, a technique is proposed to optimize the integration of different latent patterns based on their correlations. The experimental results on the prostate image dataset demonstrate the effectiveness of the proposed method. © 2022 IEEE.
Original languageEnglish
Pages (from-to)8134-8148
JournalIEEE Transactions on Neural Networks and Learning Systems
Volume35
Issue number6
Online published5 Dec 2022
DOIs
Publication statusPublished - Jun 2024

Funding

.This work was supported in part by the Innovation and Technology Commission of Hong Kong under Project ITS/387/18 and in part by the Research Grant Council of Hong Kong, SAR, China, under Project CityU 11205421.

Research Keywords

  • Common latent space
  • Gleason grade prediction
  • high-order view correlations
  • latent pattern selection and correlation
  • multiview latent space projection (MVLSP)

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Multimodal Image Classification by Multiview Latent Pattern Extraction, Selection, and Correlation'. Together they form a unique fingerprint.

Cite this