TY - JOUR
T1 - Supervised graph clustering for cancer subtyping based on survival analysis and integration of multi-omic tumor data
AU - Liu, Cheng
AU - Cao, Wenming
AU - Wu, Si
AU - Shen, Wenjun
AU - Jiang, Dazhi
AU - Yu, Zhiwen
AU - Wong, Hau-San
PY - 2022/3
Y1 - 2022/3
N2 - Identifying cancer subtypes by integration of multi-omic data is beneficial to improve the understanding of disease progression, and provides more precise treatment for patients. Cancer subtypes identification is usually accomplished by clustering patients with unsupervised learning approaches. Thus, most existing integrative cancer subtyping methods are performed in an entirely unsupervised way. An integrative cancer subtyping approach can be improved to discover clinically more relevant cancer subtypes when considering the clinical survival response variables. In this study, we propose a Survival Supervised Graph Clustering (S2GC) for cancer subtyping by taking into consideration survival information. Specifically, we use a graph to represent similarity of patients, and develop a multi-omic survival analysis embedding with patient-to-patient similarity graph learning for cancer subtype identification. The multi-view (omic) survival analysis model and graph of patients are jointly learned in a unified way. In the proposed model, the survival analysis model and adaptive graph learning could positively reinforce each other. Consequently, the survival time can be considered as supervised information to improve the quality of the similarity graph and explore clinically more relevant subgroups of patients. Experiments on several representative multi-omic cancer datasets demonstrate that the proposed method achieves better results than a number of state-of-the-art methods. The results
also suggest that our method is able to identify biologically meaningful subgroups for different cancer types.
AB - Identifying cancer subtypes by integration of multi-omic data is beneficial to improve the understanding of disease progression, and provides more precise treatment for patients. Cancer subtypes identification is usually accomplished by clustering patients with unsupervised learning approaches. Thus, most existing integrative cancer subtyping methods are performed in an entirely unsupervised way. An integrative cancer subtyping approach can be improved to discover clinically more relevant cancer subtypes when considering the clinical survival response variables. In this study, we propose a Survival Supervised Graph Clustering (S2GC) for cancer subtyping by taking into consideration survival information. Specifically, we use a graph to represent similarity of patients, and develop a multi-omic survival analysis embedding with patient-to-patient similarity graph learning for cancer subtype identification. The multi-view (omic) survival analysis model and graph of patients are jointly learned in a unified way. In the proposed model, the survival analysis model and adaptive graph learning could positively reinforce each other. Consequently, the survival time can be considered as supervised information to improve the quality of the similarity graph and explore clinically more relevant subgroups of patients. Experiments on several representative multi-omic cancer datasets demonstrate that the proposed method achieves better results than a number of state-of-the-art methods. The results
also suggest that our method is able to identify biologically meaningful subgroups for different cancer types.
KW - Multi-omic data
KW - Cancer subtype identification
KW - Survival analysis
KW - Data integration
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85118646531&origin=recordpage
U2 - 10.1109/TCBB.2020.3010509
DO - 10.1109/TCBB.2020.3010509
M3 - RGC 21 - Publication in refereed journal
SN - 1545-5963
VL - 19
SP - 1193
EP - 1202
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 2
ER -