Skip to main navigation Skip to search Skip to main content

XPXP: Improving polygenic prediction by cross-population and cross-phenotype analysis

Jiashun Xiao, Mingxuan Cai, Xianghong Hu, Xiang Wan*, Gang Chen*, Can Yang*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Motivation: As increasing sample sizes from genome-wide association studies (GWASs), polygenic risk scores (PRSs) have shown great potential in personalized medicine with disease risk prediction, prevention and treatment. However, the PRS constructed using European samples becomes less accurate when it is applied to individuals from non-European populations. It is an urgent task to improve the accuracy of PRSs in under-represented populations, such as African populations and East Asian populations. 
Results: In this article, we propose a cross-population and cross-phenotype (XPXP) method for construction of PRSs in under-represented populations. XPXP can construct accurate PRSs by leveraging biobank-scale datasets in European populations and multiple GWASs of genetically correlated phenotypes. XPXP also allows to incorporate population-specific and phenotype-specific effects, and thus further improves the accuracy of PRS. Through comprehensive simulation studies and real data analysis, we demonstrated that our XPXP outperformed existing PRS approaches. We showed that the height PRSs constructed by XPXP achieved 9% and 18% improvement over the runner-up method in terms of predicted R2 in East Asian and African populations, respectively. We also showed that XPXP substantially improved the stratification ability in identifying individuals at high genetic risk of type 2 diabetes.
Original languageEnglish
Pages (from-to)1947-1955
JournalBioinformatics
Volume38
Issue number7
Online published18 Jan 2022
DOIs
Publication statusPublished - 1 Apr 2022
Externally publishedYes

Funding

This work was supported in part by National Key R&D Program of China [2020YFA0713900]; Hong Kong Research Grant Council [16307818, 16301419, 16308120]; Hong Kong Innovation and Technology Fund [PRP/029/19FX]; Hong Kong University of Science and Technology [startup grant R9405, Z0428 from the Big Data Institute]; and the Open Research Fund from Shenzhen Research Institute of Big Data [2019ORF01004]. The computational task for this work was partially performed using the X-GPU cluster supported by the RGC Collaborative Research Fund [C6021-19EF].

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being
  2. SDG 4 - Quality Education
    SDG 4 Quality Education

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'XPXP: Improving polygenic prediction by cross-population and cross-phenotype analysis'. Together they form a unique fingerprint.

Cite this