Abstract
Large-scale genome-wide association studies (GWAS) have offered an exciting opportunity to discover putative causal genes or risk factors associated with diseases by using SNPs as instrumental variables (IVs). However, conventional approaches assume linear causal relations partly for simplicity and partly for the availability of GWAS summary data. In this work, we propose a novel model for transcriptome-wide association studies (TWAS) to incorporate nonlinear relationships across IVs, an exposure/gene, and an outcome, which is robust against violations of the valid IV assumptions, permits the use of GWAS summary data, and covers two-stage least squares (2SLS) as a special case. We decouple the estimation of a marginal causal effect and a nonlinear transformation, where the former is estimated via sliced inverse regression and a sparse instrumental variable regression, and the latter is estimated by a ratio-adjusted inverse regression. On this ground, we propose an inferential procedure. An application of the proposed method to the ADNI gene expression data and the IGAP GWAS summary data identifies 18 causal genes associated with Alzheimer’s disease, including APOE and TOMM40, in addition to 7 other genes missed by 2SLS considering only linear relationships. Our findings suggest that nonlinear modeling is required to unleash the power of IV regression for identifying potentially nonlinear gene-trait associations. The source code and accompanying software nl-causal can be accessed through the link: https://github.com/statmlben/nonlinear-causal. © 2024 B. Dai, C. Li, H. Xue, W. Pan & X. Shen.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Third Conference on Causal Learning and Reasoning |
| Editors | Francesco Locatello, Vanessa Didelez |
| Publisher | ML Research Press |
| Pages | 793-826 |
| Volume | 236 |
| Publication status | Published - Apr 2024 |
| Event | 3rd Conference on Causal Learning and Reasoning, CLeaR 2024 - Los Angeles, United States Duration: 1 Apr 2024 → 3 Apr 2024 https://proceedings.mlr.press/v236/ |
Conference
| Conference | 3rd Conference on Causal Learning and Reasoning, CLeaR 2024 |
|---|---|
| Place | United States |
| City | Los Angeles |
| Period | 1/04/24 → 3/04/24 |
| Internet address |
Funding
We thank the reviewers and the area chair for many insightful comments and suggestions. This research was supported by HK GRF grants 14304823, 24302422 and CUHK Science Direct Grant for Research, NSF grant DMS-1952539, NIH grants R01 GM113250, R01 GM126002, RF1 AG067924, U01 AG073079, R01 AG074858, and R01 AG065636.
Research Keywords
- GWAS
- nonlinear causal effect
- sliced inverse regression
- TWAS
RGC Funding Information
- RGC-funded