Statistical Inference of Cell-type Proportions Estimated from Bulk Expression Data

Research output: Journal Publications and Reviews > RGC 21 - Publication in refereed journal > peer-review


Author(s)

  • Biao Cai
  • Jingfei Zhang
  • Hongyu Li
  • Chang Su
  • Hongyu Zhao


Detail(s)

Original language: English
Pages (from-to): 2521-2532
Number of pages: 13
Journal / Publication: Journal of the American Statistical Association
Volume: 119
Issue number: 548
Online published: 20 Sept 2024
Publication status: Published - 2024

Abstract

There is a growing interest in cell-type-specific analysis from bulk samples with a mixture of different cell types. A critical first step in such analyses is the accurate estimation of cell-type proportions in a bulk sample. Although many methods have been proposed recently, quantifying the uncertainties associated with the estimated cell-type proportions has not been well studied. Lack of consideration of these uncertainties can lead to missed or false findings in downstream analyses. In this article, we introduce a flexible statistical deconvolution framework that allows a general and subject-specific covariance of bulk gene expressions. Under this framework, we propose a decorrelated constrained least squares method called DECALS that estimates cell-type proportions as well as the sampling distribution of the estimates. Simulation studies demonstrate that DECALS can accurately quantify the uncertainties in the estimated proportions whereas other methods fail. Applying DECALS to analyze bulk gene expression data of post mortem brain samples from the ROSMAP and GTEx projects, we show that taking into account the uncertainties in the estimated cell-type proportions can lead to more accurate identifications of cell-type-specific differentially expressed genes and transcripts between different subject groups, such as between Alzheimer’s disease patients and controls and between males and females. © 2024 American Statistical Association.
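
To make the deconvolution setting concrete, the following is a minimal sketch of plain constrained least squares: a bulk expression vector is modeled as a mixture of cell-type signature profiles, and the proportions are constrained to be non-negative and to sum to one. It is not DECALS: it performs no decorrelation and does not estimate the sampling distribution of the proportions. The function names, the projected-gradient solver, and the toy signature matrix are all assumptions made for illustration and do not come from the article.

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {p : p >= 0, sum(p) = 1}."""
    u = sorted(v, reverse=True)
    cumulative = 0.0
    theta = 0.0
    for j, u_j in enumerate(u, start=1):
        cumulative += u_j
        candidate = (cumulative - 1.0) / j
        # The last index at which this holds determines the shift theta.
        if u_j - candidate > 0:
            theta = candidate
    return [max(x - theta, 0.0) for x in v]


def estimate_proportions(signature, bulk, n_iter=5000):
    """Minimize ||bulk - signature @ p||^2 over the simplex.

    signature: list of rows (genes), each a list over cell types.
    bulk: list of bulk expression values, one per gene.
    Uses projected gradient descent (a generic solver, chosen for brevity).
    """
    n_genes = len(signature)
    n_types = len(signature[0])
    # Conservative step size: the squared Frobenius norm bounds the
    # largest eigenvalue of signature^T signature.
    step = 1.0 / (2.0 * sum(s * s for row in signature for s in row))
    p = [1.0 / n_types] * n_types
    for _ in range(n_iter):
        residual = [
            sum(s * q for s, q in zip(row, p)) - y
            for row, y in zip(signature, bulk)
        ]
        grad = [
            2.0 * sum(signature[g][k] * residual[g] for g in range(n_genes))
            for k in range(n_types)
        ]
        p = project_to_simplex([q - step * g for q, g in zip(p, grad)])
    return p


# Toy example with made-up numbers: 4 genes, 3 cell types, no noise.
signature = [
    [10.0, 1.0, 1.0],
    [1.0, 8.0, 1.0],
    [1.0, 1.0, 9.0],
    [2.0, 2.0, 2.0],
]
true_p = [0.6, 0.3, 0.1]
bulk = [sum(s * q for s, q in zip(row, true_p)) for row in signature]

estimate = estimate_proportions(signature, bulk)
print([round(x, 4) for x in estimate])
```

In this noise-free toy case the estimate should match the mixing proportions used to build the bulk vector. With real data the estimate carries sampling error, which is the uncertainty the abstract says needs to be quantified before downstream analysis.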

Research Area(s)

  • cell type deconvolution, cell-type-specific analysis, cell-type proportions, decorrelation, uncertainty quantification

Citation Format(s)

Statistical Inference of Cell-type Proportions Estimated from Bulk Expression Data. / Cai, Biao; Zhang, Jingfei; Li, Hongyu et al.
In: Journal of the American Statistical Association, Vol. 119, No. 548, 2024, p. 2521-2532.
