Communication-efficient estimation of quantile matrix regression for massive datasets

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

2 Scopus Citations
View graph of relations

Author(s)

  • Yaohong Yang
  • Lei Wang
  • Jiamin Liu
  • Rui Li
  • Heng Lian

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article number107812
Journal / PublicationComputational Statistics and Data Analysis
Volume187
Online published27 Jun 2023
Publication statusPublished - Nov 2023

Abstract

In modern scientific applications, more and more data sets contain natural matrix predictors and traditional regression methods are not directly applicable. Matrix regression has been adapted to such data structure and received increasing attention in recent years. In this paper, we consider estimation of the conditional quantile in high-dimensional regularized matrix regression with a nuclear norm penalty and establish the convergence rate of the estimator. In order to construct a quantile matrix regression estimator in the distributed setting or for massive data sets, we propose a regularized communication-efficient surrogate loss (CSL) function. The proposed CSL method only needs the worker machines to compute the gradient based on local data and the central machine solves a regularized estimation problem. We prove that the estimation error based on the proposed CSL method matches the estimation error bound of the centralized method that analyzes the entire data set. An alternating direction method of multipliers algorithm is developed to efficiently obtain the distributed CSL estimator. The finite-sample performance of the proposed estimator is studied through simulations and an application to Beijing Air Quality data set. © 2023 Elsevier B.V.

Research Area(s)

  • Communication-efficient surrogate loss, Distributed estimator, Divide and conquer, Empirical processes, High-dimensional matrix regression

Citation Format(s)

Communication-efficient estimation of quantile matrix regression for massive datasets. / Yang, Yaohong; Wang, Lei; Liu, Jiamin et al.
In: Computational Statistics and Data Analysis, Vol. 187, 107812, 11.2023.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review