Communication-efficient estimation of quantile matrix regression for massive datasets
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 107812 |
Journal / Publication | Computational Statistics and Data Analysis |
Volume | 187 |
Online published | 27 Jun 2023 |
Publication status | Published - Nov 2023 |
Link(s)
Abstract
In modern scientific applications, more and more data sets contain natural matrix predictors and traditional regression methods are not directly applicable. Matrix regression has been adapted to such data structure and received increasing attention in recent years. In this paper, we consider estimation of the conditional quantile in high-dimensional regularized matrix regression with a nuclear norm penalty and establish the convergence rate of the estimator. In order to construct a quantile matrix regression estimator in the distributed setting or for massive data sets, we propose a regularized communication-efficient surrogate loss (CSL) function. The proposed CSL method only needs the worker machines to compute the gradient based on local data and the central machine solves a regularized estimation problem. We prove that the estimation error based on the proposed CSL method matches the estimation error bound of the centralized method that analyzes the entire data set. An alternating direction method of multipliers algorithm is developed to efficiently obtain the distributed CSL estimator. The finite-sample performance of the proposed estimator is studied through simulations and an application to Beijing Air Quality data set. © 2023 Elsevier B.V.
Research Area(s)
- Communication-efficient surrogate loss, Distributed estimator, Divide and conquer, Empirical processes, High-dimensional matrix regression
Citation Format(s)
Communication-efficient estimation of quantile matrix regression for massive datasets. / Yang, Yaohong; Wang, Lei; Liu, Jiamin et al.
In: Computational Statistics and Data Analysis, Vol. 187, 107812, 11.2023.
In: Computational Statistics and Data Analysis, Vol. 187, 107812, 11.2023.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review