Skip to main navigation Skip to search Skip to main content

Communication-efficient distributed estimation of partially linear additive models for large-scale data

  • Junzhuo Gao
  • , Lei Wang*
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Distributed estimation for parametric models has drawn attention in modern statistical learning, but few studies focus on semiparametric models. In this paper, we propose two communication-efficient distributed estimators for partially linear additive models with high-dimensional covariates. The commonly used B-spline basis functions are first applied to approximate the nonparametric functions and then we construct a profiled communication-efficient surrogate loss function with Lasso penalty based on one local machine solving the final optimization problem. Further, to reduce the effect of local machines and improve the stability of the algorithm, a profiled gradient-enhanced loss estimator is derived. The resulting two estimators and their theoretical convergence rates for both parametric and nonparametric components are established. The finite-sample performance of the proposed estimators is studied through simulations and an application to appliances energy prediction data set is also presented. © 2023 Elsevier Inc.
Original languageEnglish
Pages (from-to)185-201
JournalInformation Sciences
Volume631
Online published28 Feb 2023
DOIs
Publication statusPublished - Jun 2023
Externally publishedYes

Research Keywords

  • B-spline
  • Communication-efficient surrogate loss
  • Distributed estimator
  • Partially linear additive models

Fingerprint

Dive into the research topics of 'Communication-efficient distributed estimation of partially linear additive models for large-scale data'. Together they form a unique fingerprint.

Cite this