Robust kernel-based distribution regression

Zhan Yu*, Daniel W. C. Ho, Zhongjie Shi, Ding-Xuan Zhou

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

12 Citations (Scopus)

Abstract

Regularization schemes for regression have been widely studied in learning theory and inverse problems. In this paper, we study regularized distribution regression (DR) which involves two stages of sampling, and aims at regressing from probability measures to real-valued responses by regularization over a reproducing kernel Hilbert space. Many important tasks in statistical learning and inverse problems can be treated in this framework. Examples include multi-instance learning and point estimation for problems without analytical solutions. Recently, theoretical analysis on DR has been carried out via kernel ridge regression and several interesting learning behaviors have been observed. However, the topic has not been explored and understood beyond the least squares based DR. By introducing a robust loss function lσ for two-stage sampling problems, we present a novel robust distribution regression (RDR) scheme. With a windowing function V and a scaling parameter σ which can be appropriately chosen, lσ can include a wide range of commonly used loss functions that enrich the theme of DR. Moreover, the loss lσ is not necessarily convex, which enlarges the regression class (least squares) in the literature of DR. Learning rates in different regularity ranges of the regression function are comprehensively studied and derived via integral operator techniques. The scaling parameter σ is shown to be crucial in providing robustness and satisfactory learning rates of RDR.
Original languageEnglish
Article number105014
JournalInverse Problems
Volume37
Issue number10
Online published21 Sept 2021
DOIs
Publication statusPublished - Oct 2021

Research Keywords

  • learning theory
  • distribution regression
  • robust regression
  • integral operator
  • learning rate
  • consistency
  • correntropy

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Robust kernel-based distribution regression'. Together they form a unique fingerprint.

Cite this