Skip to main navigation Skip to search Skip to main content

Extreme values identification in regression using a peaks-over-threshold approach

  • Tong Siu Tung Wong*
  • , Wai Keung Li
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

The problem of heavy tail in regression models is studied. It is proposed that regression models are estimated by a standard procedure and a statistical check for heavy tail using residuals is conducted as a tool for regression diagnostic. Using the peaks-over-threshold approach, the generalized Pareto distribution quantifies the degree of heavy tail by the extreme value index. The number of excesses is determined by means of an innovative threshold model which partitions the random sample into extreme values and ordinary values. The overall decision on a significant heavy tail is justified by both a statistical test and a quantile–quantile plot. The usefulness of the approach includes justification of goodness of fit of the estimated regression model and quantification of the occurrence of extremal events. The proposed methodology is supplemented by surface ozone level in the city center of Leeds.
Original languageEnglish
Pages (from-to)566-576
JournalJournal of Applied Statistics
Volume42
Issue number3
Online published11 Nov 2014
DOIs
Publication statusPublished - 2015
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities

Research Keywords

  • exponential threshold model
  • extreme value index
  • ozone
  • peaks-over-threshold
  • regression diagnostic

Fingerprint

Dive into the research topics of 'Extreme values identification in regression using a peaks-over-threshold approach'. Together they form a unique fingerprint.

Cite this