Abstract
The problem of heavy tail in regression models is studied. It is proposed that regression models are estimated by a standard procedure and a statistical check for heavy tail using residuals is conducted as a tool for regression diagnostic. Using the peaks-over-threshold approach, the generalized Pareto distribution quantifies the degree of heavy tail by the extreme value index. The number of excesses is determined by means of an innovative threshold model which partitions the random sample into extreme values and ordinary values. The overall decision on a significant heavy tail is justified by both a statistical test and a quantile–quantile plot. The usefulness of the approach includes justification of goodness of fit of the estimated regression model and quantification of the occurrence of extremal events. The proposed methodology is supplemented by surface ozone level in the city center of Leeds.
| Original language | English |
|---|---|
| Pages (from-to) | 566-576 |
| Journal | Journal of Applied Statistics |
| Volume | 42 |
| Issue number | 3 |
| Online published | 11 Nov 2014 |
| DOIs | |
| Publication status | Published - 2015 |
| Externally published | Yes |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 11 Sustainable Cities and Communities
Research Keywords
- exponential threshold model
- extreme value index
- ozone
- peaks-over-threshold
- regression diagnostic
Fingerprint
Dive into the research topics of 'Extreme values identification in regression using a peaks-over-threshold approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver