Assessing local influence in principal component analysis with application to haematology study data

Wing K. Fung, Hong Gu, Liming Xiang, Kelvin K.W. Yau

    Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

    4 Citations (Scopus)

    Abstract

    In many medical and health studies, high-dimensional data are often encountered. Principal component analysis (PCA) is a commonly used technique to reduce such data to a few components that includes most of the information provided by the original data. However, PCA is known to be very sensitive to some abnormal observations. Therefore, it is essential to assess such sensitivity in PCA. In this paper, the assessments of local influence based on generalized influence function are developed under the case-weights and additive perturbation schemes, along with a discussion of the perturbation scheme and the generalized influence function approach. When perturbing different variables of the data, it is noted that the directions of the largest joint local influence for the eigenvalues are all the same. Moreover, these directions are completely determined by the score values of the observations, to which an approximate cut-off point is given. The proposed methods are applied to analyse a set of haematology study data for illustration. Results add new insights in finding influential observations in the studied data set. Copyright © 2006 John Wiley & Sons, Ltd.
    Original languageEnglish
    Pages (from-to)2730-2744
    JournalStatistics in Medicine
    Volume26
    Issue number13
    DOIs
    Publication statusPublished - 15 Jun 2007

    Research Keywords

    • Influence function
    • Local influence
    • Perturbation
    • Principal component analysis

    Fingerprint

    Dive into the research topics of 'Assessing local influence in principal component analysis with application to haematology study data'. Together they form a unique fingerprint.

    Cite this