Skip to main navigation Skip to search Skip to main content

Penalized cluster analysis with applications to family data

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

The goal of cluster analysis is to assign observations into clusters so that observations in the same cluster are similar in some sense. Many clustering methods have been developed in the statistical literature, but these methods are inappropriate for clustering family data, which possess intrinsic familial structure. To incorporate the familial structure, we propose a form of penalized cluster analysis with a tuning parameter controlling the tradeoff between the observation dissimilarity and the familial structure. The tuning parameter is selected based on the concept of clustering stability. The effectiveness of the method is illustrated via simulations and an application to a family study of asthma. © 2011 Elsevier B.V. All rights reserved.
Original languageEnglish
Pages (from-to)2128-2136
JournalComputational Statistics and Data Analysis
Volume55
Issue number6
DOIs
Publication statusPublished - 1 Jun 2011
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Research Keywords

  • Consistency
  • Cross-validation
  • K-means
  • Kinship
  • Stability

Fingerprint

Dive into the research topics of 'Penalized cluster analysis with applications to family data'. Together they form a unique fingerprint.

Cite this