posted on 2011-05-27, 00:00authored byYixin Fang, Junhui Wang
Cluster analysis is the assignment of observations into clusters so that observations in the same cluster are similar in some sense, and many clustering methods have been developed. However, these methods cannot be applied to family data, which possess intrinsic familial structure. To take the familial structure into account, we propose a form of penalized cluster analysis with a tuning parameter controlling its influence. The tuning parameter can be selected based on the concept of clustering stability. The method can also be applied to other cluster data such as panel data. The method is illustrated via simulations and an application to a family study of asthma.
Funding
The asthma dataset was originally collected as part of the Collaborative Study on the Genetics of Asthma sponsored by the National Heart, Lung, and Blood Institute.
History
Publisher Statement
NOTICE: this is the author’s version of a work that was accepted for publication in Computational Statistics and Data Analysis. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Computational Statistics and Data Analysis, [Vol 55, Issue 6, (June 1, 2011)] DOI: 10.1016/j.csda.2011.01.004. The original publication is available at www.elsevier.com.