Serial and parallel implementations of model-based clustering via parsimonious Gaussian mixture models
Citation:
P.D. McNicholas, T.B. Murphy, A.F. McDaid, D. Frost, Serial and parallel implementations of model-based clustering via parsimonious Gaussian mixture models, Computational Statistics & Data Analysis, 2009Download Item:
Abstract:
Model-based clustering using a family of Gaussian mixture models, with parsimonious factor analysis like covariance structure, is described and an efficient algorithm for its implementation is presented. This algorithm uses the alternating expectation-conditional maximization (AECM) variant of the expectation-maximization (EM) algorithm. Two central issues around the implementation of this family of models, namely model selection and convergence criteria, are discussed. These central issues also have implications for other model-based clustering techniques and for the implementation of techniques like the EM algorithm, in general. The Bayesian information criterion (BIC) is used for model selection and Aitken?s acceleration, which is shown to outperform the lack of progress criterion, is used to determine convergence. A brief introduction to parallel computing is then given before the implementation of this algorithm in parallel is facilitated within the master?slave paradigm. A simulation study is then carried out to confirm the effectiveness of this parallelization. The resulting software is applied to two datasets to demonstrate its effectiveness when compared to existing software.
Author's Homepage:
http://people.tcd.ie/dfrostDescription:
PUBLISHED
Author: FROST, DERMOT
Type of material:
Journal ArticleSeries/Report no:
Computational Statistics & Data AnalysisAvailability:
Full text availableKeywords:
Administrative Staff AuthorsDOI:
http://dx.doi.org/10.1016/j.csda.2009.02.011Metadata
Show full item recordLicences: