Dimension Reduction and Clustering using Non-Elliptical Mixtures

dc.contributor.advisorMcNicholas, Paul D.
dc.contributor.authorMorris, Katherine
dc.date.accessioned2014-03-31T15:26:52Z
dc.date.available2014-03-31T15:26:52Z
dc.date.copyright2014-01
dc.date.created2014-01-17
dc.date.issued2014-03-31
dc.degree.departmentDepartment of Mathematics and Statisticsen_US
dc.degree.grantorUniversity of Guelphen_US
dc.degree.nameDoctor of Philosophyen_US
dc.degree.programmeMathematics and Statisticsen_US
dc.description.abstractFinite mixtures of non-elliptical distributions (specifically the shifted asymmetric Laplace and the generalized hyperbolic) are considered to introduce dimension reduction methods for model-based clustering. The approaches are based on existing work on reducing dimensionality in the case of finite Gaussian mixtures. The methods rely on identifying a reduced subspace of the data by considering the extent to which group means and group covariances vary. This subspace contains linear combinations of the original data, which are ordered by importance via the associated eigenvalues. Observations can be projected onto the subspace and the resulting set of variables captures most of the clustering structure available in the data. The algorithms are illustrated using simulated and real data. Furthermore, methods of detecting outliers are developed for model-based clustering using mixtures of contaminated shifted asymmetric Laplace distributions, and mixtures of contaminated skew-normal distributions. The approaches are based on existing work for outlier detection in the context of contaminated Gaussian mixtures. The main idea is to introduce a contamination factor which increases the dispersion of the fitted distribution by altering the skewness and covariance parameters. An expectation-conditional maximization algorithm is employed to obtain maximum likelihood estimates for the parameters in the model. Thus each observation is given a posterior probability of belonging to a particular group, and of being an outlier or not. The performance of the methods is tested on simulated and real data.en_US
dc.identifier.urihttp://hdl.handle.net/10214/7877
dc.language.isoenen_US
dc.publisherUniversity of Guelphen_US
dc.rights.licenseAll items in the Atrium are protected by copyright with all rights reserved unless otherwise indicated.
dc.subjectdimension reductionen_US
dc.subjectmixture modelsen_US
dc.subjectmodel-based clusteringen_US
dc.subjectgeneralized hyperbolicen_US
dc.subjectshifted asymmetric Laplaceen_US
dc.subjectskew-normalen_US
dc.subjectoutlier detectionen_US
dc.titleDimension Reduction and Clustering using Non-Elliptical Mixturesen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Morris_Katherine_201401_Phd.pdf
Size:
2.35 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: