Main content

Fitting Generalized Zero-Inflated Poisson Regression Mixture Models to Bacteria Microbiome Data

Show full item record

Title: Fitting Generalized Zero-Inflated Poisson Regression Mixture Models to Bacteria Microbiome Data
Author: Chen, Siyu
Department: Department of Mathematics and Statistics
Program: Mathematics and Statistics
Advisor: Feng, Zeny
Abstract: Gut microbial dysbiosis contributes to the risk of colorectal cancer, thus it is important to study the gut mucosal microbiome. Gut bacteria microbiome data has the features of excess zeros and overdispersion that restrict the use of fitting traditional Poisson regression models to this kind of count data. We propose the use of the generalized zero-inflated Poisson (GZIP) regression mixture model for analyzing such data. When fitting a mixture model, we need to specify the number of components in a given population. However, the number of components is unknown. In this thesis, the Bayesian information criterion (BIC) is used to identify a preferred model with a pre-specified number of components. The EM algorithm is used to estimate parameters and the performance of the models is assessed by simulation studies. The GZIP mixture model is applied to gut bacteria microbiome data from a colorectal cancer study. We only consider the carcinoma and healthy groups as a health state covariate and select the best fitted GZIP model to each bacteria genus from models of two, three, or four components. Some special cases where the proposed methods failed to be applied are also discussed.
URI: http://hdl.handle.net/10214/12612
Date: 2018-04
Terms of Use: All items in the Atrium are protected by copyright with all rights reserved unless otherwise indicated.


Files in this item

Files Size Format View
Chen_Siyu_201804_Msc.pdf 583.0Kb PDF View/Open

This item appears in the following Collection(s)

Show full item record