Novel Pipeline for Large-Scale Comparative Population Genetics

Thumbnail Image
Majoros, Samantha
Journal Title
Journal ISSN
Volume Title
University of Guelph

This study determined population genetic structure measures, compared these measures across species with different biological traits; and created efficient, reproducible, reusable programming modules that are publicly available for future research. Cytochrome C Oxidase subunit I gene sequences from Diptera (true fly) species from Greenland and Canada were used as a case study and proof of concept. I hypothesized that population genetic structure measures will be influenced by the biological traits of organisms. Data were pulled from public databases, as well as taxon-specific literature. The R pipeline includes fifteen modules that can be adapted and applied to a diverse set of animal groups, geographic regions, genes, and traits. Habitat, larval diet, geographical distance, latitude, and longitude were all significantly related to population genetic structure in Diptera. Overall, this study has created efficient, reusable bioinformatics modules, as well as provided insight into the factors affecting population genetic structure in Northern fly communities.

Bioinformatics, Population Genetics, R Programming