Main content

Large-scale clustering of antigen receptor gene sequence data using hyper-dimensional point packing

Show simple item record

dc.contributor.advisor Keller, Stefan
dc.contributor.author Chang, Haiyang
dc.date.accessioned 2018-08-20T12:46:58Z
dc.date.available 2018-08-20T12:46:58Z
dc.date.copyright 2018-07-20
dc.date.created 2018-08-10
dc.date.issued 2018-08-20
dc.identifier.uri http://hdl.handle.net/10214/14079
dc.description.abstract Lymphocytes generate abundant antigen receptor (AR) genes to recognize an almost infinite number of epitopes. One challenge is to group AR sequences based on the recognition of a common epitope. Traditional clustering methods are based on hierarchical clustering, which comes at a significant computational cost due to pairwise genetic distance comparisons. In this thesis, a point packing strategy was applied to incrementally break down the data into subsets, which limits pairwise sequence comparison to the final cluster level. Sub-setting was achieved by picking maximally spaced anchor sequences from a dataset, iteratively, and assigning the remaining sequences to the closest anchor. This results in an inverted tree with anchor sequences as nodes and a descending anchor distance gradient for each layer. In addition, new sequences can be added to a clustered dataset by comparison with existing anchor nodes to achieve quick positioning and substantially reduce the computational burden. en_US
dc.description.sponsorship NSERC Discovery: 301-021000-401122-000000 GPA: 300-021000-071886-000000 en_US
dc.language.iso en en_US
dc.rights Attribution-NonCommercial-NoDerivs 2.5 Canada *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ *
dc.subject clustering en_US
dc.subject antigen receptor en_US
dc.subject anchor en_US
dc.subject bioinformatics en_US
dc.title Large-scale clustering of antigen receptor gene sequence data using hyper-dimensional point packing en_US
dc.type Thesis en_US
dc.degree.programme Bioinformatics en_US
dc.degree.name Master of Science en_US
dc.degree.department Department of Pathobiology en_US
dc.rights.license All items in the Atrium are protected by copyright with all rights reserved unless otherwise indicated.


Files in this item

Files Size Format View Description
Chang_Haiyang_201808_Msc.pdf 3.661Mb PDF View/Open Main article

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 2.5 Canada Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 2.5 Canada