Main content

A New Reclassification Method for Highly Uncertain Microarray Data in Allergy Gene Prediction

Show full item record

Title: A New Reclassification Method for Highly Uncertain Microarray Data in Allergy Gene Prediction
Author: Paul, Jasmin
Department: Department of Computing and Information Science
Program: Computer Science
Advisor: Chiu, David
Abstract: The analysis of microarray data is a challenging task because of the large dimensionality and small sample size involved. Although a few methods are available to address the problem of small sample size, they are not sufficiently successful in dealing with microarray data from extremely small (~<20) sample sizes. We propose a method to incorporate information from diverse sources to analyze the microarray data so as to improve the predictability of significant genes. A transformed data set, including statistical parameters, literature mining and gene ontology data, is evaluated. We performed classification experiments to identify potential allergy-related genes. Feature selection is used to identify the effect of features on classifier behaviour. An exploratory and domain knowledge analysis was performed on noisy real-life allergy data, and a subset of genes was selected as positive and negative class. A new set of transformed variables, depending on the mean and standard deviation statistics of the data distribution and other data sources, was identified. Significant allergy- and immune-related genes from the microarray data were selected. Experiments showed that classification predictability of significant genes can be improved. Important features from the transformed variable set were also identified.
Date: 2012-03
Terms of Use: All items in the Atrium are protected by copyright with all rights reserved unless otherwise indicated.

Files in this item

Files Size Format View
jpaul-0582048.pdf 1.590Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record