Main content

Improved Nefclass For Datasets With Skewed Feature Values

Show simple item record

dc.contributor.advisor Hamilton-Wright, Andrew
dc.contributor.author Yousefi, Jamileh
dc.date.accessioned 2018-04-19T18:44:38Z
dc.date.available 2018-04-19T18:44:38Z
dc.date.copyright 2018-03
dc.date.created 2018-03-26
dc.date.issued 2018-04-19
dc.identifier.uri http://hdl.handle.net/10214/12610
dc.description.abstract Most machine learning algorithms perform poorly on datasets with skewed feature values distribution. Skewed feature values are commonly observed in biological and medical datasets. This poses a challenge for the classification of medical data. Neuro-fuzzy systems are common machine learning approaches in the medical domain because of their ability to learn fuzzy rules from training data and represent the rules in an understandable way. Therefore, addressing skewness in neuro-fuzzy systems is a topic of interest because of their applicability in the medical domain. In this thesis, the NEFCLASS neuro-fuzzy classifier is extended to provide improved classification accuracy over the original NEFCLASS classifier when trained on skewed data. In order to improve accuracy, we used two methods. Firstly, we used two alternative discretization methods. Secondly, we devised several asymmetric linguistic hedges. The accuracy-transparency trade-off is also one of the most notable challenges when applying machine learning tools in the medical domain. Therefore, the second problem addressed is improving the transparency of NEFCLASS without significant accuracy deterioration. We have devised a statistical rule pruning algorithm which uses adjusted residuals to reduce the number of rules, thus improving transparency. Moreover, a hybrid approach combining the above approaches is proposed. The algorithms have been evaluated on synthetic F-Distributed and Circular-Uniform Distributed datasets. Additionally, they have been assessed using real-world electromyography and Wisconsin Diagnostic Breast Cancer datasets, which are known to have highly skewed feature values. We evaluated the accuracy of the classifiers using misclassification percentages, and the transparency of the rule-based classifiers using the number of rules. Both independently and in combination, our three approaches provide a considerable improvement in classification accuracy and transparency on skewed data. This research can contribute to an improvement in decision-making in healthcare or any other area where a significant fraction of the domain data has highly skewed distributions of feature values. In particular, our strategy has led to greater diagnostic accuracy to distinguish neuromuscular diseases using electromyography data. This methodology is not limited to NEFCLASS and neuro-fuzzy systems because our approaches are not directly tied to the structure of NEFCLASS. Hence, we expect that our techniques can be applied to any application in which fuzzy logic is used. Furthermore, our rule pruning approach has the potential to be used in other fuzzy and non-fuzzy classifiers. en_US
dc.description.sponsorship NSERC, the National Sciences and Engineering Research Council of Canada en_US
dc.language.iso en en_US
dc.rights Attribution-NonCommercial-NoDerivs 2.5 Canada *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ *
dc.subject Nefclass en_US
dc.subject skewness en_US
dc.subject Neuro-fuzzy systems en_US
dc.subject machine learning en_US
dc.subject asymmetric linguistic hedges en_US
dc.subject rule pruning en_US
dc.subject adjusted residual en_US
dc.subject discretization method en_US
dc.subject EMG en_US
dc.subject medical domain en_US
dc.subject MME en_US
dc.subject CAIM en_US
dc.subject feature value skewness en_US
dc.title Improved Nefclass For Datasets With Skewed Feature Values en_US
dc.type Thesis en_US
dc.degree.programme Computer Science en_US
dc.degree.name Doctor of Philosophy en_US
dc.degree.department School of Computer Science en_US


Files in this item

Files Size Format View
Jamileh_Yousefi_201804_PhD.pdf 14.52Mb PDF View/Open

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 2.5 Canada Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 2.5 Canada