Main content

Automatically Coding Occupation Titles to a Standard Occupation Classification

Show simple item record

dc.contributor.advisor Song, Fei
dc.contributor.advisor Grewal, Gary
dc.contributor.author Nahoomi, Negin
dc.date.accessioned 2018-09-05T18:42:47Z
dc.date.available 2018-09-05T18:42:47Z
dc.date.copyright 2018-09
dc.date.created 2018-08-31
dc.date.issued 2018-09-05
dc.identifier.uri http://hdl.handle.net/10214/14251
dc.description.abstract Occupation Coding is the process of classifying job titles into one or multiple categories that are usually organized into a hierarchy. Historically, the task of classifying job titles to standard classifications was done manually. However, the drawbacks of manual coding have led researchers to develop automatic methods for occupation coding. We compare the classic machine learning approaches and the deep learning approaches on classifying job titles to Standard Occupational Classification (SOC). We implement flat and hierarchical models using Naïve Bayes, Maximum Entropy (MaxEnt), Support Vector Machines (SVM), and Convolutional Neural Networks (CNN) to code job titles to SOC. For this purpose, 65,962 SOC labeled job titles are collected from publicly available sources. These job titles are extremely short with an average of three words per job title. Our experimental results show that MaxEnt, SVM, and CNN perform similarly and are better than Naïve Bayes on coding job titles to SOC. en_US
dc.language.iso en en_US
dc.rights Attribution-NoDerivs 2.5 Canada *
dc.rights.uri http://creativecommons.org/licenses/by-nd/2.5/ca/ *
dc.subject automatic occupation coding en_US
dc.subject multi-label classification en_US
dc.subject hierarchical classification en_US
dc.subject short text classification en_US
dc.subject machine learning en_US
dc.subject deep learning en_US
dc.subject convolutional neural network en_US
dc.subject svm en_US
dc.subject maximum entropy en_US
dc.subject naive bayes en_US
dc.title Automatically Coding Occupation Titles to a Standard Occupation Classification en_US
dc.type Thesis en_US
dc.degree.programme Computer Science en_US
dc.degree.name Master of Science en_US
dc.degree.department School of Computer Science en_US


Files in this item

Files Size Format View
Nahoomi_Negin_201809_Msc.pdf 1.587Mb PDF View/Open

This item appears in the following Collection(s)

Show simple item record

Attribution-NoDerivs 2.5 Canada Except where otherwise noted, this item's license is described as Attribution-NoDerivs 2.5 Canada