Automatically Coding Occupation Titles to a Standard Occupation Classification

Title: Automatically Coding Occupation Titles to a Standard Occupation Classification
Author: Nahoomi, Negin
Department: School of Computer Science
Program: Computer Science
Advisor: Song, Fei; Grewal, Gary
Abstract: Occupation coding is the process of classifying job titles into one or more categories, which are usually organized into a hierarchy. Historically, job titles were coded to standard classifications manually. However, the drawbacks of manual coding have led researchers to develop automatic methods for occupation coding. We compare classic machine learning approaches and deep learning approaches for classifying job titles to the Standard Occupational Classification (SOC). We implement flat and hierarchical models using Naïve Bayes, Maximum Entropy (MaxEnt), Support Vector Machines (SVM), and Convolutional Neural Networks (CNN) to code job titles to SOC. For this purpose, 65,962 SOC-labeled job titles were collected from publicly available sources. These job titles are extremely short, averaging three words per title. Our experimental results show that MaxEnt, SVM, and CNN perform similarly, and all three outperform Naïve Bayes on coding job titles to SOC.
URI: http://hdl.handle.net/10214/14251
Date: 2018-09
Rights: Attribution-NoDerivs 2.5 Canada
Terms of Use: All items in the Atrium are protected by copyright with all rights reserved unless otherwise indicated.


Files in this item

File: Nahoomi_Negin_201809_Msc.pdf
Size: 1.587Mb
Format: PDF

Except where otherwise noted, this item's license is described as Attribution-NoDerivs 2.5 Canada.