arXiv 2402.13604

Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE

By Christian Møller Dahl, Torben Johansen, et al.

Published 2024-02-21

This paper introduces a new tool, OccCANINE, to automatically transform occupational descriptions into the HISCO classification system. The manual work involved in processing and classifying occupational descriptions is error-prone, tedious, and time-consuming. We fine-tune a preexisting language model (CANINE) to do this automatically, thereby performing in seconds and minutes what previously took days and weeks. Th…
