arXiv 2402.13604
Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE
By Christian Møller Dahl, Torben Johansen, et al.
Published 2024-02-21
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
This paper introduces a new tool, OccCANINE, to automatically transform occupational descriptions into the HISCO classification system. The manual work involved in processing and classifying occupational descriptions is error-prone, tedious, and time-consuming. We finetune a preexisting language model (CANINE) to do this automatically, thereby performing in seconds and minutes what previously took days and weeks. Th…