arXiv 2402.13604
Breaking the HISCO Barrier: Automatic Occupational Standardization with OccCANINE
By Christian Møller Dahl, Torben Johansen, et al.
Published 2024-02-21
Wiki summary
This paper introduces a new tool, OccCANINE, which automatically transforms occupational descriptions into the HISCO classification system. Processing and classifying occupational descriptions by hand is error-prone, tedious, and time-consuming. We fine-tune a pre-existing language model (CANINE) to do this automatically, thereby performing in seconds and minutes what previously took days and weeks. Th…
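To make the idea of feeding raw occupational text to a character-level model such as CANINE more concrete, here is a minimal sketch. It is not the paper's code: the function names `to_codepoints` and `pick_code`, the padding scheme, and the placeholder labels are all assumptions made for illustration. The sketch only shows how a description might be turned into Unicode code points (CANINE works on characters rather than a learned vocabulary) and how a single label could be chosen from hypothetical model scores.

```python
from typing import Dict, List


def to_codepoints(text: str, max_len: int = 16, pad_id: int = 0) -> List[int]:
    """Map a description to a fixed-length list of Unicode code points.

    Hypothetical helper: lowercases, truncates to max_len, then pads with pad_id.
    """
    ids = [ord(ch) for ch in text.strip().lower()[:max_len]]
    return ids + [pad_id] * (max_len - len(ids))


def pick_code(scores: Dict[str, float]) -> str:
    """Return the label with the highest score (scores are assumed model outputs)."""
    return max(scores, key=scores.get)


# Character-level input for a short description.
ids = to_codepoints("Farmer", max_len=8)

# Placeholder labels and scores, standing in for a classifier's output.
example_scores = {"label_a": 0.1, "label_b": 0.7, "label_c": 0.2}
best = pick_code(example_scores)
```

In a real pipeline the integer sequence would be passed to the fine-tuned model, and the labels would be HISCO codes rather than the placeholders used here.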