arXiv 1912.13318

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

By Yiheng Xu, Minghao Li, et al.

Published 2019-12-31

Wiki summary

Pre-training techniques have proven successful across a variety of NLP tasks in recent years. Despite the widespread use of pre-trained models in NLP applications, they focus almost exclusively on text-level manipulation and neglect the layout and style information that is vital for document image understanding. In this paper, we propose LayoutLM to jointly model interactions between text and layout inf…
