arXiv 2104.09864

RoFormer: Enhanced Transformer with Rotary Position Embedding

By Jianlin Su, Yu Lu, et al.

Published 2021-04-20

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Position encoding recently has shown effective in the transformer architecture. It enables valuable supervision for dependency modeling between elements at different positions of the sequence. In this paper, we first investigate various methods to integrate positional information into the learning process of transformer-based language models. Then, we propose a novel method named Rotary Position Embedding(RoPE) to e…

View the original paper on arXiv