arXiv 2104.09864
RoFormer: Enhanced Transformer with Rotary Position Embedding
By Jianlin Su, Yu Lu, et al.
Published 2021-04-20
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Position encoding recently has shown effective in the transformer architecture. It enables valuable supervision for dependency modeling between elements at different positions of the sequence. In this paper, we first investigate various methods to integrate positional information into the learning process of transformer-based language models. Then, we propose a novel method named Rotary Position Embedding(RoPE) to eā¦