arXiv 1905.13497
Attention Is (not) All You Need for Commonsense Reasoning
By Tassilo Klein and Moin Nabi
Published 2019-05-31
Discussion
Read the public discussion and references gathered around this paper.
The recently introduced BERT model exhibits strong performance on several language understanding benchmarks. In this paper, we describe a simple re-implementation of BERT for commonsense reasoning. We show that the attentions produced by BERT can be directly utilized for tasks such as the Pronoun Disambiguation Problem and Winograd Schema Challenge. Our proposed attention-guided commonsense reasoning method is conce…