arXiv 2311.06720

Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer

By Bowen Tan, Yun Zhu, et al.

Published 2023-11-12

Large language models (LLMs) such as T0, FLAN, and OPT-IML excel at multi-tasking under a unified instruction-following paradigm, where they also generalize remarkably well to unseen tasks. Despite their impressive performance, these LLMs, with sizes ranging from several billion to hundreds of billions of parameters, demand substantial computational resources, making their training and inference ex…