arXiv 2311.06720
Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
By Bowen Tan, Yun Zhu, et al.
Published 2023-11-12
Large language models (LLMs) such as T0, FLAN, and OPT-IML excel in multi-tasking under a unified instruction-following paradigm, where they also exhibit remarkable generalization abilities to unseen tasks. Despite their impressive performance, these LLMs, with sizes ranging from several billion to hundreds of billions of parameters, demand substantial computational resources, making their training and inference ex…