arXiv 2311.06720

Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer

By Bowen Tan, Yun Zhu, et al.

Published 2023-11-12

Large language models (LLMs) such as T0, FLAN, and OPT-IML excel at multi-tasking under a unified instruction-following paradigm, where they also exhibit remarkable generalization abilities to unseen tasks. Despite their impressive performance, these LLMs, with sizes ranging from several billion to hundreds of billions of parameters, demand substantial computational resources, making their training and inference ex…
