arXiv 2305.16307
IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages
By Jay Gala, Pranjal A. Chitale, et al.
Published 2023-05-25
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
India has a rich linguistic landscape with languages from 4 major language families spoken by over a billion people. 22 of these languages are listed in the Constitution of India (referred to as scheduled languages) are the focus of this work. Given the linguistic diversity, high-quality and accessible Machine Translation (MT) systems are essential in a country like India. Prior to this work, there was (i) no parall…