arXiv 2508.01710

CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications

By Raviraj Joshi, Rakesh Paul, et al.

Published 2025-08-03

Citation lineage

Review the prior work and downstream research connected to this paper.

The increasing use of Large Language Models (LLMs) in agentic applications highlights the need for robust safety guard models. While content safety in English is well-studied, non-English languages lack similar advancements due to the high cost of collecting culturally aligned labeled datasets. We present CultureGuard, a novel solution for curating culturally aligned, high-quality safety datasets across multiple lan…

View the original paper on arXiv