arXiv 2511.20689
Morality in AI. A plea to embed morality in LLM architectures and frameworks
By Gunter Bombaerts, Bram Delisse, et al.
Published 2025-11-21
Citation lineage
Review the prior work and downstream research connected to this paper.
Large language models (LLMs) increasingly mediate human decision-making and behaviour. Ensuring LLM processing of moral meaning therefore has become a critical challenge. Current approaches rely predominantly on bottom-up methods such as fine-tuning and reinforcement learning from human feedback. We propose a fundamentally different approach: embedding moral meaning processing directly into the architectural mechani…