arXiv 2511.20689
Morality in AI. A plea to embed morality in LLM architectures and frameworks
By Gunter Bombaerts, Bram Delisse, et al.
Published 2025-11-21
Large language models (LLMs) increasingly mediate human decision-making and behaviour. Ensuring that LLMs process moral meaning has therefore become a critical challenge. Current approaches rely predominantly on bottom-up methods such as fine-tuning and reinforcement learning from human feedback. We propose a fundamentally different approach: embedding moral meaning processing directly into the architectural mechani…