arXiv 2511.20689

Morality in AI. A plea to embed morality in LLM architectures and frameworks

By Gunter Bombaerts, Bram Delisse, et al.

Published 2025-11-21

Discussion

Read the public discussion and references gathered around this paper.

Large language models (LLMs) increasingly mediate human decision-making and behaviour. Ensuring LLM processing of moral meaning therefore has become a critical challenge. Current approaches rely predominantly on bottom-up methods such as fine-tuning and reinforcement learning from human feedback. We propose a fundamentally different approach: embedding moral meaning processing directly into the architectural mechani…

View the original paper on arXiv