AI Alignment Strategies from a Risk Perspective: Independent Safety Mechanisms or Shared Failures?
Published in IASEAI'26: International Association for Safe and Ethical AI Conference, 2026
Recommended citation: Leonard Dung, Florian Mai. (2026). "AI Alignment Strategies from a Risk Perspective: Independent Safety Mechanisms or Shared Failures?" IASEAI'26: International Association for Safe and Ethical AI Conference. https://arxiv.org/abs/2510.11235
We analyze overlap in failure modes across alignment techniques to assess the limits of defense-in-depth risk mitigation.
