📑 arXiv 3d ago
RL-STPA: Adapting System-Theoretic Hazard Analysis for Safety-Critical Reinforcement Learning
RL-STPA adapts System-Theoretic Process Analysis for reinforcement learning safety through hierarchical subtask decomposition, coverage-guided perturbation testing, and iterative hazard feedback loops. Addresses distributional shift and emergent behaviors unique to neural RL policies in safety-critical deployments.