Study Tests RL for Broad, Persistent Alignment Beyond Training Distribution
Why does this matter? As AI moves into more diverse, high‑stakes environments, the promise of alignment hinges on whether models stay on course when they encounter tasks they never saw in training.