There’s a gap between what AI promises and how it actually performs in the real world. That gap is most obvious in long-running, stateful work: data migrations, multi-step refactors, or anything else that doesn’t finish in a single clean execution.