Training on the Edge of Stability is Caused by Layerwise Jacobian Alignment

Lowell and Kastner, 2024. “Training on the Edge of Stability is Caused by Layerwise Jacobian Alignment.”