Training on the Edge of Stability is Caused by Layerwise Jacobian Alignment

Lowell and Kastner, 2024. "Training on the Edge of Stability is Caused by Layerwise Jacobian Alignment."

Although the code for this paper has not been released, a reimplementation of the exponential Euler solver can be found at:

https://github.com/TheoremEngine/stable-solvers

Back to papers