What do near-optimal learning rate schedules look like?
arXiv:2603.10301v1 Announce Type: new Abstract: A basic unanswered question in neural network training is: what is the best learning rate schedule shape for a given …
Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl
3 views