H

Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl

Articles by Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl

Academic · 1 min

What do near-optimal learning rate schedules look like?

arXiv:2603.10301v1 Announce Type: new Abstract: A basic unanswered question in neural network training is: what is the best learning rate schedule shape for a given …

Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl
3 views