Lorenza: Enhancing Generalization in Low-Rank Gradient LLM Training via Efficient Zeroth-Order Ad...
Yehonathan Refael, Iftach Arbel, Ofir Lindenbaum, Tom Tirer.
Action editor: Robert Gower.
openreview.net/forum?id=YyA51…
#optimizers #memory #optimizer
2
352