Liyuan Liu (Lucas)
Liyuan Liu
Home
Goal
Publications
Selected
Full List
Honors
Highlights
All Honors
Activities
Services
Experience
Contact
Blog
CV
Initialization
Understanding the Difficulty of Training Transformers
Transformers have been proved effective for many deep learning tasks. Training transformers, however, requires non-trivial efforts regarding carefully designing learning rate schedulers and cutting-edge optimizers (the standard SGD fails to train …
Cite
×