Reference

Improving Generalization Performance by Switching from Adam to SGD

Keskar, N. S., & Socher, R. (2017). Improving generalization performance by switching from Adam to SGD. arXiv preprint arXiv:1712.07628.

Updated 2020-11-16

Tags

Data Science