Relation

List of Common Hyperparameters in Deep Learning

Hyperparameters related to neural network structure:

  • Number of hidden layers (Depth)
  • Number of hidden units (Width)
  • Dropout method
  • Activation function for each layer
  • Weights Initialization

Hyperparameters related to training algorithm:

  • Learning rate α\alpha
  • Momentum parameter β0.9\beta \sim 0.9
  • β10.9,β20.999,ϵ108\beta_1 \sim 0.9, \beta_2 \sim 0.999, \epsilon \sim 10^{-8}
  • Number of Gradient descent iterations
  • Mini-batch size
  • Optimizer algorithm
  • Learning rate decay
  • Regularization rate λ\lambda

0

3

Updated 2021-10-23

Tags

Data Science