Hidden layer sizes, number of layers - Learning rate, weight decay - Dropout rate, batch size - Activation function