I have this written down in a number of notebooks but I’m leaving it here for two reasons:
- It just took me twenty minutes to find the right notebook.
- It’s Hack The Hub in Belfast and it’s all about Machine Learning and AI.
How Many Nodes In A Hidden Layer?
I want to get a rough idea of the number of nodes to use in a hidden layer in my neural network. Too few or too many and this can have an impact on the accuracy of your training model. You’ll see the outputs of your training accuracy during evaluation (accuracy and F1 scores).
There is a common equation available to give us a rough number.
A scaling factor multiplied by the total number of input and output nodes, divided by the number of samples in the training.
In Clojure it looks like this:
user> (defn node-calc [inputs outputs sample-size scaling] (double (/ sample-size (* scaling (+ inputs outputs)))))
The scaling factor is just an arbitrary number between 2 and 10. It’s worth mapping through the range to get a feel for the scores.
Let’s Build a Case
My neural network has 1 input node and ten output nodes (ten possible prediction results), to train I’ve got 474 instances of input data to train. I’m going to map the scaling factor from 2 to 10 so I can see the range of node results.
user> (clojure.pprint/pprint (map (fn [s] (node-calc 1 11 474 s)) (range 2 11))) (19.75 13.16666666666667 9.875 7.9 6.583333333333333 5.642857142857143 4.9375 4.388888888888889 3.95)
Guess how many times I’d run the model training and evaluation? I’d test all the rounded up/down results and see how the F1 score looks.