Why doesn't it apply to machines? If I have two equally powerful computers
that are built with parallel processing neural network architectures, both
initialized with the same weights and trained equally, what difference does
it make if one is implemented in meat and the other in silicon? Or even in
a simulation on a super fast scalar processor?