Fixed a bug in visit_layer_parameter_gradients() and visit_layer_parameters()
caused by num_computational_layers being wrong when tax layers were placed as the first layer. These visit functions being wrong also caused multi-GPU support to not work on such networks.
Showing
Please
register
or
sign in
to comment