Commit 339ac50d authored by Davis King's avatar Davis King

Fixed double counting of mini-batches, for the purposes of solver termination, when multiple GPUs are used.
parent 0f180a68
......@@ -513,8 +513,10 @@ namespace dlib
for (size_t i = 0; i < devices.size(); ++i)
losses[i] = std::async(std::launch::async,[&,i](){ return compute_parameter_gradients(i, next_job, pick_which_run_update); });
// aggregate loss values from all the network computations.
double theloss = 0;
for (auto&& loss : losses)
record_loss(loss.get());
theloss += loss.get();
record_loss(theloss/losses.size());
// Now, if there is more than one active device we need to synchronize the
// gradient updates between devices. So we do that now.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or sign in to comment