1. 20 Feb, 2019 5 commits
  2. 18 Feb, 2019 1 commit
  3. 07 Feb, 2019 1 commit
  4. 20 Jan, 2019 2 commits
  5. 09 Jan, 2019 2 commits
  6. 08 Jan, 2019 2 commits
    • Juha Reunanen's avatar
      Apply the #1514 fix even on non-Windows platforms, and change all remaining… · 5c623e3c
      Juha Reunanen authored
      Apply the #1514 fix even on non-Windows platforms, and change all remaining cudaStreamSynchronize calls (#1596)
      
      * Apply the #1514 fix even on non-Windows platforms
      
      * Try to fix #1513 even more by circumventing the remaining cudaStreamSynchronize calls
      
      * Make the fix apply only if CUDA_VERSION == 10000
      
      * Make the fix apply on CUDA 9.2 also
      
      * CHECK_CUDA(cudaStreamSynchronize(stream));
      5c623e3c
    • Chris Mayo's avatar
      Reliably detect AVX2 at runtime on Linux (#1614) · 9ca72b3e
      Chris Mayo authored
      ECX must be set to 0 to read this flag, the easiest way to achieve this
      is to use __cpuid_count with 0 as the second argument.
      9ca72b3e
  7. 07 Jan, 2019 3 commits
  8. 06 Jan, 2019 1 commit
    • Juha Reunanen's avatar
      Add U-net style skip connections to the semantic-segmentation example (#1600) · f685cb42
      Juha Reunanen authored
      * Add concat_prev layer, and U-net example for semantic segmentation
      
      * Allow to supply mini-batch size as command-line parameter
      
      * Decrease default mini-batch size from 30 to 24
      
      * Resize t1, if needed
      
      * Use DenseNet-style blocks instead of residual learning
      
      * Increase default mini-batch size to 50
      
      * Increase default mini-batch size from 50 to 60
      
      * Resize even during the backward step, if needed
      
      * Use resize_bilinear_gradient for the backward step
      
      * Fix function call ambiguity problem
      
      * Clear destination before adding gradient
      
      * Works OK-ish
      
      * Add more U-tags
      
      * Tweak default mini-batch size
      
      * Define a simpler network when using Microsoft Visual C++ compiler; clean up the DenseNet stuff (leaving it for a later PR)
      
      * Decrease default mini-batch size from 24 to 23
      
      * Define separate dnn filename for MSVC++ and not
      
      * Add documentation for the resize_to_prev layer; move the implementation so that it comes after mult_prev
      
      * Fix previous typo
      
      * Minor formatting changes
      
      * Reverse the ordering of levels
      
      * Increase the learning-rate stopping criterion back to 1e-4 (was 1e-8)
      
      * Use more U-tags even on Windows
      
      * Minor formatting
      
      * Latest MSVC 2017 builds fast, so there's no need to limit the depth any longer
      
      * Tweak default mini-batch size again
      
      * Even though latest MSVC can now build the extra layers, it does not mean we should add them!
      
      * Fix naming
      f685cb42
  9. 05 Jan, 2019 1 commit
  10. 11 Dec, 2018 1 commit
    • Facundo Galán's avatar
      Add pointwise_divide function (#1586) · 61a021c9
      Facundo Galán authored
      * Add pointwise_divide operator
      
      * Add tests for pointwise_divide function
      
      * Replace in affine layer pointwise_multiply for division by its equivalent pointwise_divide.
      61a021c9
  11. 06 Dec, 2018 2 commits
  12. 05 Dec, 2018 1 commit
  13. 25 Nov, 2018 2 commits
  14. 21 Nov, 2018 1 commit
  15. 20 Nov, 2018 1 commit
  16. 14 Nov, 2018 1 commit
  17. 13 Nov, 2018 1 commit
  18. 12 Nov, 2018 2 commits
  19. 11 Nov, 2018 2 commits
  20. 09 Nov, 2018 1 commit
  21. 05 Nov, 2018 1 commit
  22. 26 Oct, 2018 1 commit
  23. 23 Oct, 2018 1 commit
  24. 29 Sep, 2018 4 commits