Wrote replacements for set_tensor() and scale_tensor() since the previous
versions were calling into cuDNN, however, the cuDNN functions for doing this are horrifically slow, well over 100x slower than they should be, which is surprising since these functions are so trivial.
Showing
Please
register
or
sign in
to comment