Made the thread local variables that hold the cudnn and cublas context objects
not destruct and recreate themselves when you switch devices. Instead, they keep a table of context objects, for each thread and device, reusing as necessary. This prevents churn in the context objects when you are switching back and forth between devices inside a single thread.
Showing
Please
register
or
sign in
to comment