Made each layer's backward() function take a copy of that layer's output tensor vector so that it can be reused in the backward computation.
Most layers won't use it, but some backward computations can be implemented more efficiently when the output is available.
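
To illustrate why the output can help, here is a minimal sketch using a sigmoid layer, whose derivative can be written in terms of its own output as y * (1 - y). The class name, the backward() argument order, and the use of plain Python lists are assumptions for illustration only, not this project's actual interface.

```python
import math


class Sigmoid:
    """Hypothetical layer; the names and signatures here are illustrative."""

    def forward(self, inputs):
        # y = 1 / (1 + exp(-x)), applied element-wise
        return [1.0 / (1.0 + math.exp(-x)) for x in inputs]

    def backward(self, inputs, outputs, grad_outputs):
        # d(sigmoid)/dx = y * (1 - y), so the saved outputs are enough:
        # there is no need to call exp() again on the inputs.
        return [g * y * (1.0 - y) for g, y in zip(grad_outputs, outputs)]


layer = Sigmoid()
x = [0.0, 2.0, -1.0]
y = layer.forward(x)
grad_x = layer.backward(x, y, [1.0, 1.0, 1.0])
```

A layer such as ReLU could ignore the extra argument, which matches the note that most layers won't use it.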