Commit 776678a3 authored by Davis King's avatar Davis King

Fixed spelling error in comment

parent 2a0b4619
...@@ -261,7 +261,7 @@ namespace dlib ...@@ -261,7 +261,7 @@ namespace dlib
CUDA's y thread index (e.g. threadIdx.y) instead of the x index. CUDA's y thread index (e.g. threadIdx.y) instead of the x index.
Therefore, if you launch a cuda kernel with a statement like: Therefore, if you launch a cuda kernel with a statement like:
dim3 blocks(10,1); dim3 blocks(10,1);
dim3 threads(32,32); // You need to have x any not equal to 1 to get parallelism over both loops. dim3 threads(32,32); // You need to have x and y not equal to 1 to get parallelism over both loops.
add_arrays<<<blocks,threads>>>(a,b,out,nr,nc); add_arrays<<<blocks,threads>>>(a,b,out,nr,nc);
You can perform a nested 2D parallel for loop rather than doing just a You can perform a nested 2D parallel for loop rather than doing just a
1D for loop. 1D for loop.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment