... | @@ -45,7 +45,7 @@ Example timeline once the data has loaded: |
... | @@ -45,7 +45,7 @@ Example timeline once the data has loaded: |
|
|
|
|
|
### PC Sampling
|
|
### PC Sampling
|
|
|
|
|
|
[PC Sampling](http://docs.nvidia.com/cuda/profiler-users-guide/#pc-sampling) shows where our code spends most of its time, on a line-by-line basis. Here's a [quick video](/uploads/8ab48e54f89e07520ed6679fa4558404/nvvp.mp4) that shows how to get to the PC Sampling area in ```nvvp```.
|
|
[PC Sampling](http://docs.nvidia.com/cuda/profiler-users-guide/#pc-sampling) shows where our code spends most of its time, on a line-by-line basis. Here's a [quick video](/uploads/8ab48e54f89e07520ed6679fa4558404/nvvp.mp4) that shows how to get to the PC Sampling area in ```nvvp```. You will have to link your source code in ```nvvp```, so make sure that you have a local copy and that you select the correct file. If you select the wrong file, click the button that looks like a pencil (![unlinker](/uploads/b09d045799e0cd1ac4a86218d6c79449/Screen_Shot_2017-09-07_at_09.30.53.png)) to unlink and re-link your code.
|
|
|
|
|
|
At the moment, most of the time in our code is spent waiting for data to arrive from global memory into registers, which we hope to improve with caching.
|
|
At the moment, most of the time in our code is spent waiting for data to arrive from global memory into registers, which we hope to improve with caching.
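As a rough illustration of what "caching" could look like here, the sketch below stages global-memory reads in shared memory so that each value is fetched from global memory once per block and then reused by neighbouring threads. This page does not say which caching strategy is planned, so the choice of shared memory is an assumption, and the kernel `stencil_shared`, the helper `run_stencil_check`, and the constants `BLOCK` and `RADIUS` are hypothetical names used only for this example (a simple 3-point sum), not our actual code.

```cuda
#include <vector>
#include <cuda_runtime.h>

constexpr int BLOCK  = 256;  // threads per block (hypothetical choice)
constexpr int RADIUS = 1;    // neighbours read on each side

// Hypothetical kernel: out[i] = in[i-1] + in[i] + in[i+1], with out-of-range
// neighbours treated as 0. Must be launched with blockDim.x == BLOCK.
__global__ void stencil_shared(const float* in, float* out, int n)
{
    // Block-local cache: BLOCK elements plus a halo of RADIUS on each side.
    __shared__ float tile[BLOCK + 2 * RADIUS];

    int g = blockIdx.x * blockDim.x + threadIdx.x;  // global index
    int l = threadIdx.x + RADIUS;                   // index inside the tile

    // Every thread reads its own element from global memory exactly once.
    tile[l] = (g < n) ? in[g] : 0.0f;

    // The first RADIUS threads of the block also fetch the halo elements.
    if (threadIdx.x < RADIUS) {
        int left  = g - RADIUS;
        int right = g + BLOCK;
        tile[l - RADIUS] = (left >= 0 && left < n) ? in[left] : 0.0f;
        tile[l + BLOCK]  = (right < n) ? in[right] : 0.0f;
    }
    __syncthreads();  // the tile must be fully populated before anyone reads it

    // Neighbour reads now hit shared memory instead of global memory.
    if (g < n) {
        out[g] = tile[l - 1] + tile[l] + tile[l + 1];
    }
}

// Host-side check: with an all-ones input, interior outputs are 3 and the
// two end points are 2. Returns false on any CUDA error or mismatch.
bool run_stencil_check()
{
    const int n = 1000;
    const size_t bytes = n * sizeof(float);
    std::vector<float> h_in(n, 1.0f), h_out(n, 0.0f);

    float* d_in = nullptr;
    float* d_out = nullptr;
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_in); return false; }

    cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice);

    int blocks = (n + BLOCK - 1) / BLOCK;
    stencil_shared<<<blocks, BLOCK>>>(d_in, d_out, n);

    // The blocking copy also surfaces errors from the kernel launch.
    cudaError_t err = cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    if (err != cudaSuccess) return false;

    for (int i = 0; i < n; ++i) {
        float expected = (i == 0 || i == n - 1) ? 2.0f : 3.0f;
        if (h_out[i] != expected) return false;
    }
    return true;
}
```

To try it, call `run_stencil_check()` from a `main` function and compile with `nvcc`. Adding the `-lineinfo` flag should let the profiler map samples back to source lines. Whether this pattern helps our kernels depends on how much data reuse they have, so it is worth re-running PC Sampling afterwards to see whether the memory stalls actually drop.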
|
|
|
|
|
... | | ... | |