- Detected 1 CUDA Capable device(s)
- -----------------------------------
- Device 0> GeForce GTX 970
- -----------------------------------
- CUDA Driver Version / Runtime Version 8.0 / 8.0
- Cuda Capability Major/Minor version number 5.2
- Total amount of global memory 4.00 GBytes (4294967295 bytes)
- GPU clock rate 1241 MHz (1.24 GHz)
- Memory Clock rate 3523 MHz
- Memory Bus Width 256-bit
- L2 Cache Size 1835008 bytes
- Max Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536,65536), 3D=(4096,4096,4096)
- Max Layered Texture Size (x,y,z) 1D=(16384) x 2048, 2D=(16384,16384) x 2048
- Total amount of constant memory 65536 bytes
- Total amount of shared memory per block 49152 bytes
- Total number of registers available per block 65536
- Warp size 32
- Maximum number of threads per multiprocessor 2048
- Maximum number of threads per block 1024
- Maximum sizes of each dimension of a block 1024 x 1024 x 64
- Maximum sizes of each dimension of a grid 2147483647 x 65535 x 65535
- Maximum memory pitch 2147483647 bytes
- -----------------------------------
- Using Device 0> GeForce GTX 970
- -----------------------------------------------------
- Starting arrayAdd
- Array Dimension 2097152
- -----------------------------------------------------
- Timer - initialData Array: 131522us
- Timer - sumArrayOnHost: 1939us
- --- Function sumArrayOnGPU
- <<<(65536,1,1), (32,1,1)>>> Elapsed Time:211us - ERROR
- <<<(32768,1,1), (64,1,1)>>> Elapsed Time:452us - OK
- <<<(16384,1,1), (128,1,1)>>> Elapsed Time:424us - OK
- <<<(8192,1,1), (256,1,1)>>> Elapsed Time:426us - OK
- <<<(4096,1,1), (512,1,1)>>> Elapsed Time:427us - OK
- <<<(2048,1,1), (1024,1,1)>>> Elapsed Time:440us - OK
- -----------------------------------------------------
- Starting matrixAdd
- Matrix Dimension 8192x8192
- -----------------------------------------------------
- Timer - initialData Matrix: 3938445us
- Timer - sumMatrixOnHost: 56526us
- --- Function sumMatrixOnGPU2D
- <<<(256,256,1), (32,32,1)>>> Elapsed Time:5807us - OK
- <<<(256,512,1), (32,16,1)>>> Elapsed Time:5769us - OK
- <<<(512,512,1), (16,16,1)>>> Elapsed Time:5837us - OK
- --- Function sumMatrixOnGPU1D
- <<<(256,1,1), (32,1,1)>>> Elapsed Time:6562us - OK
- <<<(128,1,1), (64,1,1)>>> Elapsed Time:6495us - OK
- <<<(64,1,1), (128,1,1)>>> Elapsed Time:6572us - OK
- <<<(32,1,1), (256,1,1)>>> Elapsed Time:6380us - OK
- <<<(16,1,1), (512,1,1)>>> Elapsed Time:6535us - OK
- <<<(8,1,1), (1024,1,1)>>> Elapsed Time:6182us - OK
- --- Function sumMatrixOnGPUMix
- <<<(256,8192,1), (32,1,1)>>> Elapsed Time:5756us - OK
- <<<(128,8192,1), (64,1,1)>>> Elapsed Time:5641us - OK
- <<<(64,8192,1), (128,1,1)>>> Elapsed Time:5681us - OK
- <<<(32,8192,1), (256,1,1)>>> Elapsed Time:5692us - OK
- <<<(16,8192,1), (512,1,1)>>> Elapsed Time:5723us - OK
- <<<(8,8192,1), (1024,1,1)>>> Elapsed Time:5762us - OK
- Press Enter to quit...
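
The launch configurations logged above are consistent with plain ceiling division of the problem size by the block size. The benchmark source is not part of this paste, so the sketch below is only an illustration of that arithmetic: `grid_for` is a hypothetical helper, not a function from the program that produced the log.

```python
def ceil_div(n, d):
    """Integer ceiling division: smallest k such that k * d >= n."""
    return (n + d - 1) // d


def grid_for(n_x, block_x, n_y=1, block_y=1):
    """Hypothetical helper: grid dimensions that cover an n_x by n_y
    problem with blocks of block_x by block_y threads."""
    return (ceil_div(n_x, block_x), ceil_div(n_y, block_y), 1)


# arrayAdd: 2097152 elements, 1D blocks
print(grid_for(2097152, 32))          # (65536, 1, 1)
print(grid_for(2097152, 1024))        # (2048, 1, 1)

# matrixAdd 2D: 8192 x 8192 elements, 2D blocks
print(grid_for(8192, 32, 8192, 32))   # (256, 256, 1)
print(grid_for(8192, 32, 8192, 16))   # (256, 512, 1)

# matrixAdd Mix: 1D blocks along x, one grid row per matrix row
print(grid_for(8192, 32, 8192, 1))    # (256, 8192, 1)
```

The sumMatrixOnGPU1D grids, for example (256,1,1) with (32,1,1), match `grid_for(8192, 32)`, which would fit a kernel where each thread handles a whole column. That is an inference from the numbers, not something the log states.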