Advertisement
Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- ==31692== NVPROF is profiling process 31692, command: ./devicequery
- ==31692== Warning: Unified Memory Profiling is not supported on the underlying platform. System requirements for unified memory can be found at: http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#um-requirements
- CUDA Device Query...
- There are 1 CUDA devices.
- CUDA Device #0
- Major revision number: 5
- Minor revision number: 3
- Name: NVIDIA Tegra X1
- Total global memory: 4148756480
- Total shared memory per block: 49152
- Total registers per block: 32768
- Warp size: 32
- Maximum memory pitch: 2147483647
- Maximum threads per block: 1024
- Maximum dimension 0 of block: 1024
- Maximum dimension 1 of block: 1024
- Maximum dimension 2 of block: 64
- Maximum dimension 0 of grid: 2147483647
- Maximum dimension 1 of grid: 65535
- Maximum dimension 2 of grid: 65535
- Clock rate: 921600
- Total constant memory: 65536
- Texture alignment: 512
- Concurrent copy and execution: Yes
- Number of multiprocessors: 1
- Kernel execution timeout: Yes
- Press any key to exit...
- ==31692== Profiling application: ./devicequery
- ==31692== Profiling result:
- No kernels were profiled.
- Type Time(%) Time Calls Avg Min Max Name
- API calls: 77.10% 382.88us 96 3.9880us 1.4060us 92.189us cuDeviceGetAttribute
- 16.29% 80.887us 1 80.887us 80.887us 80.887us cudaGetDeviceProperties
- 2.30% 11.406us 3 3.8020us 2.5000us 5.1560us cuDeviceGetCount
- 1.77% 8.8020us 1 8.8020us 8.8020us 8.8020us cuDeviceTotalMem
- 1.08% 5.3640us 2 2.6820us 1.9270us 3.4370us cuDeviceGet
- 0.49% 2.4480us 1 2.4480us 2.4480us 2.4480us cuDeviceGetName
- 0.48% 2.3960us 1 2.3960us 2.3960us 2.3960us cuDeviceGetUuid
- 0.48% 2.3950us 1 2.3950us 2.3950us 2.3950us cudaGetDeviceCount
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement