illwieckz

amdprocl intelcl nvidiacl

Jun 11th, 2017
$ ls /opt/amdgpupro-opencl/lib
libamdocl12cl64.so libamdocl64.so libdrm_amdpro.so.1 libdrm_amdpro.so.1.0.0 libdrm.so.2 libdrm.so.2.4.0

$ ls /opt/intel/opencl/
clbltfne9_img_cbk.o clbltfnh8_img_cbk.o clbltfnl9_img_cbk.o clbltfnshared.rtl kernel-4.7.patch libcommon_clang.so libigdbcl.so libigdmcl.so libIntelOpenCL.so libOpenCL.so libtbbmalloc.so libtbb.so.2 __ocl_svml_e9.so OpenCL.pc
clbltfne9_img_cbk.rtl clbltfnh8_img_cbk.rtl clbltfnl9_img_cbk.rtl igdclbif.bin libclang_compiler.so libcpu_device.so libigdccl.so libigdrcl.so libmd.so libOpenCL.so.1 libtbbmalloc.so.2 LICENSE __ocl_svml_h8.so
clbltfne9.rtl clbltfnh8.rtl clbltfnl9.rtl include libcl_logger.so libiga64.so libigdfcl.so libintelocl.so libOclCpuBackEnd.so libtask_executor.so libtbb.so NOTICES __ocl_svml_l9.so

$ ls /usr/lib/nvidia-375/
alt_ld.so.conf libEGL.so.1 libGLESv2_nvidia.so.2 libGL.so.1.0.0 libnvcuvid.so libnvidia-compiler.so libnvidia-encode.so.1 libnvidia-glsi.so.375.26 libnvidia-ml.so.375.26 libnvidia-wfb.so.375.26
bin libGLdispatch.so.0 libGLESv2_nvidia.so.375.26 libGLX_indirect.so.0 libnvcuvid.so.1 libnvidia-compiler.so.1 libnvidia-encode.so.375.26 libnvidia-ifr.so libnvidia-opencl.so.1 libOpenGL.so
ld.so.conf libGLESv1_CM_nvidia.so.1 libGLESv2.so libGLX_nvidia.so.0 libnvcuvid.so.375.26 libnvidia-compiler.so.375.26 libnvidia-fatbinaryloader.so.375.26 libnvidia-ifr.so.1 libnvidia-opencl.so.375.26 libOpenGL.so.0
libEGL_nvidia.so.0 libGLESv1_CM_nvidia.so.375.26 libGLESv2.so.2 libGLX_nvidia.so.375.26 libnvidia-cfg.so libnvidia-eglcore.so.375.26 libnvidia-fbc.so.1 libnvidia-ifr.so.375.26 libnvidia-ptxjitcompiler.so.375.26 tls
libEGL_nvidia.so.375.26 libGLESv1_CM.so libGL.so libGLX.so libnvidia-cfg.so.1 libnvidia-egl-wayland.so.375.26 libnvidia-fbc.so.375.26 libnvidia-ml.so libnvidia-tls.so.375.26 vdpau
libEGL.so libGLESv1_CM.so.1 libGL.so.1 libGLX.so.0 libnvidia-cfg.so.375.26 libnvidia-encode.so libnvidia-glcore.so.375.26 libnvidia-ml.so.1 libnvidia-wfb.so.1 xorg

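$ cat .local/bin/amdprocl
#! /bin/sh

# Append the AMDGPU-PRO OpenCL libraries to the loader search path,
# without leaving an empty entry when LD_LIBRARY_PATH is unset.
export LD_LIBRARY_PATH="${LD_LIBRARY_PATH:+${LD_LIBRARY_PATH}:}/opt/amdgpupro-opencl/lib/"

# Run the wrapped command, keeping each argument intact.
exec "$@"

#EOF

$ cat .local/bin/intelcl
#! /bin/sh

# Append the Intel OpenCL libraries to the loader search path,
# without leaving an empty entry when LD_LIBRARY_PATH is unset.
export LD_LIBRARY_PATH="${LD_LIBRARY_PATH:+${LD_LIBRARY_PATH}:}/opt/intel/opencl"

# Run the wrapped command, keeping each argument intact.
exec "$@"

#EOF

$ cat .local/bin/nvidiacl
#! /bin/sh

# Append the NVIDIA 375 driver libraries to the loader search path,
# without leaving an empty entry when LD_LIBRARY_PATH is unset.
export LD_LIBRARY_PATH="${LD_LIBRARY_PATH:+${LD_LIBRARY_PATH}:}/usr/lib/nvidia-375/"

# Run the wrapped command, keeping each argument intact.
exec "$@"

#EOF
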
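The three wrappers differ only in the directory they append, so they could be folded into one helper. The following is a minimal sketch, not part of the original paste: the function names `append_lib_dir` and `run_with_lib_dir` are hypothetical. It also avoids producing an empty path entry when `LD_LIBRARY_PATH` starts out unset, since the dynamic loader treats an empty entry as the current directory.

```shell
#!/bin/sh
# Hypothetical helpers, not taken from the paste above.

# append_lib_dir DIR
# Print the current LD_LIBRARY_PATH with DIR appended. The ${VAR:+...}
# expansion adds the ":" separator only when the variable is non-empty.
append_lib_dir() {
    printf '%s\n' "${LD_LIBRARY_PATH:+${LD_LIBRARY_PATH}:}$1"
}

# run_with_lib_dir DIR COMMAND [ARGS...]
# Run COMMAND with DIR appended to LD_LIBRARY_PATH. The subshell keeps
# the change from leaking into the caller; "$@" preserves arguments
# that contain spaces.
run_with_lib_dir() {
    dir=$1
    shift
    (
        LD_LIBRARY_PATH=$(append_lib_dir "$dir")
        export LD_LIBRARY_PATH
        "$@"
    )
}
```

Under these assumptions, `run_with_lib_dir /opt/intel/opencl clinfo` would stand in for `intelcl clinfo`, and calls can be nested the same way the wrappers are chained.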
$ amdprocl intelcl nvidiacl clinfo | grep 'Platform Name\|Device Name' | tail -n +4
Platform Name Intel(R) OpenCL
Device Name Intel(R) HD Graphics
Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
Platform Name NVIDIA CUDA
Device Name Quadro K1100M
Platform Name AMD Accelerated Parallel Processing
Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz

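The `grep | tail -n +4` pipeline relies on the platform summary being exactly three lines long. As an alternative, a small awk filter can pair each device with the most recent platform name it saw, so the number of platforms no longer matters. This is a sketch only: `list_cl_devices` is a hypothetical name, and it assumes each line of interest starts with `Platform Name` or `Device Name` (optionally indented) as in the output shown here.

```shell
#!/bin/sh
# Hypothetical helper, not taken from the paste above.

# list_cl_devices
# Read clinfo-style text on stdin and print one "platform / device"
# line per device. Repeated "Platform Name" lines are harmless: each
# one just updates the platform remembered for the devices that follow.
list_cl_devices() {
    awk '
        /^[ \t]*Platform Name[ \t]/ {
            sub(/^[ \t]*Platform Name[ \t]+/, "")
            platform = $0
            next
        }
        /^[ \t]*Device Name[ \t]/ {
            sub(/^[ \t]*Device Name[ \t]+/, "")
            print platform " / " $0
        }
    '
}
```

It would be used as `amdprocl intelcl nvidiacl clinfo | list_cl_devices`.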
$ amdprocl intelcl nvidiacl clinfo
Number of platforms 3
Platform Name Intel(R) OpenCL
Platform Vendor Intel(R) Corporation
Platform Version OpenCL 1.2
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir
Platform Extensions function suffix INTEL

Platform Name NVIDIA CUDA
Platform Vendor NVIDIA Corporation
Platform Version OpenCL 1.2 CUDA 8.0.0
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_gl_event
Platform Extensions function suffix NV

Platform Name AMD Accelerated Parallel Processing
Platform Vendor Advanced Micro Devices, Inc.
Platform Version OpenCL 2.0 AMD-APP (2236.5)
Platform Profile FULL_PROFILE
Platform Extensions cl_khr_icd cl_amd_event_callback cl_amd_offline_devices
Platform Extensions function suffix AMD

Platform Name Intel(R) OpenCL
Number of devices 2
Device Name Intel(R) HD Graphics
Device Vendor Intel(R) Corporation
Device Vendor ID 0x8086
Device Version OpenCL 1.2
Driver Version r3.1.58620
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Profile FULL_PROFILE
Max compute units 20
Max clock frequency 0MHz
Device Partition (core)
Max number of sub-devices 0
Supported partition types by <unknown> (0x7F6700000000)
Max work item dimensions 3
Max work item sizes 256x256x256
Max work group size 256
Preferred work group size multiple 32
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 1 / 1
half 0 / 0 (n/a)
float 1 / 1
double 0 / 0 (n/a)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals No
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (n/a)
Address bits 64, Little-Endian
Global memory size 1709598311 (1.592GiB)
Error Correction support No
Max memory allocation 854799155 (815.2MiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 524288
Global Memory cache line 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 53424947 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 16384x16384 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 128
Max number of write image args 128
Local memory type Local
Local memory size 65536 (64KiB)
Max constant buffer size 854799155 (815.2MiB)
Max number of constant args 8
Max size of kernel argument 1024
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 80ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
SPIR versions 1.2
printf() buffer size 4194304 (4MiB)
Built-in kernels block_motion_estimate_intel;block_advanced_motion_estimate_check_intel;block_advanced_motion_estimate_bidirectional_check_intel
Motion Estimation accelerator version (Intel) 2
Device Available Yes
Compiler Available Yes
Linker Available Yes
Device Extensions cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_driver_diagnostics cl_intel_motion_estimation cl_intel_packed_yuv cl_intel_required_subgroup_size cl_intel_subgroups cl_intel_va_api_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir

Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
Device Vendor Intel(R) Corporation
Device Vendor ID 0x8086
Device Version OpenCL 1.2 (Build 330)
Driver Version 1.2.0.330
Device OpenCL C Version OpenCL C 1.2
Device Type CPU
Device Profile FULL_PROFILE
Max compute units 8
Max clock frequency 2800MHz
Device Partition (core)
Max number of sub-devices 8
Supported partition types by counts, equally, by names (Intel)
Max work item dimensions 3
Max work item sizes 8192x8192x8192
Max work group size 8192
Preferred work group size multiple 128
Preferred / native vector sizes
char 1 / 32
short 1 / 16
int 1 / 8
long 1 / 4
half 0 / 0 (n/a)
float 1 / 8
double 1 / 4 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero No
Round to infinity No
IEEE754-2008 fused multiply-add No
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Address bits 64, Little-Endian
Global memory size 16426655744 (15.3GiB)
Error Correction support No
Max memory allocation 4106663936 (3.825GiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 262144
Global Memory cache line 64 bytes
Image support Yes
Max number of samplers per kernel 480
Max size for 1D images from buffer 256666496 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 16384x16384 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 480
Max number of write image args 480
Local memory type Global
Local memory size 32768 (32KiB)
Max constant buffer size 131072 (128KiB)
Max number of constant args 480
Max size of kernel argument 3840 (3.75KiB)
Queue properties
Out-of-order execution Yes
Profiling Yes
Local thread execution (Intel) Yes
Prefer user sync for interop No
Profiling timer resolution 1ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels Yes
SPIR versions 1.2
printf() buffer size 1048576 (1024KiB)
Built-in kernels
Device Available Yes
Compiler Available Yes
Linker Available Yes
Device Extensions cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_3d_image_writes cl_intel_exec_by_local_thread cl_khr_spir cl_khr_fp64

Platform Name NVIDIA CUDA
Number of devices 1
Device Name Quadro K1100M
Device Vendor NVIDIA Corporation
Device Vendor ID 0x10de
Device Version OpenCL 1.2 CUDA
Driver Version 375.26
Device OpenCL C Version OpenCL C 1.2
Device Type GPU
Device Profile FULL_PROFILE
Device Topology (NV) PCI-E, 01:00.0
Max compute units 2
Max clock frequency 705MHz
Compute Capability (NV) 3.0
Device Partition (core)
Max number of sub-devices 1
Supported partition types None
Max work item dimensions 3
Max work item sizes 1024x1024x64
Max work group size 1024
Preferred work group size multiple 32
Warp size (NV) 32
Preferred / native vector sizes
char 1 / 1
short 1 / 1
int 1 / 1
long 1 / 1
half 0 / 0 (n/a)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Address bits 64, Little-Endian
Global memory size 2095251456 (1.951GiB)
Error Correction support No
Max memory allocation 523812864 (499.5MiB)
Unified memory for Host and Device No
Integrated memory (NV) No
Minimum alignment for any data type 128 bytes
Alignment of base address 4096 bits (512 bytes)
Global Memory cache type Read/Write
Global Memory cache size 32768
Global Memory cache line 128 bytes
Image support Yes
Max number of samplers per kernel 32
Max size for 1D images from buffer 134217728 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 16384x16384 pixels
Max 3D image size 4096x4096x4096 pixels
Max number of read image args 256
Max number of write image args 16
Local memory type Local
Local memory size 49152 (48KiB)
Registers per block (NV) 65536
Max constant buffer size 65536 (64KiB)
Max number of constant args 9
Max size of kernel argument 4352 (4.25KiB)
Queue properties
Out-of-order execution Yes
Profiling Yes
Prefer user sync for interop No
Profiling timer resolution 1000ns
Execution capabilities
Run OpenCL kernels Yes
Run native kernels No
Kernel execution timeout (NV) No
Concurrent copy and kernel execution (NV) Yes
Number of async copy engines 1
printf() buffer size 1048576 (1024KiB)
Built-in kernels
Device Available Yes
Compiler Available Yes
Linker Available Yes
Device Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_gl_event

Platform Name AMD Accelerated Parallel Processing
Number of devices 1
Device Name Intel(R) Core(TM) i7-4810MQ CPU @ 2.80GHz
Device Vendor GenuineIntel
Device Vendor ID 0x1002
Device Version OpenCL 1.2 AMD-APP (2236.5)
Driver Version 2236.5 (sse2,avx)
Device OpenCL C Version OpenCL C 1.2
Device Type CPU
Device Profile FULL_PROFILE
Device Board Name (AMD)
Device Topology (AMD) (n/a)
Max compute units 8
Max clock frequency 3700MHz
Device Partition (core, cl_ext_device_fission)
Max number of sub-devices 8
Supported partition types equally, by counts, by affinity domain
Supported affinity domains L3 cache, L2 cache, L1 cache, next partitionable
Supported partition types (ext) equally, by counts, by affinity domain
Supported affinity domains (ext) L3 cache, L2 cache, L1 cache, next fissionable
Max work item dimensions 3
Max work item sizes 1024x1024x1024
Max work group size 1024
Preferred work group size multiple 1
Preferred / native vector sizes
char 16 / 16
short 8 / 8
int 4 / 4
long 2 / 2
half 4 / 4 (n/a)
float 8 / 8
double 4 / 4 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations Yes
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes
Infinity and NANs Yes
Round to nearest Yes
Round to zero Yes
Round to infinity Yes
IEEE754-2008 fused multiply-add Yes
Support is emulated in software No
Correctly-rounded divide and sqrt operations No
Address bits 64, Little-Endian
Global memory size 16426655744 (15.3GiB)
Error Correction support No
Max memory allocation 4106663936 (3.825GiB)
Unified memory for Host and Device Yes
Minimum alignment for any data type 128 bytes
Alignment of base address 1024 bits (128 bytes)
Global Memory cache type Read/Write
Global Memory cache size 32768
Global Memory cache line 64 bytes
Image support Yes
Max number of samplers per kernel 16
Max size for 1D images from buffer 65536 pixels
Max 1D or 2D image array size 2048 images
Max 2D image size 8192x8192 pixels
Max 3D image size 2048x2048x2048 pixels
Max number of read image args 128
Max number of write image args 64
Local memory type Global
Local memory size 32768 (32KiB)
Max constant buffer size 65536 (64KiB)
Max number of constant args 8
Max size of kernel argument 4096 (4KiB)
Queue properties
Out-of-order execution No
Profiling Yes
Prefer user sync for interop Yes
Profiling timer resolution 1ns
Profiling timer offset since Epoch (AMD) 1497203026521170561ns (Sun Jun 11 19:43:46 2017)
Execution capabilities
Run OpenCL kernels Yes
Run native kernels Yes
SPIR versions 1.2
printf() buffer size 65536 (64KiB)
Built-in kernels
Device Available Yes
Compiler Available Yes
Linker Available Yes
Device Extensions cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_spir cl_khr_gl_event

NULL platform behavior
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) No platform
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) No platform
clCreateContext(NULL, ...) [default] No platform
clCreateContext(NULL, ...) [other] Success [INTEL]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) No platform
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform