otubo

stream on pserver1

Jan 19th, 2016
97
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 4.71 KB | None | 0 0
  1. root@pserver1:~# ./stream
  2. -------------------------------------------------------------
  3. STREAM version $Revision: 5.10 $
  4. -------------------------------------------------------------
  5. This system uses 8 bytes per array element.
  6. -------------------------------------------------------------
  7. Array size = 2500000 (elements), Offset = 0 (elements)
  8. Memory per array = 19.1 MiB (= 0.0 GiB).
  9. Total memory required = 57.2 MiB (= 0.1 GiB).
  10. Each kernel will be executed 10 times.
  11. The *best* time for each kernel (excluding the first iteration)
  12. will be used to compute the reported bandwidth.
  13. -------------------------------------------------------------
  14. Number of Threads requested = 48
  15. Number of Threads counted = 48
  16. -------------------------------------------------------------
  17. Your clock granularity/precision appears to be 1 microseconds.
  18. Each test below will take on the order of 5883 microseconds.
  19. (= 5883 clock ticks)
  20. Increase the size of the arrays if this shows that
  21. you are not getting at least 20 clock ticks per test.
  22. -------------------------------------------------------------
  23. WARNING -- The above is only a rough guideline.
  24. For best results, please be sure you know the
  25. precision of your system timer.
  26. -------------------------------------------------------------
  27. Function Best Rate MB/s Avg time Min time Max time
  28. Copy: 31034.4 0.001320 0.001289 0.001436
  29. Scale: 17912.9 0.002265 0.002233 0.002377
  30. Add: 19867.2 0.003085 0.003020 0.003259
  31. Triad: 20100.5 0.003202 0.002985 0.003579
  32. -------------------------------------------------------------
  33. Solution Validates: avg error less than 1.000000e-13 on all three arrays
  34. -------------------------------------------------------------
  35. root@pserver1:~# numastat
  36. node0 node1 node2 node3
  37. numa_hit 426500 786579 163910 896998
  38. numa_miss 0 0 0 0
  39. numa_foreign 0 0 0 0
  40. interleave_hit 7305 7252 7313 7248
  41. local_node 425470 779319 156510 889659
  42. other_node 1030 7260 7400 7339
  43.  
  44. node4 node5 node6 node7
  45. numa_hit 93984 235942 47527 102039
  46. numa_miss 0 0 0 0
  47. numa_foreign 0 0 0 0
  48. interleave_hit 7296 7253 7302 7249
  49. local_node 86601 228602 40138 95576
  50. other_node 7383 7340 7389 6463
  51. root@pserver1:~# ./stream
  52. -------------------------------------------------------------
  53. STREAM version $Revision: 5.10 $
  54. -------------------------------------------------------------
  55. This system uses 8 bytes per array element.
  56. -------------------------------------------------------------
  57. Array size = 2500000 (elements), Offset = 0 (elements)
  58. Memory per array = 19.1 MiB (= 0.0 GiB).
  59. Total memory required = 57.2 MiB (= 0.1 GiB).
  60. Each kernel will be executed 10 times.
  61. The *best* time for each kernel (excluding the first iteration)
  62. will be used to compute the reported bandwidth.
  63. -------------------------------------------------------------
  64. Number of Threads requested = 48
  65. Number of Threads counted = 48
  66. -------------------------------------------------------------
  67. Your clock granularity/precision appears to be 1 microseconds.
  68. Each test below will take on the order of 5275 microseconds.
  69. (= 5275 clock ticks)
  70. Increase the size of the arrays if this shows that
  71. you are not getting at least 20 clock ticks per test.
  72. -------------------------------------------------------------
  73. WARNING -- The above is only a rough guideline.
  74. For best results, please be sure you know the
  75. precision of your system timer.
  76. -------------------------------------------------------------
  77. Function Best Rate MB/s Avg time Min time Max time
  78. Copy: 136511.1 0.000321 0.000293 0.000419
  79. Scale: 141341.3 0.000303 0.000283 0.000361
  80. Add: 144216.8 0.000532 0.000416 0.001333
  81. Triad: 145973.5 0.000436 0.000411 0.000498
  82. -------------------------------------------------------------
  83. Solution Validates: avg error less than 1.000000e-13 on all three arrays
  84. -------------------------------------------------------------
Add Comment
Please, Sign In to add comment