- root@pserver4:~# ./stream
- -------------------------------------------------------------
- STREAM version $Revision: 5.10 $
- -------------------------------------------------------------
- This system uses 8 bytes per array element.
- -------------------------------------------------------------
- Array size = 2500000 (elements), Offset = 0 (elements)
- Memory per array = 19.1 MiB (= 0.0 GiB).
- Total memory required = 57.2 MiB (= 0.1 GiB).
- Each kernel will be executed 10 times.
- The *best* time for each kernel (excluding the first iteration)
- will be used to compute the reported bandwidth.
- -------------------------------------------------------------
- Number of Threads requested = 48
- Number of Threads counted = 48
- -------------------------------------------------------------
- Your clock granularity/precision appears to be 1 microseconds.
- Each test below will take on the order of 4989 microseconds.
- (= 4989 clock ticks)
- Increase the size of the arrays if this shows that
- you are not getting at least 20 clock ticks per test.
- -------------------------------------------------------------
- WARNING -- The above is only a rough guideline.
- For best results, please be sure you know the
- precision of your system timer.
- -------------------------------------------------------------
- Function    Best Rate MB/s  Avg time     Min time     Max time
- Copy:           37803.6     0.003118     0.001058     0.005882
- Scale:          26630.5     0.002516     0.001502     0.005776
- Add:            26571.5     0.003216     0.002258     0.006718
- Triad:          25168.3     0.003607     0.002384     0.007583
- -------------------------------------------------------------
- Solution Validates: avg error less than 1.000000e-13 on all three arrays
- -------------------------------------------------------------
- root@pserver4:~# numastat
-                      node0      node1      node2      node3
- numa_hit            457885     813917     242146    1041351
- numa_miss                0          0          0          0
- numa_foreign             0          0          0          0
- interleave_hit        7239       7304       7244       7311
- local_node          456916     806527     234816    1034939
- other_node             969       7390       7330       6412
-                      node4      node5      node6      node7
- numa_hit            108965     279307      45559     122921
- numa_miss                0          0          0          0
- numa_foreign             0          0          0          0
- interleave_hit        7250       7295       7254       7295
- local_node          101627     271923      38219     115560
- other_node            7338       7384       7340       7361
- root@pserver4:~# ./stream
- -------------------------------------------------------------
- STREAM version $Revision: 5.10 $
- -------------------------------------------------------------
- This system uses 8 bytes per array element.
- -------------------------------------------------------------
- Array size = 2500000 (elements), Offset = 0 (elements)
- Memory per array = 19.1 MiB (= 0.0 GiB).
- Total memory required = 57.2 MiB (= 0.1 GiB).
- Each kernel will be executed 10 times.
- The *best* time for each kernel (excluding the first iteration)
- will be used to compute the reported bandwidth.
- -------------------------------------------------------------
- Number of Threads requested = 48
- Number of Threads counted = 48
- -------------------------------------------------------------
- Your clock granularity/precision appears to be 1 microseconds.
- Each test below will take on the order of 3278 microseconds.
- (= 3278 clock ticks)
- Increase the size of the arrays if this shows that
- you are not getting at least 20 clock ticks per test.
- -------------------------------------------------------------
- WARNING -- The above is only a rough guideline.
- For best results, please be sure you know the
- precision of your system timer.
- -------------------------------------------------------------
- Function    Best Rate MB/s  Avg time     Min time     Max time
- Copy:          133364.2     0.003400     0.000300     0.005561
- Scale:         139461.5     0.003486     0.000287     0.005730
- Add:           142179.8     0.003055     0.000422     0.005984
- Triad:         144631.2     0.003109     0.000415     0.006044
- -------------------------------------------------------------
- Solution Validates: avg error less than 1.000000e-13 on all three arrays
- -------------------------------------------------------------
- root@pserver4:~#
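
The "Best Rate MB/s" column in both runs can be approximately reproduced from the array size and the "Min time" column. The sketch below is an illustration, not part of the original session. It assumes that MB means 10^6 bytes, that Copy and Scale touch two arrays per iteration, and that Add and Triad touch three. The function name and the dictionaries are hypothetical. Small differences from the reported rates are expected because the printed times are rounded to six decimals.

```python
# Sketch: recompute STREAM's "Best Rate MB/s" from array size and "Min time".
# Assumptions (not stated in the log itself): MB = 10**6 bytes; Copy and Scale
# move 2 arrays per iteration, Add and Triad move 3. Names are illustrative.

ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}


def stream_bandwidth_mb_s(n_elements, bytes_per_element, arrays_touched, min_time_s):
    """Return bandwidth in MB/s for one kernel, given its best (minimum) time."""
    bytes_moved = n_elements * bytes_per_element * arrays_touched
    return bytes_moved / min_time_s / 1e6


if __name__ == "__main__":
    n_elements = 2_500_000      # "Array size" from the log
    bytes_per_element = 8       # "This system uses 8 bytes per array element."

    # "Min time" column of the second run, copied from the log.
    second_run_min_times = {
        "Copy": 0.000300,
        "Scale": 0.000287,
        "Add": 0.000422,
        "Triad": 0.000415,
    }

    for kernel, min_time in second_run_min_times.items():
        rate = stream_bandwidth_mb_s(
            n_elements, bytes_per_element, ARRAYS_TOUCHED[kernel], min_time
        )
        print(f"{kernel:<6} {rate:12.1f} MB/s")
```

Running the same calculation with the first run's minimum times shows that the gap between the two runs comes entirely from the shorter best-case times, since the array size is identical in both.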