Guest User

Untitled

a guest
Nov 23rd, 2017
64
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 11.57 KB | None | 0 0
  1. ======== Profiling result:
  2. Time(%) Time Calls Avg Min Max Name
  3. 26.16% 336.86ms 854 394.45us 373.72us 430.24us fesft_dp$radiation_rg_$ck_L2127_40
  4. 19.93% 256.67ms 520 493.59us 446.33us 545.63us fesft_dp$radiation_rg_$ck_L2596_80
  5. 7.83% 100.84ms 2267 44.479us 831ns 1.5350ms [CUDA memcpy HtoD]
  6. 6.92% 89.109ms 520 171.36us 168.83us 180.22us fesft_dp$radiation_rg_$ck_L2596_82
  7. 4.53% 58.266ms 660 88.282us 85.919us 93.471us fesft_dp$radiation_rg_$ck_L2638_84
  8. 3.36% 43.242ms 1080 40.038us 36.992us 42.304us fesft_dp$radiation_rg_$ck_L2161_42
  9. 2.64% 33.982ms 6000 5.6630us 5.3430us 9.3440us fesft_dp$radiation_rg_$ck_L1914_28
  10. 1.94% 24.943ms 801 31.139us 640ns 204.25us [CUDA memcpy DtoH]
  11. 1.88% 24.158ms 80 301.98us 298.05us 320.73us fesft_dp$radiation_rg_$ck_L2051_37
  12. 1.77% 22.825ms 60 380.42us 374.08us 410.36us fesft_dp$radiation_rg_$ck_L2497_71
  13. 1.70% 21.912ms 3600 6.0860us 5.7910us 10.048us fesft_dp$radiation_rg_$ck_L2383_53
  14. 1.69% 21.744ms 6000 3.6240us 3.2960us 8.3520us fesft_dp$radiation_rg_$ck_L1914_29
  15. 1.59% 20.411ms 360 56.698us 9.5040us 66.656us copytoblock3d$src_block_fields_$ck_L1401_1
  16. 1.52% 19.525ms 6000 3.2540us 3.1350us 5.8880us fesft_dp$radiation_rg_$ck_L1914_31
  17. 1.21% 15.603ms 6000 2.6000us 2.4950us 9.1840us fesft_dp$radiation_rg_$ck_L1914_30
  18. 1.15% 14.818ms 3600 4.1150us 3.8710us 7.7120us fesft_dp$radiation_rg_$ck_L2383_54
  19. 1.13% 14.524ms 3600 4.0340us 3.8390us 8.1600us fesft_dp$radiation_rg_$ck_L2383_57
  20. 0.99% 12.775ms 100 127.75us 126.05us 134.02us fesft_dp$radiation_rg_$ck_L1925_32
  21. 0.90% 11.642ms 3600 3.2330us 3.1040us 9.2800us fesft_dp$radiation_rg_$ck_L2383_56
  22. 0.81% 10.461ms 3600 2.9050us 2.7520us 7.5520us fesft_dp$radiation_rg_$ck_L2383_55
  23. 0.80% 10.295ms 60 171.58us 169.05us 179.97us fesft_dp$radiation_rg_$ck_L2497_73
  24. 0.63% 8.0963ms 520 15.569us 14.943us 16.960us fesft_dp$radiation_rg_$ck_L2596_77
  25. 0.58% 7.4695ms 140 53.353us 51.903us 55.455us fesft_dp$radiation_rg_$ck_L2617_83
  26. 0.55% 7.0862ms 226 31.355us 27.648us 34.784us fesft_dp$radiation_rg_$ck_L2144_41
  27. 0.53% 6.8257ms 60 113.76us 111.42us 120.61us fesft_dp$radiation_rg_$ck_L2515_74
  28. 0.48% 6.1290ms 140 43.778us 42.880us 44.608us fesft_dp$radiation_rg_$ck_L2675_85
  29. 0.46% 5.9702ms 220 27.137us 26.368us 28.736us fesft_dp$radiation_rg_$ck_L2190_43
  30. 0.46% 5.9577ms 20 297.88us 292.00us 316.03us radiation_rg_organize$radiation_rg_org_$ck_L2210_14
  31. 0.45% 5.8370ms 20 291.85us 287.55us 309.18us fesft_dp$radiation_rg_$ck_L1813_24
  32. 0.39% 5.0782ms 20 253.91us 249.31us 272.45us radiation_rg_organize$radiation_rg_org_$ck_L2602_30
  33. 0.37% 4.7820ms 120 39.850us 38.816us 42.848us copyfromblock3d$src_block_fields_$ck_L1610_6
  34. 0.30% 3.8332ms 520 7.3710us 7.0080us 8.1280us fesft_dp$radiation_rg_$ck_L2596_79
  35. 0.30% 3.8042ms 80 47.551us 45.759us 51.071us fesft_dp$radiation_rg_$ck_L2067_38
  36. 0.28% 3.5428ms 520 6.8130us 6.4000us 7.6160us fesft_dp$radiation_rg_$ck_L2596_81
  37. 0.27% 3.5329ms 60 58.881us 57.631us 61.631us fesft_dp$radiation_rg_$ck_L2786_89
  38. 0.25% 3.1834ms 140 22.738us 21.984us 25.056us fesft_dp$radiation_rg_$ck_L2568_75
  39. 0.22% 2.8761ms 100 28.761us 28.096us 31.008us fesft_dp$radiation_rg_$ck_L2244_46
  40. 0.22% 2.8642ms 520 5.5080us 5.1200us 6.1440us fesft_dp$radiation_rg_$ck_L2596_78
  41. 0.22% 2.8177ms 100 28.176us 26.944us 29.856us fesft_dp$radiation_rg_$ck_L1891_26
  42. 0.19% 2.4765ms 100 24.765us 23.904us 27.104us fesft_dp$radiation_rg_$ck_L2223_45
  43. 0.17% 2.2060ms 60 36.767us 35.967us 39.712us fesft_dp$radiation_rg_$ck_L2709_87
  44. 0.16% 2.0679ms 1000 2.0670us 1.6640us 3.7760us copytoblock2d$src_block_fields_$ck_L1439_4
  45. 0.16% 2.0475ms 20 102.38us 101.12us 106.56us radiation_rg_organize$radiation_rg_org_$ck_L2344_15
  46. 0.15% 1.8786ms 520 3.6120us 3.3280us 4.0320us fesft_dp$radiation_rg_$ck_L2596_76
  47. 0.14% 1.8608ms 60 31.013us 30.496us 31.808us fesft_dp$radiation_rg_$ck_L2689_86
  48. 0.14% 1.7496ms 20 87.482us 86.079us 92.127us radiation_rg_organize$radiation_rg_org_$ck_L3130_34
  49. 0.12% 1.6017ms 20 80.087us 78.239us 85.727us fesft_dp$radiation_rg_$ck_L1751_20
  50. 0.10% 1.3224ms 60 22.040us 21.504us 24.544us fesft_dp$radiation_rg_$ck_L2353_50
  51. 0.10% 1.3144ms 220 5.9740us 5.3430us 7.2960us fesft_dp$radiation_rg_$ck_L2099_39
  52. 0.10% 1.2696ms 20 63.481us 61.376us 72.287us radiation_rg_organize$radiation_rg_org_$ck_L2041_9
  53. 0.10% 1.2610ms 80 15.762us 15.040us 17.088us fesft_dp$radiation_rg_$ck_L2203_44
  54. 0.10% 1.2517ms 20 62.586us 61.343us 65.920us radiation_rg_organize$radiation_rg_org_$ck_L3467_36
  55. 0.07% 942.33us 480 1.9630us 1.7280us 8.2240us copyfromblock2d$src_block_fields_$ck_L1652_9
  56. 0.07% 840.57us 20 42.028us 41.216us 43.935us fesft_dp$radiation_rg_$ck_L1783_22
  57. 0.05% 701.60us 60 11.693us 11.104us 12.672us fesft_dp$radiation_rg_$ck_L2497_68
  58. 0.05% 689.78us 20 34.489us 33.440us 36.704us radiation_rg_organize$radiation_rg_org_$ck_L2512_26
  59. 0.05% 639.58us 20 31.978us 31.040us 34.176us radiation_rg_organize$radiation_rg_org_$ck_L2380_17
  60. 0.05% 591.65us 20 29.582us 29.056us 30.720us radiation_rg_organize$radiation_rg_org_$ck_L2419_20
  61. 0.04% 454.46us 60 7.5740us 7.2960us 8.2880us fesft_dp$radiation_rg_$ck_L2497_70
  62. 0.03% 432.06us 60 7.2000us 6.7840us 7.8080us fesft_dp$radiation_rg_$ck_L2746_88
  63. 0.03% 424.28us 40 10.607us 10.176us 11.456us compute_sunshine_conditions$radiation_interface_$ck_L1704_13
  64. 0.03% 395.33us 60 6.5880us 6.1440us 7.1680us fesft_dp$radiation_rg_$ck_L2497_72
  65. 0.03% 370.94us 20 18.546us 17.983us 20.416us fesft_dp$radiation_rg_$ck_L1669_17
  66. 0.03% 340.80us 60 5.6790us 5.1840us 6.0160us fesft_dp$radiation_rg_$ck_L2497_69
  67. 0.03% 325.34us 60 5.4220us 5.1200us 6.0160us fesft_dp$radiation_rg_$ck_L2806_90
  68. 0.02% 293.44us 20 14.671us 14.272us 15.200us radiation_rg_organize$radiation_rg_org_$ck_L2395_18
  69. 0.02% 274.08us 20 13.703us 10.528us 15.104us calc_rad_corrections$radiation_interface_$ck_L2218_32
  70. 0.02% 244.22us 100 2.4420us 2.2720us 2.7200us fesft_dp$radiation_rg_$ck_L1914_27
  71. 0.02% 241.18us 20 12.059us 11.680us 13.247us radiation_rg_organize$radiation_rg_org_$ck_L2469_24
  72. 0.02% 209.60us 60 3.4930us 3.2320us 3.8400us fesft_dp$radiation_rg_$ck_L2497_67
  73. 0.02% 199.97us 20 9.9980us 9.4720us 10.528us radiation_rg_organize$radiation_rg_org_$ck_L3094_33
  74. 0.02% 194.97us 60 3.2490us 2.9760us 3.6150us fesft_dp$radiation_rg_$ck_L2367_51
  75. 0.01% 169.60us 20 8.4790us 8.1920us 9.3120us radiation_rg_organize$radiation_rg_org_$ck_L2444_22
  76. 0.01% 134.34us 20 6.7160us 6.3680us 7.2320us fesft_dp$radiation_rg_$ck_L1735_19
  77. 0.01% 130.75us 60 2.1790us 2.0800us 2.4960us fesft_dp$radiation_rg_$ck_L2383_52
  78. 0.01% 113.34us 20 5.6670us 5.3760us 6.3680us radiation_rg_organize$radiation_rg_org_$ck_L2565_28
  79. 0.01% 113.15us 20 5.6570us 5.3120us 5.8560us fesft_dp$radiation_rg_$ck_L1771_21
  80. 0.01% 109.60us 20 5.4790us 5.3440us 5.9200us fesft_dp$radiation_rg_$ck_L2032_36
  81. 0.01% 108.51us 40 2.7120us 2.1760us 3.4560us compute_sunshine_conditions$radiation_interface_$ck_L1759_14
  82. 0.01% 101.79us 20 5.0890us 4.7360us 5.6000us fesft_dp$radiation_rg_$ck_L2837_92
  83. 0.01% 100.13us 20 5.0060us 4.7680us 5.4070us radiation_rg_organize$radiation_rg_org_$ck_L1824_4
  84. 0.01% 86.208us 20 4.3100us 4.0320us 4.5120us radiation_rg_organize$radiation_rg_org_$ck_L1948_7
  85. 0.01% 82.303us 40 2.0570us 1.6640us 2.7830us compute_sunshine_conditions$radiation_interface_$ck_L1597_10
  86. 0.01% 77.248us 20 3.8620us 3.4880us 4.2880us radiation_rg_organize$radiation_rg_org_$ck_L3443_35
  87. 0.01% 73.951us 20 3.6970us 3.4880us 4.1590us fesft_dp$radiation_rg_$ck_L2265_47
  88. 0.01% 73.087us 20 3.6540us 3.3600us 4.0640us radiation_rg_organize$radiation_rg_org_$ck_L2530_27
  89. 0.01% 71.870us 20 3.5930us 3.3920us 3.8080us fesft_dp$radiation_rg_$ck_L2335_49
  90. 0.01% 66.590us 20 3.3290us 2.9120us 3.9040us radiation_prepare$radiation_interface_$ck_L1349_1
  91. 0.00% 62.048us 20 3.1020us 2.9120us 3.3280us fesft_dp$radiation_rg_$ck_L1701_18
  92. 0.00% 55.264us 20 2.7630us 2.6240us 3.0720us fesft_dp$radiation_rg_$ck_L2298_48
  93. 0.00% 51.742us 20 2.5870us 2.3680us 2.6880us radiation_rg_organize$radiation_rg_org_$ck_L2434_21
  94. 0.00% 51.360us 20 2.5680us 2.4000us 2.6880us fesft_dp$radiation_rg_$ck_L1800_23
  95. 0.00% 50.496us 20 2.5240us 2.2720us 2.6880us radiation_rg_organize$radiation_rg_org_$ck_L2459_23
  96. 0.00% 49.600us 20 2.4800us 2.2720us 2.6560us radiation_rg_organize$radiation_rg_org_$ck_L2370_16
  97. 0.00% 48.928us 20 2.4460us 2.2720us 2.7200us radiation_organize$radiation_interface_$ck_L2047_30
  98. 0.00% 45.600us 20 2.2800us 2.1440us 2.4320us fesft_dp$radiation_rg_$ck_L1853_25
  99. 0.00% 42.208us 20 2.1100us 1.9840us 2.2400us radiation_rg_organize$radiation_rg_org_$ck_L2409_19
  100. 0.00% 40.031us 20 2.0010us 1.9190us 2.1440us radiation_rg_organize$radiation_rg_org_$ck_L2482_25
  101. 0.00% 39.520us 20 1.9760us 1.8880us 2.2720us radiation_rg_organize$radiation_rg_org_$ck_L2855_101
  102. 0.00% 2.2080us 1 2.2080us 2.2080us 2.2080us test_physics$src_physics_$ck_L346_2
  103.  
  104. ======== API calls:
  105. Time(%) Time Calls Avg Min Max Name
  106. 61.16% 2.42077s 1 2.42077s 2.42077s 2.42077s cuCtxCreate
  107. 15.37% 608.27ms 4887 124.47us 1.1930us 7.2228ms cuStreamSynchronize
  108. 13.26% 524.77ms 2267 231.48us 6.5330us 9.9543ms cuMemcpyHtoD
  109. 5.67% 224.58ms 53921 4.1640us 3.5250us 985.04us cuLaunchKernel
  110. 2.18% 86.302ms 5 17.260ms 432.42us 44.843ms cuModuleLoadData
  111. 1.15% 45.689ms 801 57.040us 14.559us 379.07us cuMemcpyDtoH
  112. 0.59% 23.495ms 160 146.84us 1.7510us 994.16us cuCtxSynchronize
  113. 0.57% 22.712ms 332 68.410us 2.1080us 324.57us cuMemAlloc
  114. 0.03% 1.0416ms 1 1.0416ms 1.0416ms 1.0416ms cuMemHostAlloc
  115. 0.00% 159.25us 1 159.25us 159.25us 159.25us cuStreamCreate
  116. 0.00% 120.68us 98 1.2310us 285ns 5.2820us cuModuleGetFunction
  117. 0.00% 39.634us 196 202ns 141ns 1.0300us cuFuncGetAttribute
  118. 0.00% 18.859us 34 554ns 269ns 2.5630us cuEventCreate
  119. 0.00% 18.083us 98 184ns 141ns 335ns cuFuncSetCacheConfig
  120. 0.00% 5.8750us 5 1.1750us 846ns 1.5270us cuMemHostGetDevicePointer
  121. 0.00% 4.5140us 5 902ns 454ns 1.2430us cuModuleGetGlobal
  122. 0.00% 2.5620us 3 854ns 315ns 1.7020us cuDeviceGetCount
  123. 0.00% 1.7660us 7 252ns 170ns 476ns cuDeviceGetAttribute
  124. 0.00% 1.3350us 3 445ns 226ns 715ns cuDeviceGet
  125. 0.00% 456ns 1 456ns 456ns 456ns cuCtxSetCurrent
  126. 0.00% 333ns 1 333ns 333ns 333ns cuCtxGetCurrent
  127. + /project/c01/install_old/daint/serialbox/gnu/bin/compare Field_rank0.json radiation-standalone_rank0.json
  128. Field_rank0
  129. radiation-standalone_rank0
Add Comment
Please, Sign In to add comment