mahmoodn

accelsim-chiplet

Jul 1st, 2022
  1. mnaderan@rtx3080:~$ git clone -b dev https://github.ugent.be/mnaderan/accelsim-chiplet
  2. Cloning into 'accelsim-chiplet'...
  3. Username for 'https://github.ugent.be': mnaderan
  4. Password for 'https://mnaderan@github.ugent.be':
  5. remote: Enumerating objects: 5949, done.
  6. remote: Counting objects: 100% (534/534), done.
  7. remote: Compressing objects: 100% (261/261), done.
  8. remote: Total 5949 (delta 328), reused 424 (delta 269), pack-reused 5415
  9. Receiving objects: 100% (5949/5949), 1.52 MiB | 11.17 MiB/s, done.
  10. Resolving deltas: 100% (3721/3721), done.
  11. mnaderan@rtx3080:~$ cd accelsim-chiplet/
  12. mnaderan@rtx3080:accelsim-chiplet$ source gpu-simulator/setup_environment.sh
  13. Cloning into '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'...
  14. Username for 'https://github.ugent.be': mnaderan
  15. Password for 'https://mnaderan@github.ugent.be':
  16. remote: Enumerating objects: 15686, done.
  17. remote: Counting objects: 100% (237/237), done.
  18. remote: Compressing objects: 100% (177/177), done.
  19. remote: Total 15686 (delta 115), reused 92 (delta 60), pack-reused 15449
  20. Receiving objects: 100% (15686/15686), 34.79 MiB | 11.03 MiB/s, done.
  21. Resolving deltas: 100% (11729/11729), done.
  22. Already on 'dev'
  23. Your branch is up to date with 'origin/dev'.
  24. GPGPU-Sim version 4.2.0 (build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa-modified_594.0) configured with AccelWattch.
  25.  
  26. ----------------------------------------------------------------------------
  27. INFO - If you only care about PTX execution, ignore this message. GPGPU-Sim supports PTX execution in modern CUDA.
  28. If you want to run PTXPLUS (sm_1x SASS) with a modern card configuration - set the envronment variable
  29. $PTXAS_CUDA_INSTALL_PATH to point a CUDA version compabible with your card configurations (i.e. 8+ for PASCAL, 9+ for VOLTA etc..)
  30. For example: "export $PTXAS_CUDA_INSTALL_PATH=/usr/local/cuda-9.1"
  31.  
  32. The following text describes why:
  33. If you are using PTXPLUS, only sm_1x is supported and it requires that the app and simulator binaries are compiled in CUDA 4.2 or less.
  34. The simulator requires it since CUDA headers desribe struct sizes in the exec which change from gen to gen.
  35. The apps require 4.2 because new versions of CUDA tools have dropped parsing support for generating sm_1x
  36. When running using modern config (i.e. volta) and PTXPLUS with CUDA 4.2, the $PTXAS_CUDA_INSTALL_PATH env variable is required to get proper register usage
  37. (and hence occupancy) using a version of CUDA that knows the register usage on the real card.
  38.  
  39. ----------------------------------------------------------------------------
  40. setup_environment succeeded
  41. mnaderan@rtx3080:accelsim-chiplet$ make -j -C gpu-simulator/
  42. make: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator'
  43. if [ ! -d ./bin/release ]; then mkdir -p ./bin/release; fi;
  44. if [ ! -d ./build/release ]; then mkdir -p ./build/release; fi;
  45. touch ./build/release/main.makedepend
  46. makedepend -f./build/release/main.makedepend -p./build/release/ main.cc 2> /dev/null
  47. make -C trace-driven depend
  48. make -C trace-parser depend
  49. make -C gpgpu-sim
  50. echo "const char *g_accelsim_version=\"accelsim-commit-9e19b6621cc94d89d8f7b18e65d802ad6979c195_modified_0.0\";" > ./build/release/accelsim_version.h
  51. make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
  52. make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
  53. g++ -Wall -O3 -g3 -fPIC -std=c++11 -I./build/release -I./trace-driven -I./trace-parser -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/usr/local/cuda-11.6/include -c main.cc -o ./build/release/main.o
  54. make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'
  55. touch ../build/release/trace-driven.Makefile.makedepend
  56. touch ../build/release/trace-parser.Makefile.makedepend
  57. makedepend -f../build/release/trace-driven.Makefile.makedepend -p../build/release/ trace_driven.cc 2> /dev/null
  58. makedepend -f../build/release/trace-parser.Makefile.makedepend -p../build/release/ trace_parser.cc 2> /dev/null
  59. make[1]: 'depend' is up to date.
  60. make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
  61. make -C trace-parser
  62. make[1]: 'depend' is up to date.
  63. make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
  64. make -C trace-driven
  65. make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
  66. make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
  67. touch ../build/release/trace-parser.Makefile.makedepend
  68. makedepend -f../build/release/trace-parser.Makefile.makedepend -p../build/release/ trace_parser.cc 2> /dev/null
  69. touch ../build/release/trace-driven.Makefile.makedepend
  70. makedepend -f../build/release/trace-driven.Makefile.makedepend -p../build/release/ trace_driven.cc 2> /dev/null
  71. g++ -O3 -g3 -fPIC -std=c++11 -Wall -I/usr/local/cuda-11.6/include -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -c trace_driven.cc -o ../build/release/trace_driven.o
  72. g++ -O3 -g3 -fPIC -std=c++11 -Wall -I/usr/local/cuda-11.6/include -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -c trace_parser.cc -o ../build/release/trace_parser.o
  73.  
  74. Building GPGPU-Sim version 4.2.0 (build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa_modified_0.0) with CUDA version 11.6
  75.  
  76. if [ ! -d lib/gcc-9.4.0/cuda-11060/release ]; then mkdir -p lib/gcc-9.4.0/cuda-11060/release; fi;
  77. Warning: gpgpu-sim is building without opencl support. Make sure NVOPENCL_LIBDIR and NVOPENCL_INCDIR are set
  78. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda; fi;
  79. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim; fi;
  80. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table; fi;
  81. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim; fi;
  82. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl; fi;
  83. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl/bin ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl/bin; fi;
  84. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2 ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2; fi;
  85. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus; fi;
  86. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch; fi;
  87. if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti; fi;
  88. make -C ./src/cuda-sim/ depend
  89. make -C /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ depend
  90. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
  91. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  92. make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  93. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend
  94. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/ Ucache.cc XML_Parse.cc arbiter.cc area.cc array.cc bank.cc basic_circuit.cc basic_components.cc cacti_interface.cc component.cc core.cc crossbar.cc decoder.cc htree2.cc interconnect.cc io.cc iocontrollers.cc logic.cc main.cc mat.cc memoryctrl.cc noc.cc nuca.cc parameter.cc processor.cc router.cc sharedcache.cc subarray.cc technology.cc uca.cc wire.cc xmlParser.cc gpgpu_sim_wrapper.cc 2> /dev/null
  95. make -C ./cacti/ depend
  96. make[4]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  97. make[5]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  98. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend
  99. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/ area.cc bank.cc mat.cc main.cc Ucache.cc io.cc technology.cc basic_circuit.cc parameter.cc decoder.cc component.cc uca.cc subarray.cc wire.cc htree2.cc cacti_interface.cc router.cc nuca.cc crossbar.cc arbiter.cc 2> /dev/null
  100. make[5]: Nothing to be done for 'depend'.
  101. make[5]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  102. make[4]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  103. make[3]: Nothing to be done for 'depend'.
  104. make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  105. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  106. make -C /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/
  107. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  108. mkdir -p obj_opt
  109. make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  110. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend
  111. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/ Ucache.cc XML_Parse.cc arbiter.cc area.cc array.cc bank.cc basic_circuit.cc basic_components.cc cacti_interface.cc component.cc core.cc crossbar.cc decoder.cc htree2.cc interconnect.cc io.cc iocontrollers.cc logic.cc main.cc mat.cc memoryctrl.cc noc.cc nuca.cc parameter.cc processor.cc router.cc sharedcache.cc subarray.cc technology.cc uca.cc wire.cc xmlParser.cc gpgpu_sim_wrapper.cc 2> /dev/null
  112. make -C ./cacti/ depend
  113. make[4]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  114. make[5]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  115. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend
  116. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/ area.cc bank.cc mat.cc main.cc Ucache.cc io.cc technology.cc basic_circuit.cc parameter.cc decoder.cc component.cc uca.cc subarray.cc wire.cc htree2.cc cacti_interface.cc router.cc nuca.cc crossbar.cc arbiter.cc 2> /dev/null
  117. make[5]: Nothing to be done for 'depend'.
  118. make[5]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  119. make[4]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
  120. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/Ucache.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Ucache.o
  121. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c XML_Parse.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/XML_Parse.o
  122. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/arbiter.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/arbiter.o
  123. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/area.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/area.o
  124. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c array.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/array.o
  125. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/bank.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/bank.o
  126. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/basic_circuit.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_circuit.o
  127. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c basic_components.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_components.o
  128. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/cacti_interface.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti_interface.o
  129. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/component.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/component.o
  130. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c core.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/core.o
  131. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend
  132. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/crossbar.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/crossbar.o
  133. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/decoder.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/decoder.o
  134. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ cuda_device_printf.cc cuda_device_runtime.cc cuda-sim.cc instructions.cc memory.cc ptx_ir.cc ptx_loader.cc ptx_parser.cc ptx_sim.cc ptx-stats.cc 2> /dev/null
  135. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/htree2.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/htree2.o
  136. make[2]: 'depend' is up to date.
  137. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
  138. make -C ./src/cuda-sim/
  139. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c interconnect.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/interconnect.o
  140. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
  141. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/io.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/io.o
  142. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c iocontrollers.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/iocontrollers.o
  143. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c logic.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/logic.o
  144. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c main.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/main.o
  145. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/mat.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mat.o
  146. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c memoryctrl.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/memoryctrl.o
  147. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c noc.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/noc.o
  148. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/nuca.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/nuca.o
  149. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/parameter.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/parameter.o
  150. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c processor.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/processor.o
  151. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/router.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/router.o
  152. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c sharedcache.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/sharedcache.o
  153. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/subarray.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/subarray.o
  154. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/technology.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/technology.o
  155. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/uca.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/uca.o
  156. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c cacti/wire.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/wire.o
  157. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c xmlParser.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/xmlParser.o
  158. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:217,
  159. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
  160. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
  161. from main.cc:13:
  162. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  163. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  164. 67 | bool done() const { return m_updates == m_issued; }
  165. | ~~~~~~~~~~^~~~~~~~~~~
  166. g++ -m64 -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -c gpgpu_sim_wrapper.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/gpgpu_sim_wrapper.o
  167. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
  168. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
  169. from main.cc:13:
  170. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  171. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  172. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  173. | ~~~^ ~~
  174. | | |
  175. | | address_type {aka long long unsigned int}
  176. | unsigned int
  177. | %04llx
  178. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend
  179. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
  180. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
  181. from main.cc:13:
  182. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  183. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  184. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  185. | ~~~^ ~~
  186. | | |
  187. | | address_type {aka long long unsigned int}
  188. | unsigned int
  189. | %04llx
  190. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ cuda_device_printf.cc cuda_device_runtime.cc cuda-sim.cc instructions.cc memory.cc ptx_ir.cc ptx_loader.cc ptx_parser.cc ptx_sim.cc ptx-stats.cc 2> /dev/null
  191. bison --name-prefix=ptx_ -v -d ptx.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx
  192. ptx.y: warning: 1 nonterminal useless in grammar [-Wother]
  193. ptx.y: warning: 2 rules useless in grammar [-Wother]
  194. ptx.y:573.1-7: warning: nonterminal useless in grammar: vp_spec [-Wother]
  195. 573 | vp_spec: WMMA_DIRECTIVE LAYOUT CONFIGURATION{recognizer->add_space_spec(global_space,0);recognizer->add_ptr_spec(global_space);recognizer->add_wmma_option($1);recognizer...
  196. | ^~~~~~~
  197. ptx.y: warning: 57 reduce/reduce conflicts [-Wconflicts-rr]
  198. ptx.y:277.11-112: warning: rule useless in parser due to conflicts [-Wother]
  199. 277 | | WEAK_DIRECTIVE FUNC_DIRECTIVE { $$ = 0; recognizer->g_func_decl=1; recognizer->func_header(".func"); }
  200. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  201. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x memory.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/memory.o
  202. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:37,
  203. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
  204. from main.cc:13:
  205. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  206. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  207. 450 | std::vector<register_set *> &m_spec_cores_out;
  208. | ^~~~~~~~~~~~~~~~
  209. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  210. 449 | register_set *m_mem_out;
  211. | ^~~~~~~~~
  212. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  213. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  214. | ^~~~~~~~~~~~~~
  215. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x decuda_pred_table/decuda_pred_table.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/decuda_pred_table.o
  216. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  217. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  218. 1298 | if (inst.op != m_supported_op) {
  219. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  220. flex --outfile=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c ptx.l
  221. ptx.l:173: undeclared start condition IN_INST
  222. ptx.l:237: warning, rule cannot be matched
  223. bison --name-prefix=ptxinfo_ -v -d ptxinfo.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo
  224. ptxinfo.y: warning: 2 reduce/reduce conflicts [-Wconflicts-rr]
  225. flex --outfile=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c ptxinfo.l
  226. created /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h
  227. In file included from memory.h:32,
  228. from memory.cc:29:
  229. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  230. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  231. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  232. | ~~~^ ~~
  233. | | |
  234. | | address_type {aka long long unsigned int}
  235. | unsigned int
  236. | %04llx
  237. In file included from memory.h:32,
  238. from memory.cc:29:
  239. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  240. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  241. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  242. | ~~~^ ~~
  243. | | |
  244. | | address_type {aka long long unsigned int}
  245. | unsigned int
  246. | %04llx
  247. cat /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.h | grep "=" | sed 's/^[ ]\+//' | sed 's/[=,]//g' | sed 's/\([_A-Z1-9]\+\)[ ]\+\([0-9]\+\)/\1 \1/' | sed 's/^/DEF(/' | sed 's/ /,"/' | sed 's/$/")/' > /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser_decode.def
  248. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda_device_printf.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_printf.o
  249. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x instructions.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.o
  250. main.cc: In function ‘int main(int, const char**)’:
  251. main.cc:157:29: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<long unsigned int>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  252. 157 | for (int l = 0; l < busy_streams.size(); l++) {
  253. | ~~^~~~~~~~~~~~~~~~~~~~~
  254. main.cc:164:27: warning: deleting object of polymorphic class type ‘function_info’ which has non-virtual destructor might cause undefined behavior [-Wdelete-non-virtual-dtor]
  255. 164 | delete k->entry();
  256. | ^
  257. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda-sim.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda-sim.o
  258. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_ir.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_ir.o
  259. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_sim.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_sim.o
  260. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx-stats.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx-stats.o
  261. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x -DYYDEBUG /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o
  262. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o
  263. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x -DYYDEBUG /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o
  264. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o
  265. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda_device_runtime.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_runtime.o
  266. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  267. from ../../libcuda/gpgpu_context.h:3,
  268. from memory.cc:31:
  269. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  270. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  271. 450 | std::vector<register_set *> &m_spec_cores_out;
  272. | ^~~~~~~~~~~~~~~~
  273. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  274. 449 | register_set *m_mem_out;
  275. | ^~~~~~~~~
  276. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  277. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  278. | ^~~~~~~~~~~~~~
  279. In file included from cuda-math.h:354,
  280. from instructions.cc:53:
  281. /usr/local/cuda-11.6/include/math_functions.h:54:2: warning: #warning "math_functions.h is an internal header file and must not be used directly. This file will be removed in a future CUDA release. Please use cuda_runtime_api.h or cuda_runtime.h instead." [-Wcpp]
  282. 54 | #warning "math_functions.h is an internal header file and must not be used directly. This file will be removed in a future CUDA release. Please use cuda_runtime_api.h or cuda_runtime.h instead."
  283. | ^~~~~~~
  284. In file included from cuda-sim.h:36,
  285. from cuda-sim.cc:32:
  286. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  287. ../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  288. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  289. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  290. | |
  291. | address_type {aka long long unsigned int}
  292. In file included from cuda-sim.h:36,
  293. from cuda-sim.cc:32:
  294. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  295. ../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  296. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  297. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  298. | |
  299. | address_type {aka long long unsigned int}
  300. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  301. from ../../libcuda/gpgpu_context.h:3,
  302. from ptx-stats.cc:32:
  303. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  304. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  305. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  306. | ~~~^ ~~
  307. | | |
  308. | | address_type {aka long long unsigned int}
  309. | unsigned int
  310. | %04llx
  311. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  312. from ../../libcuda/gpgpu_context.h:3,
  313. from ptx-stats.cc:32:
  314. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  315. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  316. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  317. | ~~~^ ~~
  318. | | |
  319. | | address_type {aka long long unsigned int}
  320. | unsigned int
  321. | %04llx
  322. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 32; FILE = _IO_FILE]’:
  323. memory.cc:182:16: required from here
  324. memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  325. 172 | fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
  326. | ~~~^ ~~~~~~~~~~~~~
  327. | | |
  328. | unsigned int long long unsigned int
  329. | %08llx
  330. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 32; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
  331. memory.cc:182:16: required from here
  332. memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  333. 112 | "addr=0x%x, length=%zu\n",
  334. | ~^
  335. | |
  336. | unsigned int
  337. | %llx
  338. 113 | m_name.c_str(), addr, length);
  339. | ~~~~
  340. | |
  341. | mem_addr_t {aka long long unsigned int}
  342. memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  343. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  344. | ~~^
  345. | |
  346. | long unsigned int
  347. | %llx
  348. 116 | "index=0x%x, BSIZE=0x%x\n",
  349. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  350. | ~~~~~~~~~~~~~~~
  351. | |
  352. | long long unsigned int
  353. memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  354. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  355. | ~^
  356. | |
  357. | unsigned int
  358. | %llx
  359. 116 | "index=0x%x, BSIZE=0x%x\n",
  360. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  361. | ~~~~~~~~~~~~~~~~~~~~~
  362. | |
  363. | mem_addr_t {aka long long unsigned int}
  364. memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  365. 116 | "index=0x%x, BSIZE=0x%x\n",
  366. | ~^
  367. | |
  368. | unsigned int
  369. | %llx
  370. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  371. | ~~~~~~~
  372. | |
  373. | mem_addr_t {aka long long unsigned int}
  374. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 64; FILE = _IO_FILE]’:
  375. memory.cc:183:16: required from here
  376. memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  377. 172 | fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
  378. | ~~~^ ~~~~~~~~~~~~~
  379. | | |
  380. | unsigned int long long unsigned int
  381. | %08llx
  382. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 64; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
  383. memory.cc:183:16: required from here
  384. memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  385. 112 | "addr=0x%x, length=%zu\n",
  386. | ~^
  387. | |
  388. | unsigned int
  389. | %llx
  390. 113 | m_name.c_str(), addr, length);
  391. | ~~~~
  392. | |
  393. | mem_addr_t {aka long long unsigned int}
  394. memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  395. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  396. | ~~^
  397. | |
  398. | long unsigned int
  399. | %llx
  400. 116 | "index=0x%x, BSIZE=0x%x\n",
  401. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  402. | ~~~~~~~~~~~~~~~
  403. | |
  404. | long long unsigned int
  405. memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  406. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  407. | ~^
  408. | |
  409. | unsigned int
  410. | %llx
  411. 116 | "index=0x%x, BSIZE=0x%x\n",
  412. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  413. | ~~~~~~~~~~~~~~~~~~~~~
  414. | |
  415. | mem_addr_t {aka long long unsigned int}
  416. memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  417. 116 | "index=0x%x, BSIZE=0x%x\n",
  418. | ~^
  419. | |
  420. | unsigned int
  421. | %llx
  422. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  423. | ~~~~~~~
  424. | |
  425. | mem_addr_t {aka long long unsigned int}
  426. In file included from cuda-sim.h:37,
  427. from cuda-sim.cc:32:
  428. ../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  429. ../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  430. 450 | std::vector<register_set *> &m_spec_cores_out;
  431. | ^~~~~~~~~~~~~~~~
  432. ../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  433. 449 | register_set *m_mem_out;
  434. | ^~~~~~~~~
  435. ../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  436. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  437. | ^~~~~~~~~~~~~~
  438. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 8192; FILE = _IO_FILE]’:
  439. memory.cc:184:16: required from here
  440. memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  441. 172 | fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
  442. | ~~~^ ~~~~~~~~~~~~~
  443. | | |
  444. | unsigned int long long unsigned int
  445. | %08llx
  446. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 8192; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
  447. memory.cc:184:16: required from here
  448. memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  449. 112 | "addr=0x%x, length=%zu\n",
  450. | ~^
  451. | |
  452. | unsigned int
  453. | %llx
  454. 113 | m_name.c_str(), addr, length);
  455. | ~~~~
  456. | |
  457. | mem_addr_t {aka long long unsigned int}
  458. memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  459. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  460. | ~~^
  461. | |
  462. | long unsigned int
  463. | %llx
  464. 116 | "index=0x%x, BSIZE=0x%x\n",
  465. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  466. | ~~~~~~~~~~~~~~~
  467. | |
  468. | long long unsigned int
  469. memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  470. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  471. | ~^
  472. | |
  473. | unsigned int
  474. | %llx
  475. 116 | "index=0x%x, BSIZE=0x%x\n",
  476. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  477. | ~~~~~~~~~~~~~~~~~~~~~
  478. | |
  479. | mem_addr_t {aka long long unsigned int}
  480. memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  481. 116 | "index=0x%x, BSIZE=0x%x\n",
  482. | ~^
  483. | |
  484. | unsigned int
  485. | %llx
  486. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  487. | ~~~~~~~
  488. | |
  489. | mem_addr_t {aka long long unsigned int}
  490. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 16384; FILE = _IO_FILE]’:
  491. memory.cc:185:16: required from here
  492. memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  493. 172 | fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
  494. | ~~~^ ~~~~~~~~~~~~~
  495. | | |
  496. | unsigned int long long unsigned int
  497. | %08llx
  498. memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 16384; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
  499. memory.cc:185:16: required from here
  500. memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  501. 112 | "addr=0x%x, length=%zu\n",
  502. | ~^
  503. | |
  504. | unsigned int
  505. | %llx
  506. 113 | m_name.c_str(), addr, length);
  507. | ~~~~
  508. | |
  509. | mem_addr_t {aka long long unsigned int}
  510. memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  511. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  512. | ~~^
  513. | |
  514. | long unsigned int
  515. | %llx
  516. 116 | "index=0x%x, BSIZE=0x%x\n",
  517. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  518. | ~~~~~~~~~~~~~~~
  519. | |
  520. | long long unsigned int
  521. memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  522. 115 | "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
  523. | ~^
  524. | |
  525. | unsigned int
  526. | %llx
  527. 116 | "index=0x%x, BSIZE=0x%x\n",
  528. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  529. | ~~~~~~~~~~~~~~~~~~~~~
  530. | |
  531. | mem_addr_t {aka long long unsigned int}
  532. memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  533. 116 | "index=0x%x, BSIZE=0x%x\n",
  534. | ~^
  535. | |
  536. | unsigned int
  537. | %llx
  538. 117 | (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
  539. | ~~~~~~~
  540. | |
  541. | mem_addr_t {aka long long unsigned int}
  542. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  543. from ../../libcuda/gpgpu_context.h:3,
  544. from ptx-stats.cc:32:
  545. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  546. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  547. 450 | std::vector<register_set *> &m_spec_cores_out;
  548. | ^~~~~~~~~~~~~~~~
  549. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  550. 449 | register_set *m_mem_out;
  551. | ^~~~~~~~~
  552. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  553. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  554. | ^~~~~~~~~~~~~~
  555. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:217,
  556. from ../ISA_Def/ampere_opcode.h:9,
  557. from trace_driven.cc:40:
  558. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  559. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  560. 67 | bool done() const { return m_updates == m_issued; }
  561. | ~~~~~~~~~~^~~~~~~~~~~
  562. In file included from ptx_ir.h:32,
  563. from cuda_device_printf.cc:30:
  564. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  565. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  566. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  567. | ~~~^ ~~
  568. | | |
  569. | | address_type {aka long long unsigned int}
  570. | unsigned int
  571. | %04llx
  572. cuda-sim.cc: In member function ‘void cuda_sim::ptx_print_insn(address_type, FILE*)’:
  573. cuda-sim.cc:548:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  574. 548 | fprintf(fp, "<no instruction at address 0x%x>", pc);
  575. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~
  576. | |
  577. | address_type {aka long long unsigned int}
  578. cuda-sim.cc: In member function ‘std::string cuda_sim::ptx_get_insn_str(address_type)’:
  579. cuda-sim.cc:562:30: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  580. 562 | snprintf(buff, STR_SIZE, "<no instruction at address 0x%x>", pc);
  581. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~
  582. | |
  583. | address_type {aka long long unsigned int}
  584. cuda-sim.cc: In member function ‘void function_info::add_param_data(unsigned int, gpgpu_ptx_sim_arg*)’:
  585. cuda-sim.cc:1374:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  586. 1374 | "GPGPU-Sim PTX: deferred allocation of shared region for \"%s\" from "
  587. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  588. 1375 | "0x%x to 0x%x (shared memory space)\n",
  589. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  590. 1376 | p->name().c_str(), m_symtab->get_shared_next(),
  591. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~
  592. | |
  593. | addr_t {aka long long unsigned int}
  594. cuda-sim.cc:1374:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  595. cuda-sim.cc: In member function ‘void function_info::list_param(FILE*) const’:
  596. cuda-sim.cc:1506:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  597. 1506 | fprintf(fout, "%s: %#08x\n", name.c_str(), param_addr);
  598. | ^~~~~~~~~~~~~ ~~~~~~~~~~
  599. | |
  600. | addr_t {aka long long unsigned int}
  601. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_parser.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser.o
  602. cuda-sim.cc: In member function ‘void function_info::ptx_jit_config(std::map<long long unsigned int, long unsigned int>, memory_space*, gpgpu_t*, dim3, dim3)’:
  603. cuda-sim.cc:1534:3: warning: NULL used in arithmetic [-Wpointer-arith]
  604. 1534 | assert(system(buff) != NULL);
  605. | ^~~~~~
  606. g++ -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_loader.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_loader.o
  607. In file included from ptx_ir.h:32,
  608. from cuda_device_printf.cc:30:
  609. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  610. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  611. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  612. | ~~~^ ~~
  613. | | |
  614. | | address_type {aka long long unsigned int}
  615. | unsigned int
  616. | %04llx
  617. cuda-sim.cc: In member function ‘void ptx_thread_info::ptx_exec_inst(warp_inst_t&, unsigned int)’:
  618. cuda-sim.cc:1879:11: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 12 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  619. 1879 | "%u [thd=%u][i=%u] : ctaid=(%u,%u,%u) tid=(%u,%u,%u) icount=%u "
  620. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  621. 1880 | "[pc=%u] (%s:%u - %s) [0x%llx]\n",
  622. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  623. 1881 | m_gpu->gpgpu_ctx->func_sim->g_ptx_sim_num_insn, get_uid(), pI->uid(),
  624. 1882 | ctaid.x, ctaid.y, ctaid.z, tid.x, tid.y, tid.z, get_icount(), pc,
  625. | ~~
  626. | |
  627. | addr_t {aka long long unsigned int}
  628. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  629. from ./../../libcuda/gpgpu_context.h:3,
  630. from ptx.y:33:
  631. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  632. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  633. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  634. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  635. | |
  636. | address_type {aka long long unsigned int}
  637. In file included from ptx_ir.h:32,
  638. from ptx_ir.cc:32:
  639. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  640. ../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  641. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  642. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  643. | |
  644. | address_type {aka long long unsigned int}
  645. In file included from ptx_sim.h:32,
  646. from ptx_sim.cc:29:
  647. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  648. ../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  649. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  650. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  651. | |
  652. | address_type {aka long long unsigned int}
  653. In file included from ptx_ir.h:32,
  654. from ptx_ir.cc:32:
  655. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  656. ../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  657. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  658. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  659. | |
  660. | address_type {aka long long unsigned int}
  661. cuda-sim.cc: In member function ‘void cuda_sim::read_sim_environment_variables()’:
  662. cuda-sim.cc:2379:20: warning: format ‘%d’ expects argument of type ‘int*’, but argument 3 has type ‘addr_t*’ {aka ‘long long unsigned int*’} [-Wformat=]
  663. 2379 | sscanf(dbg_pc, "%d", &g_debug_pc);
  664. | ^~~~ ~~~~~~~~~~~
  665. | |
  666. | addr_t* {aka long long unsigned int*}
  667. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  668. from ./../../libcuda/gpgpu_context.h:3,
  669. from ptx.y:33:
  670. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  671. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  672. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  673. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  674. | |
  675. | address_type {aka long long unsigned int}
  676. In file included from ptx_sim.h:32,
  677. from ptx_sim.cc:29:
  678. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  679. ../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  680. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  681. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  682. | |
  683. | address_type {aka long long unsigned int}
  684. In file included from ./ptx_ir.h:32,
  685. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h:2,
  686. from instructions.cc:32:
  687. ./../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  688. ./../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  689. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  690. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  691. | |
  692. | address_type {aka long long unsigned int}
  693. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  694. from ./../../libcuda/gpgpu_context.h:3,
  695. from ptxinfo.l:42:
  696. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  697. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  698. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  699. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  700. | |
  701. | address_type {aka long long unsigned int}
  702. In file included from ./ptx_ir.h:32,
  703. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h:2,
  704. from instructions.cc:32:
  705. ./../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  706. ./../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  707. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  708. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  709. | |
  710. | address_type {aka long long unsigned int}
  711. In file included from ../ISA_Def/ampere_opcode.h:9,
  712. from trace_driven.cc:40:
  713. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  714. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  715. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  716. | ~~~^ ~~
  717. | | |
  718. | | address_type {aka long long unsigned int}
  719. | unsigned int
  720. | %04llx
  721. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  722. from ./../../libcuda/gpgpu_context.h:3,
  723. from ptxinfo.l:42:
  724. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  725. ./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  726. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  727. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  728. | |
  729. | address_type {aka long long unsigned int}
  730. In file included from ./ptx_parser.h:32,
  731. from ptx.l:43:
  732. ./../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  733. ./../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  734. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  735. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  736. | |
  737. | address_type {aka long long unsigned int}
  738. In file included from ../ISA_Def/ampere_opcode.h:9,
  739. from trace_driven.cc:40:
  740. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  741. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  742. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  743. | ~~~^ ~~
  744. | | |
  745. | | address_type {aka long long unsigned int}
  746. | unsigned int
  747. | %04llx
  748. In file included from ./ptx_parser.h:32,
  749. from ptx.l:43:
  750. ./../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  751. ./../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  752. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  753. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  754. | |
  755. | address_type {aka long long unsigned int}
  756. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  757. from ../../libcuda/gpgpu_context.h:3,
  758. from cuda_device_runtime.cc:12:
  759. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  760. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  761. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  762. | ~~~^ ~~
  763. | | |
  764. | | address_type {aka long long unsigned int}
  765. | unsigned int
  766. | %04llx
  767. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  768. from ../../libcuda/gpgpu_context.h:3,
  769. from cuda_device_runtime.cc:12:
  770. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  771. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  772. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  773. | ~~~^ ~~
  774. | | |
  775. | | address_type {aka long long unsigned int}
  776. | unsigned int
  777. | %04llx
  778. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  779. from ./../../libcuda/gpgpu_context.h:3,
  780. from ptx.y:33:
  781. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  782. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  783. 450 | std::vector<register_set *> &m_spec_cores_out;
  784. | ^~~~~~~~~~~~~~~~
  785. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  786. 449 | register_set *m_mem_out;
  787. | ^~~~~~~~~
  788. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  789. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  790. | ^~~~~~~~~~~~~~
  791. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  792. from ../../libcuda/gpgpu_context.h:3,
  793. from cuda_device_runtime.cc:12:
  794. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  795. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  796. 450 | std::vector<register_set *> &m_spec_cores_out;
  797. | ^~~~~~~~~~~~~~~~
  798. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  799. 449 | register_set *m_mem_out;
  800. | ^~~~~~~~~
  801. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  802. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  803. | ^~~~~~~~~~~~~~
  804. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  805. from ./../../libcuda/gpgpu_context.h:3,
  806. from ptxinfo.l:42:
  807. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  808. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  809. 450 | std::vector<register_set *> &m_spec_cores_out;
  810. | ^~~~~~~~~~~~~~~~
  811. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  812. 449 | register_set *m_mem_out;
  813. | ^~~~~~~~~
  814. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  815. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  816. | ^~~~~~~~~~~~~~
  817. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/cuda-sim.h:37,
  818. from trace_driven.cc:48:
  819. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  820. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  821. 450 | std::vector<register_set *> &m_spec_cores_out;
  822. | ^~~~~~~~~~~~~~~~
  823. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  824. 449 | register_set *m_mem_out;
  825. | ^~~~~~~~~
  826. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  827. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  828. | ^~~~~~~~~~~~~~
  829. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  830. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  831. 1298 | if (inst.op != m_supported_op) {
  832. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  833. In file included from ptx_parser.h:32,
  834. from ptx_parser.cc:29:
  835. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  836. ../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  837. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  838. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  839. | |
  840. | address_type {aka long long unsigned int}
  841. In file included from ptx_parser.h:32,
  842. from ptx_parser.cc:29:
  843. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  844. ../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  845. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  846. | ^~~~~~~~~~~~~~~~~~~~~~ ~~
  847. | |
  848. | address_type {aka long long unsigned int}
  849. cuda-sim.cc: In member function ‘void function_info::ptx_jit_config(std::map<long long unsigned int, long unsigned int>, memory_space*, gpgpu_t*, dim3, dim3)’:
  850. cuda-sim.cc:1536:8: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  851. 1536 | fgets(buff, 1024, fp);
  852. | ~~~~~^~~~~~~~~~~~~~~~
  853. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  854. from ../../libcuda/gpgpu_context.h:3,
  855. from ptx_ir.cc:44:
  856. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  857. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  858. 450 | std::vector<register_set *> &m_spec_cores_out;
  859. | ^~~~~~~~~~~~~~~~
  860. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  861. 449 | register_set *m_mem_out;
  862. | ^~~~~~~~~
  863. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  864. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  865. | ^~~~~~~~~~~~~~
  866. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  867. from ../../libcuda/gpgpu_context.h:3,
  868. from ptx_sim.cc:34:
  869. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  870. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  871. 450 | std::vector<register_set *> &m_spec_cores_out;
  872. | ^~~~~~~~~~~~~~~~
  873. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  874. 449 | register_set *m_mem_out;
  875. | ^~~~~~~~~
  876. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  877. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  878. | ^~~~~~~~~~~~~~
  879. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  880. from ../../libcuda/gpgpu_context.h:3,
  881. from ptx_loader.cc:34:
  882. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  883. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  884. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  885. | ~~~^ ~~
  886. | | |
  887. | | address_type {aka long long unsigned int}
  888. | unsigned int
  889. | %04llx
  890. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
  891. from ../../libcuda/gpgpu_context.h:3,
  892. from ptx_loader.cc:34:
  893. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  894. ../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  895. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  896. | ~~~^ ~~
  897. | | |
  898. | | address_type {aka long long unsigned int}
  899. | unsigned int
  900. | %04llx
  901. In file included from ../gpgpu-sim/gpu-sim.h:44,
  902. from instructions.cc:51:
  903. ../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  904. ../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  905. 450 | std::vector<register_set *> &m_spec_cores_out;
  906. | ^~~~~~~~~~~~~~~~
  907. ../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  908. 449 | register_set *m_mem_out;
  909. | ^~~~~~~~~
  910. ../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  911. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  912. | ^~~~~~~~~~~~~~
  913. In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  914. from ./../../libcuda/gpgpu_context.h:3,
  915. from ptx.l:46:
  916. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  917. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  918. 450 | std::vector<register_set *> &m_spec_cores_out;
  919. | ^~~~~~~~~~~~~~~~
  920. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  921. 449 | register_set *m_mem_out;
  922. | ^~~~~~~~~
  923. ./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  924. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  925. | ^~~~~~~~~~~~~~
  926. trace_driven.cc: In member function ‘bool trace_warp_inst_t::parse_from_trace_struct(const inst_trace_t&, const std::unordered_map<std::__cxx11::basic_string<char>, OpcodeChar>*, const trace_config*, const kernel_trace_t*)’:
  927. trace_driven.cc:199:5: warning: this ‘if’ clause does not guard... [-Wmisleading-indentation]
  928. 199 | if(it2 != OpcPowerMap->end())
  929. | ^~
  930. trace_driven.cc:201:7: note: ...this statement, but the latter is misleadingly indented as if it were guarded by the ‘if’
  931. 201 | oprnd_type = get_oprnd_type(op, sp_op);
  932. | ^~~~~~~~~~
  933. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  934. from ../../libcuda/gpgpu_context.h:3,
  935. from ptx_loader.cc:34:
  936. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  937. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  938. 450 | std::vector<register_set *> &m_spec_cores_out;
  939. | ^~~~~~~~~~~~~~~~
  940. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  941. 449 | register_set *m_mem_out;
  942. | ^~~~~~~~~
  943. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  944. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  945. | ^~~~~~~~~~~~~~
  946. ptx_sim.cc: In function ‘void print_reg(FILE*, std::string, ptx_reg_t, symbol_table*)’:
  947. ptx_sim.cc:372:19: warning: format ‘%f’ expects argument of type ‘double’, but argument 3 has type ‘half_float::half’ [-Wformat=]
  948. 372 | fprintf(fp, ".f16 %f [0x%04x]\n", value.f16, (unsigned)value.u16);
  949. | ^~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~
  950. | |
  951. | half_float::half
  952. ptx_ir.cc: In member function ‘std::string ptx_instruction::to_string() const’:
  953. ptx_ir.cc:1473:59: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  954. 1473 | snprintf(buf + used_bytes, STR_SIZE - used_bytes, " PC=0x%03x ", m_PC);
  955. | ^~~~~~~~~~~~~ ~~~~
  956. | |
  957. | addr_t {aka long long unsigned int}
  958. instructions.cc: In member function ‘void ptx_thread_info::print_reg_thread(char*)’:
  959. instructions.cc:202:19: warning: format ‘%llu’ expects argument of type ‘long long unsigned int’, but argument 4 has type ‘const ptx_reg_t’ [-Wformat=]
  960. 202 | fprintf(fp, "%s %llu %s %d\n", name.c_str(), it->second, dec.c_str(),
  961. | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~
  962. | |
  963. | const ptx_reg_t
  964. instructions.cc: In function ‘void mma_impl(const ptx_instruction*, core_t*, warp_inst_t)’:
  965. instructions.cc:1951:27: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
  966. 1951 | nw_v[k].f16 = *((half *)&hex_val);
  967. | ~^~~~~~~~~~~~~~~~~
  968. instructions.cc:1951:27: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
  969. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  970. from ../../libcuda/gpgpu_context.h:3,
  971. from ptx_parser.cc:30:
  972. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  973. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  974. 450 | std::vector<register_set *> &m_spec_cores_out;
  975. | ^~~~~~~~~~~~~~~~
  976. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  977. 449 | register_set *m_mem_out;
  978. | ^~~~~~~~~
  979. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  980. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  981. | ^~~~~~~~~~~~~~
  982. ptx_loader.cc: In member function ‘void gpgpu_context::print_ptx_file(const char*, unsigned int, const char*)’:
  983. ptx_loader.cc:98:27: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  984. 98 | snprintf(pc, 64, "%4u", pI->get_PC());
  985. | ~~^ ~~~~~~~~~~~~
  986. | | |
  987. | unsigned int addr_t {aka long long unsigned int}
  988. | %4llu
  989. ptx_parser.cc: In member function ‘void ptx_recognizer::end_function()’:
  990. ptx_parser.cc:209:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  991. 209 | PTX_PARSE_DPRINTF("function %s, PC = %d\n", g_func_info->get_name().c_str(),
  992. | ^~~~~~~~~~~~~~~~~~~~~~~~
  993. 210 | g_func_info->get_start_PC());
  994. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~
  995. | |
  996. | addr_t {aka long long unsigned int}
  997. ptx_parser.cc:55:12: note: in definition of macro ‘PTX_PARSE_DPRINTF’
  998. 55 | printf(__VA_ARGS__); \
  999. | ^~~~~~~~~~~
  1000. ptx_parser.cc: In member function ‘void ptx_recognizer::add_identifier(const char*, int, unsigned int)’:
  1001. ptx_parser.cc:488:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  1002. 488 | "GPGPU-Sim PTX: allocating stack frame region for .param \"%s\" from "
  1003. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1004. 489 | "0x%x to 0x%lx\n",
  1005. | ~~~~~~~~~~~~~~~~~
  1006. 490 | identifier, g_current_symbol_table->get_local_next(),
  1007. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1008. | |
  1009. | addr_t {aka long long unsigned int}
  1010. ptx_parser.cc:488:11: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  1011. ptx_parser.cc: In member function ‘void ptx_recognizer::add_constptr(const char*, const char*, int)’:
  1012. ptx_parser.cc:524:10: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  1013. 524 | printf("GPGPU-Sim PTX: moving \"%s\" from 0x%x to 0x%x (%s+%x)\n",
  1014. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1015. 525 | identifier1, s1->get_address(), addr + offset, identifier2, offset);
  1016. | ~~~~~~~~~~~~~~~~~
  1017. | |
  1018. | addr_t {aka long long unsigned int}
  1019. make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
  1020. In file included from /usr/include/string.h:495,
  1021. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c:243:
  1022. In function ‘char* strncpy(char*, const char*, size_t)’,
  1023. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:87:24:
  1024. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ specified bound 1024 equals destination size [-Wstringop-truncation]
  1025. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1026. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1027. In function ‘char* strncpy(char*, const char*, size_t)’,
  1028. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:85:1:
  1029. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1030. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1031. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1032. In function ‘char* strncpy(char*, const char*, size_t)’,
  1033. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:84:1:
  1034. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1035. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1036. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1037. In function ‘char* strncpy(char*, const char*, size_t)’,
  1038. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:83:1:
  1039. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1040. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1041. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1042. In function ‘char* strncpy(char*, const char*, size_t)’,
  1043. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:82:1:
  1044. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1045. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1046. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1047. In function ‘char* strncpy(char*, const char*, size_t)’,
  1048. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:81:1:
  1049. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1050. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1051. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1052. In function ‘char* strncpy(char*, const char*, size_t)’,
  1053. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:80:1:
  1054. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1055. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1056. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1057. In function ‘char* strncpy(char*, const char*, size_t)’,
  1058. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:79:1:
  1059. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1060. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1061. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1062. In function ‘char* strncpy(char*, const char*, size_t)’,
  1063. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:78:1:
  1064. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1065. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1066. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1067. In function ‘char* strncpy(char*, const char*, size_t)’,
  1068. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:77:1:
  1069. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1070. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1071. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1072. In function ‘char* strncpy(char*, const char*, size_t)’,
  1073. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:75:1:
  1074. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1075. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1076. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1077. In function ‘char* strncpy(char*, const char*, size_t)’,
  1078. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:74:1:
  1079. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1080. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1081. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1082. In function ‘char* strncpy(char*, const char*, size_t)’,
  1083. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:72:1:
  1084. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1085. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1086. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1087. In function ‘char* strncpy(char*, const char*, size_t)’,
  1088. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:71:1:
  1089. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1090. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1091. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1092. In function ‘char* strncpy(char*, const char*, size_t)’,
  1093. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:70:1:
  1094. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1095. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1096. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1097. In function ‘char* strncpy(char*, const char*, size_t)’,
  1098. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:69:1:
  1099. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1100. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1101. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1102. In function ‘char* strncpy(char*, const char*, size_t)’,
  1103. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:68:1:
  1104. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1105. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1106. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1107. In function ‘char* strncpy(char*, const char*, size_t)’,
  1108. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:67:1:
  1109. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1110. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1111. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1112. In function ‘char* strncpy(char*, const char*, size_t)’,
  1113. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:66:1:
  1114. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1115. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1116. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1117. In function ‘char* strncpy(char*, const char*, size_t)’,
  1118. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:65:1:
  1119. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1120. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1121. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1122. In function ‘char* strncpy(char*, const char*, size_t)’,
  1123. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:64:1:
  1124. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1125. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1126. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1127. In function ‘char* strncpy(char*, const char*, size_t)’,
  1128. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:63:1:
  1129. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1130. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1131. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1132. In function ‘char* strncpy(char*, const char*, size_t)’,
  1133. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:62:1:
  1134. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1135. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1136. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1137. In function ‘char* strncpy(char*, const char*, size_t)’,
  1138. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:61:1:
  1139. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1140. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1141. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1142. In function ‘char* strncpy(char*, const char*, size_t)’,
  1143. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:60:1:
  1144. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1145. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1146. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1147. In function ‘char* strncpy(char*, const char*, size_t)’,
  1148. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:59:1:
  1149. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1150. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1151. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1152. In function ‘char* strncpy(char*, const char*, size_t)’,
  1153. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:58:1:
  1154. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1155. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1156. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1157. In function ‘char* strncpy(char*, const char*, size_t)’,
  1158. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:57:1:
  1159. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1160. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1161. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1162. In function ‘char* strncpy(char*, const char*, size_t)’,
  1163. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:56:1:
  1164. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1165. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1166. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1167. In function ‘char* strncpy(char*, const char*, size_t)’,
  1168. inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:55:1:
  1169. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  1170. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1171. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1172. cuda_device_runtime.cc: In member function ‘void cuda_device_runtime::gpgpusim_cuda_getParameterBufferV2(const ptx_instruction*, ptx_thread_info*, const function_info*)’:
  1173. cuda_device_runtime.cc:93:48: warning: ‘child_kernel_entry’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  1174. 93 | g_cuda_device_launch_param_map[param_buffer] = device_launch_config;
  1175. cuda_device_runtime.cc: In member function ‘void cuda_device_runtime::gpgpusim_cuda_launchDeviceV2(const ptx_instruction*, ptx_thread_info*, const function_info*)’:
  1176. cuda_device_runtime.cc:126:29: warning: ‘device_launch_op’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  1177. 126 | device_launch_operation_t device_launch_op;
  1178. | ^~~~~~~~~~~~~~~~
  1179. In file included from /usr/include/string.h:495,
  1180. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c:243:
  1181. In function ‘char* strncpy(char*, const char*, size_t)’,
  1182. inlined from ‘int ptx_lex(YYSTYPE*, yyscan_t, ptx_recognizer*)’ at ptx.l:436:27:
  1183. /usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ specified bound 4096 equals destination size [-Wstringop-truncation]
  1184. 106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
  1185. | ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1186. ptx_loader.cc: In member function ‘void gpgpu_context::gpgpu_ptx_info_load_from_filename(const char*, unsigned int)’:
  1187. ptx_loader.cc:357:37: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  1188. 357 | "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file /dev/null 2> %s",
  1189. | ^~
  1190. 358 | extra_flags, filename, ptxas_filename.c_str());
  1191. | ~~~~~~~~~~~
  1192. In file included from /usr/include/stdio.h:867,
  1193. from /usr/include/c++/9/cstdio:42,
  1194. from /usr/include/c++/9/ext/string_conversions.h:43,
  1195. from /usr/include/c++/9/bits/basic_string.h:6496,
  1196. from /usr/include/c++/9/string:55,
  1197. from ptx_loader.h:31,
  1198. from ptx_loader.cc:29:
  1199. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 63 or more bytes (assuming 1086) into a destination of size 1024
  1200. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1201. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1202. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1203. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1204. ptx_loader.cc: In function ‘char* get_app_binary_name()’:
  1205. ptx_loader.cc:341:25: warning: ‘self_exe_path’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  1206. 341 | self_exe_path = strtok(self_exe_path, ".");
  1207. | ~~~~~~^~~~~~~~~~~~~~~~~~~~
  1208. ptx_loader.cc: In member function ‘char* ptxinfo_data::gpgpu_ptx_sim_convert_ptx_and_sass_to_ptxplus(std::string, std::string, std::string)’:
  1209. ptx_loader.cc:125:43: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 941 [-Wformat-truncation=]
  1210. 125 | "cuobjdump_to_ptxplus %s %s %s %s",
  1211. | ^~
  1212. 126 | ptxfilename.c_str(), sassfilename.c_str(), elffilename.c_str(),
  1213. 127 | fname_ptxplus);
  1214. | ~~~~~~~~~~~~~
  1215. In file included from /usr/include/stdio.h:867,
  1216. from /usr/include/c++/9/cstdio:42,
  1217. from /usr/include/c++/9/ext/string_conversions.h:43,
  1218. from /usr/include/c++/9/bits/basic_string.h:6496,
  1219. from /usr/include/c++/9/string:55,
  1220. from ptx_loader.h:31,
  1221. from ptx_loader.cc:29:
  1222. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 84 or more bytes (assuming 1107) into a destination of size 1024
  1223. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1224. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1225. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1226. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1227. ptx_loader.cc:152:43: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  1228. 152 | snprintf(rm_commandline, 1024, "rm -f %s", fname_ptxplus);
  1229. | ^~ ~~~~~~~~~~~~~
  1230. In file included from /usr/include/stdio.h:867,
  1231. from /usr/include/c++/9/cstdio:42,
  1232. from /usr/include/c++/9/ext/string_conversions.h:43,
  1233. from /usr/include/c++/9/bits/basic_string.h:6496,
  1234. from /usr/include/c++/9/string:55,
  1235. from ptx_loader.h:31,
  1236. from ptx_loader.cc:29:
  1237. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 7 and 1030 bytes into a destination of size 1024
  1238. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1239. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1240. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1241. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1242. ptx_loader.cc: In function ‘void fix_duplicate_errors(char*)’:
  1243. ptx_loader.cc:217:38: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1020 [-Wformat-truncation=]
  1244. 217 | snprintf(commandline, 1024, "mv %s %s", fname2, tempfile);
  1245. | ^~ ~~~~~~~~
  1246. In file included from /usr/include/stdio.h:867,
  1247. from /usr/include/c++/9/cstdio:42,
  1248. from /usr/include/c++/9/ext/string_conversions.h:43,
  1249. from /usr/include/c++/9/bits/basic_string.h:6496,
  1250. from /usr/include/c++/9/string:55,
  1251. from ptx_loader.h:31,
  1252. from ptx_loader.cc:29:
  1253. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 5 or more bytes (assuming 1028) into a destination of size 1024
  1254. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1255. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1256. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1257. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1258. ptx_loader.cc:311:38: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  1259. 311 | snprintf(commandline, 1024, "rm -f %s", tempfile);
  1260. | ^~ ~~~~~~~~
  1261. In file included from /usr/include/stdio.h:867,
  1262. from /usr/include/c++/9/cstdio:42,
  1263. from /usr/include/c++/9/ext/string_conversions.h:43,
  1264. from /usr/include/c++/9/bits/basic_string.h:6496,
  1265. from /usr/include/c++/9/string:55,
  1266. from ptx_loader.h:31,
  1267. from ptx_loader.cc:29:
  1268. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 7 and 1030 bytes into a destination of size 1024
  1269. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1270. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1271. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1272. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1273. ptx_loader.cc:269:55: warning: ‘funcptr’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  1274. 269 | fwrite(startptr, sizeof(char), funcptr - offset + 1 - startptr, ptxdest);
  1275. | ~~~~~~~~~~~~~~~~~^~~
  1276. ptx_loader.cc: In member function ‘void gpgpu_context::gpgpu_ptxinfo_load_from_string(const char*, unsigned int, unsigned int, int)’:
  1277. ptx_loader.cc:421:41: warning: ‘info’ directive output may be truncated writing 4 bytes into a region of size between 1 and 1024 [-Wformat-truncation=]
  1278. 421 | snprintf(tempfile_ptxinfo, 1024, "%sinfo", fname);
  1279. | ^~~~
  1280. In file included from /usr/include/stdio.h:867,
  1281. from /usr/include/c++/9/cstdio:42,
  1282. from /usr/include/c++/9/ext/string_conversions.h:43,
  1283. from /usr/include/c++/9/bits/basic_string.h:6496,
  1284. from /usr/include/c++/9/string:55,
  1285. from ptx_loader.h:31,
  1286. from ptx_loader.cc:29:
  1287. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 5 and 1028 bytes into a destination of size 1024
  1288. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1289. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1290. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1291. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1292. ptx_loader.cc:444:50: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 989 [-Wformat-truncation=]
  1293. 444 | "$PTXAS_CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file "
  1294. | ^~
  1295. 445 | "/dev/null 2> %s",
  1296. 446 | extra_flags, fname2, tempfile_ptxinfo);
  1297. | ~~~~~~~~~~~
  1298. In file included from /usr/include/stdio.h:867,
  1299. from /usr/include/c++/9/cstdio:42,
  1300. from /usr/include/c++/9/ext/string_conversions.h:43,
  1301. from /usr/include/c++/9/bits/basic_string.h:6496,
  1302. from /usr/include/c++/9/string:55,
  1303. from ptx_loader.h:31,
  1304. from ptx_loader.cc:29:
  1305. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 69 and 3138 bytes into a destination of size 1024
  1306. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1307. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1308. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1309. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1310. ptx_loader.cc:463:48: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  1311. 463 | "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file "
  1312. | ^~
  1313. 464 | "/dev/null 2> %s",
  1314. 465 | extra_flags, fname2, tempfile_ptxinfo);
  1315. | ~~~~~~~~~~~
  1316. In file included from /usr/include/stdio.h:867,
  1317. from /usr/include/c++/9/cstdio:42,
  1318. from /usr/include/c++/9/ext/string_conversions.h:43,
  1319. from /usr/include/c++/9/bits/basic_string.h:6496,
  1320. from /usr/include/c++/9/string:55,
  1321. from ptx_loader.h:31,
  1322. from ptx_loader.cc:29:
  1323. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 63 and 3132 bytes into a destination of size 1024
  1324. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1325. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1326. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1327. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1328. ptx_loader.cc:512:41: warning: ‘info’ directive output may be truncated writing 4 bytes into a region of size between 1 and 1024 [-Wformat-truncation=]
  1329. 512 | snprintf(tempfile_ptxinfo, 1024, "%sinfo", fname);
  1330. | ^~~~
  1331. In file included from /usr/include/stdio.h:867,
  1332. from /usr/include/c++/9/cstdio:42,
  1333. from /usr/include/c++/9/ext/string_conversions.h:43,
  1334. from /usr/include/c++/9/bits/basic_string.h:6496,
  1335. from /usr/include/c++/9/string:55,
  1336. from ptx_loader.h:31,
  1337. from ptx_loader.cc:29:
  1338. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 5 and 1028 bytes into a destination of size 1024
  1339. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1340. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1341. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1342. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1343. ptx_loader.cc:527:39: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  1344. 527 | "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file /dev/null 2> %s",
  1345. | ^~
  1346. 528 | extra_flags, fname2, tempfile_ptxinfo);
  1347. | ~~~~~~~~~~~
  1348. In file included from /usr/include/stdio.h:867,
  1349. from /usr/include/c++/9/cstdio:42,
  1350. from /usr/include/c++/9/ext/string_conversions.h:43,
  1351. from /usr/include/c++/9/bits/basic_string.h:6496,
  1352. from /usr/include/c++/9/string:55,
  1353. from ptx_loader.h:31,
  1354. from ptx_loader.cc:29:
  1355. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 63 and 3132 bytes into a destination of size 1024
  1356. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1357. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1358. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1359. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1360. ptx_loader.cc:575:42: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  1361. 575 | snprintf(commandline, 1024, "rm -f %s %s %s", fname, fname2,
  1362. | ^~ ~~~~~
  1363. In file included from /usr/include/stdio.h:867,
  1364. from /usr/include/c++/9/cstdio:42,
  1365. from /usr/include/c++/9/ext/string_conversions.h:43,
  1366. from /usr/include/c++/9/bits/basic_string.h:6496,
  1367. from /usr/include/c++/9/string:55,
  1368. from ptx_loader.h:31,
  1369. from ptx_loader.cc:29:
  1370. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 9 and 3078 bytes into a destination of size 1024
  1371. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1372. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1373. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1374. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1375. ptx_loader.cc:572:42: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  1376. 572 | snprintf(commandline, 1024, "rm -f %s %s %s", fname, fname2,
  1377. | ^~ ~~~~~
  1378. In file included from /usr/include/stdio.h:867,
  1379. from /usr/include/c++/9/cstdio:42,
  1380. from /usr/include/c++/9/ext/string_conversions.h:43,
  1381. from /usr/include/c++/9/bits/basic_string.h:6496,
  1382. from /usr/include/c++/9/string:55,
  1383. from ptx_loader.h:31,
  1384. from ptx_loader.cc:29:
  1385. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 9 and 3078 bytes into a destination of size 1024
  1386. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  1387. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1388. 68 | __bos (__s), __fmt, __va_arg_pack ());
  1389. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1390. make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
  1391. ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/libgpgpu_ptx_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_loader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_printf.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_ir.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/memory.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx-stats.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/decuda_pred_table.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o 
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_runtime.o
  1392. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
  1393. make -C ./src/gpgpu-sim/ depend
  1394. make -C ./libcuda/ depend
  1395. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  1396. make -C ./cuobjdump_to_ptxplus/ depend
  1397. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
  1398. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
  1399. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/Makefile.makedepend
  1400. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/Makefile.makedepend cuobjdumpInst.cc cuobjdumpInstList.cc cuobjdump_to_ptxplus.cc 2> /dev/null
  1401. make[2]: 'depend' is up to date.
  1402. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
  1403. make -C ./cuobjdump_to_ptxplus/
  1404. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
  1405. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInst.o cuobjdumpInst.cc
  1406. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInstList.o cuobjdumpInstList.cc
  1407. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus.o cuobjdump_to_ptxplus.cc
  1408. bison -t -d --report=all --verbose --name-prefix=ptx_ -v ptx.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx
  1409. flex -B -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.c ptx.l
  1410. bison -t -d --report=all --verbose -p sass_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc sass.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc
  1411. bison -t -d --report=all --verbose -p elf_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc elf.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc
  1412. bison -t -d --report=all --verbose -p header_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc header.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc
  1413. ptx.l:162: undeclared start condition IN_INST
  1414. ptx.y: warning: 1 nonterminal useless in grammar [-Wother]
  1415. ptx.y: warning: 2 rules useless in grammar [-Wother]
  1416. ptx.y:544.1-7: warning: nonterminal useless in grammar: vp_spec [-Wother]
  1417. 544 | vp_spec: WMMA_DIRECTIVE LAYOUT CONFIGURATION{add_space_spec(global_space,0);add_ptr_spec(global_space);add_wmma_option($1);add_wmma_option($2);add_wmma_option($3);}
  1418. ptx.l:215: warning, rule cannot be matched
  1419. | ^~~~~~~
  1420. ptx.y: warning: 57 reduce/reduce conflicts [-Wconflicts-rr]
  1421. ptx.y:266.11-88: warning: rule useless in parser due to conflicts [-Wother]
  1422. 266 | | WEAK_DIRECTIVE FUNC_DIRECTIVE { $$ = 0; g_func_decl=1; func_header(".func"); }
  1423. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  1424. :
  1425. flex -B -P elf_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.cc elf.l
  1426. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc
  1427. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.cc
  1428. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc
  1429. :
  1430. flex -B -P header_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.cc header.l
  1431. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.cc
  1432. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
  1433. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
  1434. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend
  1435. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/ cuda_runtime_api.cc 2> /dev/null
  1436. make[2]: 'depend' is up to date.
  1437. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
  1438. make[2]: 'depend' is up to date.
  1439. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  1440. make -C ./libcuda/
  1441. make -C ./src/gpgpu-sim/
  1442. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
  1443. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  1444. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend
  1445. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/ cuda_runtime_api.cc 2> /dev/null
  1446. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
  1447. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
  1448. g++ -std=c++0x -O3 -g -Wall -Wno-unused-function -Wno-sign-compare -fPIC -DCUDART_VERSION=11060 -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include -c cuda_runtime_api.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o
  1449. bison -t -d -v --report=all -p cuobjdump_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.c cuobjdump.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump
  1450. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o -c addrdec.cc
  1451. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o -c dram.cc
  1452. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o -c dram_sched.cc
  1453. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o -c gpu-cache.cc
  1454. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o -c gpu-misc.cc
  1455. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o -c gpu-sim.cc
  1456. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o -c hashing.cc
  1457. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o -c histogram.cc
  1458. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o -c icnt_wrapper.cc
  1459. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o -c l2cache.cc
  1460. :
  1461. flex -B -P cuobjdump_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.c cuobjdump.l
  1462. g++ -Wall -Wno-unused-function -Wno-sign-compare -fPIC -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
  1463. :
  1464. flex -B -P sass_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.cc sass.l
  1465. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc
  1466. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o -c local_interconnect.cc
  1467. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o -c mem_fetch.cc
  1468. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o -c mem_latency_stat.cc
  1469. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.o
  1470. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.o
  1471. g++ -Wall -Wno-unused-function -Wno-sign-compare -fPIC -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o
  1472. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o -c power_interface.cc
  1473. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.cc
  1474. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o -c power_stat.cc
  1475. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o -c scoreboard.cc
  1476. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o -c shader.cc
  1477. In file included from cuda_runtime_api.cc:127:
  1478. /usr/local/cuda-11.6/include/host_defines.h:54:2: warning: #warning "host_defines.h is an internal header file and must not be used directly. This file will be removed in a future CUDA release. Please use cuda_runtime_api.h or cuda_runtime.h instead." [-Wcpp]
  1479. 54 | #warning "host_defines.h is an internal header file and must not be used directly. This file will be removed in a future CUDA release. Please use cuda_runtime_api.h or cuda_runtime.h instead."
  1480. | ^~~~~~~
  1481. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o -c stack.cc
  1482. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o -c stat-tool.cc
  1483. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o -c traffic_breakdown.cc
  1484. g++ -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o -c visualizer.cc
  1485. cuobjdumpInstList.cc: In member function ‘std::string cuobjdumpInstList::parseCuobjdumpRegister(std::string, bool, int)’:
  1486. cuobjdumpInstList.cc:508:21: warning: format not a string literal and no format arguments [-Wformat-security]
  1487. 508 | printf(reg.c_str());
  1488. | ^
  1489. gpu-sim.cc:83: warning: "MAX" redefined
  1490. 83 | #define MAX(a, b) (((a) > (b)) ? (a) : (b))
  1491. |
  1492. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/cacti_interface.h:42,
  1493. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/area.h:37,
  1494. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/parameter.h:37,
  1495. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/basic_components.h:37,
  1496. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/array.h:37,
  1497. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/processor.h:44,
  1498. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/gpgpu_sim_wrapper.h:41,
  1499. from power_interface.h:38,
  1500. from gpu-sim.cc:72:
  1501. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/const.h:105: note: this is the location of the previous definition
  1502. 105 | #define MAX(a,b) (((a)>(b))?(a):(b))
  1503. |
  1504. In file included from ../abstract_hardware_model.h:217,
  1505. from gpu-cache.h:36,
  1506. from gpu-cache.cc:32:
  1507. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1508. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1509. 67 | bool done() const { return m_updates == m_issued; }
  1510. | ~~~~~~~~~~^~~~~~~~~~~
  1511. cuobjdump_to_ptxplus.cc: In function ‘void output(const char*)’:
  1512. cuobjdump_to_ptxplus.cc:57:27: warning: format not a string literal and no format arguments [-Wformat-security]
  1513. 57 | fprintf(ptxplus_out, text);
  1514. | ^
  1515. In file included from ../abstract_hardware_model.h:217,
  1516. from l2cache.cc:38:
  1517. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1518. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1519. 67 | bool done() const { return m_updates == m_issued; }
  1520. | ~~~~~~~~~~^~~~~~~~~~~
  1521. In file included from gpu-cache.h:36,
  1522. from gpu-cache.cc:32:
  1523. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1524. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1525. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1526. | ~~~^ ~~
  1527. | | |
  1528. | | address_type {aka long long unsigned int}
  1529. | unsigned int
  1530. | %04llx
  1531. In file included from gpu-cache.h:36,
  1532. from gpu-cache.cc:32:
  1533. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1534. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1535. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1536. | ~~~^ ~~
  1537. | | |
  1538. | | address_type {aka long long unsigned int}
  1539. | unsigned int
  1540. | %04llx
  1541. In file included from l2cache.cc:38:
  1542. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1543. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1544. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1545. | ~~~^ ~~
  1546. | | |
  1547. | | address_type {aka long long unsigned int}
  1548. | unsigned int
  1549. | %04llx
  1550. In file included from ../abstract_hardware_model.h:217,
  1551. from gpu-sim.h:39,
  1552. from dram_sched.h:36,
  1553. from dram_sched.cc:29:
  1554. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1555. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1556. 67 | bool done() const { return m_updates == m_issued; }
  1557. | ~~~~~~~~~~^~~~~~~~~~~
  1558. In file included from l2cache.cc:38:
  1559. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1560. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1561. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1562. | ~~~^ ~~
  1563. | | |
  1564. | | address_type {aka long long unsigned int}
  1565. | unsigned int
  1566. | %04llx
  1567. In file included from ../abstract_hardware_model.h:217,
  1568. from mem_latency_stat.cc:31:
  1569. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1570. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1571. 67 | bool done() const { return m_updates == m_issued; }
  1572. | ~~~~~~~~~~^~~~~~~~~~~
  1573. In file included from gpu-sim.h:39,
  1574. from dram_sched.h:36,
  1575. from dram_sched.cc:29:
  1576. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1577. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1578. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1579. | ~~~^ ~~
  1580. | | |
  1581. | | address_type {aka long long unsigned int}
  1582. | unsigned int
  1583. | %04llx
  1584. In file included from gpu-sim.h:39,
  1585. from dram_sched.h:36,
  1586. from dram_sched.cc:29:
  1587. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1588. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1589. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1590. | ~~~^ ~~
  1591. | | |
  1592. | | address_type {aka long long unsigned int}
  1593. | unsigned int
  1594. | %04llx
  1595. In file included from mem_latency_stat.cc:31:
  1596. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1597. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1598. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1599. | ~~~^ ~~
  1600. | | |
  1601. | | address_type {aka long long unsigned int}
  1602. | unsigned int
  1603. | %04llx
  1604. In file included from mem_latency_stat.cc:31:
  1605. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1606. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1607. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1608. | ~~~^ ~~
  1609. | | |
  1610. | | address_type {aka long long unsigned int}
  1611. | unsigned int
  1612. | %04llx
  1613. In file included from ../abstract_hardware_model.h:217,
  1614. from hashing.cc:6:
  1615. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1616. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1617. 67 | bool done() const { return m_updates == m_issued; }
  1618. | ~~~~~~~~~~^~~~~~~~~~~
  1619. In file included from ../abstract_hardware_model.h:217,
  1620. from addrdec.h:37,
  1621. from addrdec.cc:29:
  1622. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1623. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1624. 67 | bool done() const { return m_updates == m_issued; }
  1625. | ~~~~~~~~~~^~~~~~~~~~~
  1626. In file included from gpu-sim.h:44,
  1627. from dram_sched.h:36,
  1628. from dram_sched.cc:29:
  1629. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  1630. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  1631. 450 | std::vector<register_set *> &m_spec_cores_out;
  1632. | ^~~~~~~~~~~~~~~~
  1633. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  1634. 449 | register_set *m_mem_out;
  1635. | ^~~~~~~~~
  1636. shader.h:344:3: warning: when initialized here [-Wreorder]
  1637. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  1638. | ^~~~~~~~~~~~~~
  1639. In file included from ../abstract_hardware_model.h:217,
  1640. from mem_fetch.h:33,
  1641. from mem_fetch.cc:29:
  1642. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1643. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1644. 67 | bool done() const { return m_updates == m_issued; }
  1645. | ~~~~~~~~~~^~~~~~~~~~~
  1646. In file included from gpu-sim.h:44,
  1647. from gpu-cache.cc:34:
  1648. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  1649. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  1650. 450 | std::vector<register_set *> &m_spec_cores_out;
  1651. | ^~~~~~~~~~~~~~~~
  1652. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  1653. 449 | register_set *m_mem_out;
  1654. | ^~~~~~~~~
  1655. shader.h:344:3: warning: when initialized here [-Wreorder]
  1656. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  1657. | ^~~~~~~~~~~~~~
  1658. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  1659. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  1660. 1298 | if (inst.op != m_supported_op) {
  1661. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  1662. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  1663. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  1664. 1298 | if (inst.op != m_supported_op) {
  1665. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  1666. In file included from addrdec.h:37,
  1667. from addrdec.cc:29:
  1668. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1669. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1670. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1671. | ~~~^ ~~
  1672. | | |
  1673. | | address_type {aka long long unsigned int}
  1674. | unsigned int
  1675. | %04llx
  1676. In file included from gpu-sim.h:44,
  1677. from l2cache.cc:43:
  1678. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  1679. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  1680. 450 | std::vector<register_set *> &m_spec_cores_out;
  1681. | ^~~~~~~~~~~~~~~~
  1682. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  1683. 449 | register_set *m_mem_out;
  1684. | ^~~~~~~~~
  1685. shader.h:344:3: warning: when initialized here [-Wreorder]
  1686. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  1687. | ^~~~~~~~~~~~~~
  1688. In file included from addrdec.h:37,
  1689. from addrdec.cc:29:
  1690. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1691. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1692. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1693. | ~~~^ ~~
  1694. | | |
  1695. | | address_type {aka long long unsigned int}
  1696. | unsigned int
  1697. | %04llx
  1698. In file included from mem_fetch.h:33,
  1699. from mem_fetch.cc:29:
  1700. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1701. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1702. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1703. | ~~~^ ~~
  1704. | | |
  1705. | | address_type {aka long long unsigned int}
  1706. | unsigned int
  1707. | %04llx
  1708. In file included from mem_fetch.h:33,
  1709. from mem_fetch.cc:29:
  1710. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1711. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1712. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1713. | ~~~^ ~~
  1714. | | |
  1715. | | address_type {aka long long unsigned int}
  1716. | unsigned int
  1717. | %04llx
  1718. In file included from ../abstract_hardware_model.h:217,
  1719. from stack.h:32,
  1720. from stack.cc:29:
  1721. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1722. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1723. 67 | bool done() const { return m_updates == m_issued; }
  1724. | ~~~~~~~~~~^~~~~~~~~~~
  1725. In file included from ../abstract_hardware_model.h:217,
  1726. from mem_fetch.h:33,
  1727. from traffic_breakdown.cc:2:
  1728. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1729. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1730. 67 | bool done() const { return m_updates == m_issued; }
  1731. | ~~~~~~~~~~^~~~~~~~~~~
  1732. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  1733. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  1734. 1298 | if (inst.op != m_supported_op) {
  1735. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  1736. In file included from hashing.cc:6:
  1737. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1738. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1739. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1740. | ~~~^ ~~
  1741. | | |
  1742. | | address_type {aka long long unsigned int}
  1743. | unsigned int
  1744. | %04llx
  1745. In file included from hashing.cc:6:
  1746. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1747. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1748. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1749. | ~~~^ ~~
  1750. | | |
  1751. | | address_type {aka long long unsigned int}
  1752. | unsigned int
  1753. | %04llx
  1754. In file included from ../abstract_hardware_model.h:217,
  1755. from shader.h:50,
  1756. from shader.cc:32:
  1757. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1758. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1759. 67 | bool done() const { return m_updates == m_issued; }
  1760. | ~~~~~~~~~~^~~~~~~~~~~
  1761. In file included from ../abstract_hardware_model.h:217,
  1762. from gpu-sim.h:39,
  1763. from gpu-sim.cc:32:
  1764. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1765. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1766. 67 | bool done() const { return m_updates == m_issued; }
  1767. | ~~~~~~~~~~^~~~~~~~~~~
  1768. In file included from ../abstract_hardware_model.h:217,
  1769. from stat-tool.h:32,
  1770. from stat-tool.cc:29:
  1771. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1772. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1773. 67 | bool done() const { return m_updates == m_issued; }
  1774. | ~~~~~~~~~~^~~~~~~~~~~
  1775. In file included from ../abstract_hardware_model.h:217,
  1776. from scoreboard.h:38,
  1777. from scoreboard.cc:29:
  1778. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1779. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1780. 67 | bool done() const { return m_updates == m_issued; }
  1781. | ~~~~~~~~~~^~~~~~~~~~~
  1782. In file included from stack.h:32,
  1783. from stack.cc:29:
  1784. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1785. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1786. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1787. | ~~~^ ~~
  1788. | | |
  1789. | | address_type {aka long long unsigned int}
  1790. | unsigned int
  1791. | %04llx
  1792. In file included from mem_fetch.h:33,
  1793. from traffic_breakdown.cc:2:
  1794. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1795. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1796. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1797. | ~~~^ ~~
  1798. | | |
  1799. | | address_type {aka long long unsigned int}
  1800. | unsigned int
  1801. | %04llx
  1802. In file included from mem_fetch.h:33,
  1803. from traffic_breakdown.cc:2:
  1804. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1805. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1806. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1807. | ~~~^ ~~
  1808. | | |
  1809. | | address_type {aka long long unsigned int}
  1810. | unsigned int
  1811. | %04llx
  1812. In file included from stack.h:32,
  1813. from stack.cc:29:
  1814. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1815. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1816. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1817. | ~~~^ ~~
  1818. | | |
  1819. | | address_type {aka long long unsigned int}
  1820. | unsigned int
  1821. | %04llx
  1822. In file included from ../abstract_hardware_model.h:217,
  1823. from gpu-sim.h:39,
  1824. from dram_sched.h:36,
  1825. from dram.cc:33:
  1826. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1827. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1828. 67 | bool done() const { return m_updates == m_issued; }
  1829. | ~~~~~~~~~~^~~~~~~~~~~
  1830. In file included from gpu-sim.h:44,
  1831. from mem_latency_stat.cc:36:
  1832. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  1833. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  1834. 450 | std::vector<register_set *> &m_spec_cores_out;
  1835. | ^~~~~~~~~~~~~~~~
  1836. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  1837. 449 | register_set *m_mem_out;
  1838. | ^~~~~~~~~
  1839. shader.h:344:3: warning: when initialized here [-Wreorder]
  1840. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  1841. | ^~~~~~~~~~~~~~
  1842. In file included from ../src/cuda-sim/cuda-sim.h:36,
  1843. from gpgpu_context.h:3,
  1844. from cuda_runtime_api.cc:136:
  1845. ../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1846. ../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1847. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1848. | ~~~^ ~~
  1849. | | |
  1850. | | address_type {aka long long unsigned int}
  1851. | unsigned int
  1852. | %04llx
  1853. In file included from ../src/cuda-sim/cuda-sim.h:36,
  1854. from gpgpu_context.h:3,
  1855. from cuda_runtime_api.cc:136:
  1856. ../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1857. ../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1858. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1859. | ~~~^ ~~
  1860. | | |
  1861. | | address_type {aka long long unsigned int}
  1862. | unsigned int
  1863. | %04llx
  1864. In file included from ../abstract_hardware_model.h:217,
  1865. from mem_fetch.h:33,
  1866. from local_interconnect.cc:38:
  1867. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1868. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1869. 67 | bool done() const { return m_updates == m_issued; }
  1870. | ~~~~~~~~~~^~~~~~~~~~~
  1871. In file included from ../abstract_hardware_model.h:217,
  1872. from gpu-sim.h:39,
  1873. from power_interface.h:34,
  1874. from power_interface.cc:32:
  1875. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1876. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1877. 67 | bool done() const { return m_updates == m_issued; }
  1878. | ~~~~~~~~~~^~~~~~~~~~~
  1879. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  1880. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  1881. 1298 | if (inst.op != m_supported_op) {
  1882. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  1883. In file included from ../abstract_hardware_model.h:217,
  1884. from gpu-sim.h:39,
  1885. from power_stat.h:36,
  1886. from power_stat.cc:31:
  1887. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1888. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1889. 67 | bool done() const { return m_updates == m_issued; }
  1890. | ~~~~~~~~~~^~~~~~~~~~~
  1891. In file included from gpu-sim.h:39,
  1892. from dram_sched.h:36,
  1893. from dram.cc:33:
  1894. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1895. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1896. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1897. | ~~~^ ~~
  1898. | | |
  1899. | | address_type {aka long long unsigned int}
  1900. | unsigned int
  1901. | %04llx
  1902. In file included from scoreboard.h:38,
  1903. from scoreboard.cc:29:
  1904. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1905. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1906. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1907. | ~~~^ ~~
  1908. | | |
  1909. | | address_type {aka long long unsigned int}
  1910. | unsigned int
  1911. | %04llx
  1912. In file included from stat-tool.h:32,
  1913. from stat-tool.cc:29:
  1914. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1915. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1916. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1917. | ~~~^ ~~
  1918. | | |
  1919. | | address_type {aka long long unsigned int}
  1920. | unsigned int
  1921. | %04llx
  1922. In file included from gpu-sim.h:39,
  1923. from dram_sched.h:36,
  1924. from dram.cc:33:
  1925. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1926. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1927. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1928. | ~~~^ ~~
  1929. | | |
  1930. | | address_type {aka long long unsigned int}
  1931. | unsigned int
  1932. | %04llx
  1933. In file included from shader.h:50,
  1934. from shader.cc:32:
  1935. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1936. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1937. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1938. | ~~~^ ~~
  1939. | | |
  1940. | | address_type {aka long long unsigned int}
  1941. | unsigned int
  1942. | %04llx
  1943. In file included from gpu-sim.h:39,
  1944. from gpu-sim.cc:32:
  1945. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  1946. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1947. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1948. | ~~~^ ~~
  1949. | | |
  1950. | | address_type {aka long long unsigned int}
  1951. | unsigned int
  1952. | %04llx
  1953. In file included from scoreboard.h:38,
  1954. from scoreboard.cc:29:
  1955. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1956. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1957. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1958. | ~~~^ ~~
  1959. | | |
  1960. | | address_type {aka long long unsigned int}
  1961. | unsigned int
  1962. | %04llx
  1963. In file included from stat-tool.h:32,
  1964. from stat-tool.cc:29:
  1965. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1966. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1967. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1968. | ~~~^ ~~
  1969. | | |
  1970. | | address_type {aka long long unsigned int}
  1971. | unsigned int
  1972. | %04llx
  1973. In file included from shader.h:50,
  1974. from shader.cc:32:
  1975. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1976. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1977. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1978. | ~~~^ ~~
  1979. | | |
  1980. | | address_type {aka long long unsigned int}
  1981. | unsigned int
  1982. | %04llx
  1983. In file included from gpu-sim.h:39,
  1984. from gpu-sim.cc:32:
  1985. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  1986. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  1987. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  1988. | ~~~^ ~~
  1989. | | |
  1990. | | address_type {aka long long unsigned int}
  1991. | unsigned int
  1992. | %04llx
  1993. In file included from ../abstract_hardware_model.h:217,
  1994. from gpu-sim.h:39,
  1995. from visualizer.cc:32:
  1996. ../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  1997. ../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  1998. 67 | bool done() const { return m_updates == m_issued; }
  1999. | ~~~~~~~~~~^~~~~~~~~~~
  2000. In file included from gpu-sim.h:44,
  2001. from mem_fetch.cc:30:
  2002. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2003. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2004. 450 | std::vector<register_set *> &m_spec_cores_out;
  2005. | ^~~~~~~~~~~~~~~~
  2006. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2007. 449 | register_set *m_mem_out;
  2008. | ^~~~~~~~~
  2009. shader.h:344:3: warning: when initialized here [-Wreorder]
  2010. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2011. | ^~~~~~~~~~~~~~
  2012. In file included from gpu-sim.h:44,
  2013. from addrdec.cc:33:
  2014. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2015. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2016. 450 | std::vector<register_set *> &m_spec_cores_out;
  2017. | ^~~~~~~~~~~~~~~~
  2018. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2019. 449 | register_set *m_mem_out;
  2020. | ^~~~~~~~~
  2021. shader.h:344:3: warning: when initialized here [-Wreorder]
  2022. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2023. | ^~~~~~~~~~~~~~
  2024. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2025. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2026. 1298 | if (inst.op != m_supported_op) {
  2027. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2028. In file included from ../src/cuda-sim/cuda-sim.h:37,
  2029. from gpgpu_context.h:3,
  2030. from cuda_runtime_api.cc:136:
  2031. ../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2032. ../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2033. 450 | std::vector<register_set *> &m_spec_cores_out;
  2034. | ^~~~~~~~~~~~~~~~
  2035. ../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2036. 449 | register_set *m_mem_out;
  2037. | ^~~~~~~~~
  2038. ../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2039. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2040. | ^~~~~~~~~~~~~~
  2041. In file included from mem_fetch.h:33,
  2042. from local_interconnect.cc:38:
  2043. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2044. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2045. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2046. | ~~~^ ~~
  2047. | | |
  2048. | | address_type {aka long long unsigned int}
  2049. | unsigned int
  2050. | %04llx
  2051. In file included from gpu-sim.h:39,
  2052. from power_interface.h:34,
  2053. from power_interface.cc:32:
  2054. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2055. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2056. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2057. | ~~~^ ~~
  2058. | | |
  2059. | | address_type {aka long long unsigned int}
  2060. | unsigned int
  2061. | %04llx
  2062. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2063. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2064. 1298 | if (inst.op != m_supported_op) {
  2065. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2066. In file included from mem_fetch.h:33,
  2067. from local_interconnect.cc:38:
  2068. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2069. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2070. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2071. | ~~~^ ~~
  2072. | | |
  2073. | | address_type {aka long long unsigned int}
  2074. | unsigned int
  2075. | %04llx
  2076. In file included from gpu-sim.h:39,
  2077. from power_interface.h:34,
  2078. from power_interface.cc:32:
  2079. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2080. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2081. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2082. | ~~~^ ~~
  2083. | | |
  2084. | | address_type {aka long long unsigned int}
  2085. | unsigned int
  2086. | %04llx
  2087. local_interconnect.cc: In member function ‘void xbar_router::RR_Advance()’:
  2088. local_interconnect.cc:152:16: warning: operation on ‘((xbar_router*)this)->xbar_router::next_node_id’ may be undefined [-Wsequence-point]
  2089. 152 | next_node_id = (++next_node_id % total_nodes);
  2090. | ~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2091. local_interconnect.cc:162:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2092. 162 | printf("%d : cycle %d : conflicts = %d\n", m_id, cycles, conflict_sub);
  2093. | ~^ ~~~~~~
  2094. | | |
  2095. | int long long unsigned int
  2096. | %lld
  2097. local_interconnect.cc:163:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2098. 163 | printf("%d : cycle %d : passing reqs = %d\n", m_id, cycles, reqs);
  2099. | ~^ ~~~~~~
  2100. | | |
  2101. | int long long unsigned int
  2102. | %lld
  2103. local_interconnect.cc: In member function ‘void xbar_router::iSLIP_Advance()’:
  2104. local_interconnect.cc:220:35: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2105. 220 | printf("%d : cycle %d : send req from %d to %d\n", m_id, cycles,
  2106. | ~^ ~~~~~~
  2107. | | |
  2108. | int long long unsigned int
  2109. | %lld
  2110. local_interconnect.cc:231:41: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2111. 231 | printf("%d : cycle %d : cannot send req from %d to %d\n",
  2112. | ~^
  2113. | |
  2114. | int
  2115. | %lld
  2116. 232 | m_id, cycles, node_id2, i - _n_shader);
  2117. | ~~~~~~
  2118. | |
  2119. | long long unsigned int
  2120. local_interconnect.cc:251:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2121. 251 | printf("%d : cycle %d : grant_cycles = %d\n", m_id, cycles, grant_cycles);
  2122. | ~^ ~~~~~~
  2123. | | |
  2124. | int long long unsigned int
  2125. | %lld
  2126. local_interconnect.cc:259:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2127. 259 | printf("%d : cycle %d : conflicts = %d\n", m_id, cycles, conflict_sub);
  2128. | ~^ ~~~~~~
  2129. | | |
  2130. | int long long unsigned int
  2131. | %lld
  2132. local_interconnect.cc:260:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2133. 260 | printf("%d : cycle %d : passing reqs = %d\n", m_id, cycles, reqs);
  2134. | ~^ ~~~~~~
  2135. | | |
  2136. | int long long unsigned int
  2137. | %lld
  2138. In file included from gpu-sim.h:39,
  2139. from power_stat.h:36,
  2140. from power_stat.cc:31:
  2141. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2142. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2143. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2144. | ~~~^ ~~
  2145. | | |
  2146. | | address_type {aka long long unsigned int}
  2147. | unsigned int
  2148. | %04llx
  2149. In file included from gpu-sim.h:44,
  2150. from dram_sched.h:36,
  2151. from dram.cc:33:
  2152. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2153. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2154. 450 | std::vector<register_set *> &m_spec_cores_out;
  2155. | ^~~~~~~~~~~~~~~~
  2156. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2157. 449 | register_set *m_mem_out;
  2158. | ^~~~~~~~~
  2159. shader.h:344:3: warning: when initialized here [-Wreorder]
  2160. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2161. | ^~~~~~~~~~~~~~
  2162. In file included from gpu-sim.h:39,
  2163. from power_stat.h:36,
  2164. from power_stat.cc:31:
  2165. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2166. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2167. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2168. | ~~~^ ~~
  2169. | | |
  2170. | | address_type {aka long long unsigned int}
  2171. | unsigned int
  2172. | %04llx
  2173. In file included from gpu-sim.h:39,
  2174. from visualizer.cc:32:
  2175. ../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2176. ../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2177. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2178. | ~~~^ ~~
  2179. | | |
  2180. | | address_type {aka long long unsigned int}
  2181. | unsigned int
  2182. | %04llx
  2183. In file included from hashing.h:4,
  2184. from addrdec.cc:34:
  2185. addrdec.cc: In member function ‘void linear_to_raw_address_translation::sweep_test() const’:
  2186. addrdec.cc:522:28: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
  2187. 522 | assert((int)tlx.chip < m_n_channel);
  2188. | ~~~~~~~~~~~~~~^~~~~~~~~~~~~
  2189. addrdec.cc: In function ‘unsigned int next_powerOf2(unsigned int)’:
  2190. addrdec.cc:587:16: warning: suggest parentheses around ‘-’ in operand of ‘&’ [-Wparentheses]
  2191. 587 | while (n & n - 1) n = n & (n - 1); // unset rightmost bit
  2192. | ~~^~~
  2193. g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInst.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInstList.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.o
  2194. In file included from gpu-sim.h:39,
  2195. from visualizer.cc:32:
  2196. ../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2197. ../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2198. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2199. | ~~~^ ~~
  2200. | | |
  2201. | | address_type {aka long long unsigned int}
  2202. | unsigned int
  2203. | %04llx
  2204. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2205. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2206. 1298 | if (inst.op != m_supported_op) {
  2207. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2208. In file included from gpu-cache.cc:32:
  2209. gpu-cache.h: In member function ‘unsigned int sector_cache_block::get_sector_index(mem_access_sector_mask_t)’:
  2210. gpu-cache.h:502:3: warning: control reaches end of non-void function [-Wreturn-type]
  2211. 502 | }
  2212. | ^
  2213. In file included from gpu-sim.h:44,
  2214. from gpu-sim.cc:32:
  2215. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2216. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2217. 450 | std::vector<register_set *> &m_spec_cores_out;
  2218. | ^~~~~~~~~~~~~~~~
  2219. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2220. 449 | register_set *m_mem_out;
  2221. | ^~~~~~~~~
  2222. shader.h:344:3: warning: when initialized here [-Wreorder]
  2223. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2224. | ^~~~~~~~~~~~~~
  2225. In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
  2226. from ../../libcuda/gpgpu_context.h:3,
  2227. from stat-tool.cc:40:
  2228. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2229. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2230. 450 | std::vector<register_set *> &m_spec_cores_out;
  2231. | ^~~~~~~~~~~~~~~~
  2232. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2233. 449 | register_set *m_mem_out;
  2234. | ^~~~~~~~~
  2235. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2236. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2237. | ^~~~~~~~~~~~~~
  2238. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2239. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2240. 1298 | if (inst.op != m_supported_op) {
  2241. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2242. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
  2243. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2244. ../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2245. 1298 | if (inst.op != m_supported_op) {
  2246. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2247. In file included from gpu-sim.h:44,
  2248. from power_interface.h:34,
  2249. from power_interface.cc:32:
  2250. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2251. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2252. 450 | std::vector<register_set *> &m_spec_cores_out;
  2253. | ^~~~~~~~~~~~~~~~
  2254. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2255. 449 | register_set *m_mem_out;
  2256. | ^~~~~~~~~
  2257. shader.h:344:3: warning: when initialized here [-Wreorder]
  2258. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2259. | ^~~~~~~~~~~~~~
  2260. In file included from shader.cc:32:
  2261. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2262. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2263. 450 | std::vector<register_set *> &m_spec_cores_out;
  2264. | ^~~~~~~~~~~~~~~~
  2265. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2266. 449 | register_set *m_mem_out;
  2267. | ^~~~~~~~~
  2268. shader.h:344:3: warning: when initialized here [-Wreorder]
  2269. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2270. | ^~~~~~~~~~~~~~
  2271. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2272. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2273. 1298 | if (inst.op != m_supported_op) {
  2274. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2275. In file included from gpu-sim.h:44,
  2276. from power_stat.h:36,
  2277. from power_stat.cc:31:
  2278. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2279. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2280. 450 | std::vector<register_set *> &m_spec_cores_out;
  2281. | ^~~~~~~~~~~~~~~~
  2282. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2283. 449 | register_set *m_mem_out;
  2284. | ^~~~~~~~~
  2285. shader.h:344:3: warning: when initialized here [-Wreorder]
  2286. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2287. | ^~~~~~~~~~~~~~
  2288. In file included from gpu-sim.h:44,
  2289. from visualizer.cc:32:
  2290. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2291. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2292. 450 | std::vector<register_set *> &m_spec_cores_out;
  2293. | ^~~~~~~~~~~~~~~~
  2294. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2295. 449 | register_set *m_mem_out;
  2296. | ^~~~~~~~~
  2297. shader.h:344:3: warning: when initialized here [-Wreorder]
  2298. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2299. | ^~~~~~~~~~~~~~
  2300. cuda_runtime_api.cc: In function ‘cudaError_t cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlagsInternal(int*, const char*, int, size_t, unsigned int, gpgpu_context*)’:
  2301. cuda_runtime_api.cc:1413:18: warning: format ‘%d’ expects argument of type ‘int’, but argument 4 has type ‘size_t’ {aka ‘long unsigned int’} [-Wformat=]
  2302. 1413 | "SMemSize=%d\n",
  2303. | ~^
  2304. | |
  2305. | int
  2306. | %ld
  2307. 1414 | hostFunc, blockSize, dynamicSMemSize);
  2308. | ~~~~~~~~~~~~~~~
  2309. | |
  2310. | size_t {aka long unsigned int}
  2311. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2312. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2313. 1298 | if (inst.op != m_supported_op) {
  2314. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2315. dram.cc: In member function ‘unsigned int dram_t::get_bankgrp_number(unsigned int)’:
  2316. dram.cc:884:1: warning: control reaches end of non-void function [-Wreturn-type]
  2317. 884 | }
  2318. | ^
  2319. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2320. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2321. 1298 | if (inst.op != m_supported_op) {
  2322. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2323. cuda_runtime_api.cc: In function ‘unsigned int __cudaPushCallConfiguration(dim3, dim3, size_t, CUstream_st*)’:
  2324. cuda_runtime_api.cc:3599:1: warning: no return statement in function returning non-void [-Wreturn-type]
  2325. 3599 | }
  2326. | ^
  2327. mem_fetch.cc: In member function ‘void mem_fetch::print(FILE*, bool) const’:
  2328. mem_fetch.cc:90:3: warning: nonnull argument ‘this’ compared to NULL [-Wnonnull-compare]
  2329. 90 | if (this == NULL) {
  2330. | ^~
  2331. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2332. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2333. 1298 | if (inst.op != m_supported_op) {
  2334. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2335. In file included from scoreboard.cc:31:
  2336. shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2337. shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2338. 450 | std::vector<register_set *> &m_spec_cores_out;
  2339. | ^~~~~~~~~~~~~~~~
  2340. shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2341. 449 | register_set *m_mem_out;
  2342. | ^~~~~~~~~
  2343. shader.h:344:3: warning: when initialized here [-Wreorder]
  2344. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2345. | ^~~~~~~~~~~~~~
  2346. power_interface.cc: In function ‘void calculate_hw_mcpat(const gpgpu_sim_config&, const shader_core_config*, gpgpu_sim_wrapper*, power_stat_t*, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int, int, bool, char*, char*, std::string, const bool*, bool)’:
  2347. power_interface.cc:269:3: warning: this ‘if’ clause does not guard... [-Wmisleading-indentation]
  2348. 269 | if((power_simulation_mode == 2) && (accelwattch_hybrid_configuration[HW_L1_WM]))
  2349. | ^~
  2350. power_interface.cc:272:5: note: ...this statement, but the latter is misleadingly indented as if it were guarded by the ‘if’
  2351. 272 | if(aggregate_power_stats){
  2352. | ^~
  2353. shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2354. shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2355. 1298 | if (inst.op != m_supported_op) {
  2356. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2357. stat-tool.cc: In member function ‘void thread_insn_span::print_span(FILE*) const’:
  2358. stat-tool.cc:522:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  2359. 522 | fprintf(fout, "%d ", i_sc->first);
  2360. | ~^ ~~~~~~~~~~~
  2361. | | |
  2362. | int long long unsigned int
  2363. | %lld
  2364. shader.cc: In member function ‘void shader_core_ctx::create_front_pipeline()’:
  2365. shader.cc:118:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  2366. 118 | for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
  2367. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2368. shader.cc:126:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  2369. 126 | for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
  2370. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2371. shader.cc:151:23: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  2372. 151 | for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
  2373. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2374. shader.cc: In member function ‘void shader_core_ctx::create_exec_pipeline()’:
  2375. shader.cc:419:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  2376. 419 | for (unsigned k = 0; k < m_config->gpgpu_num_sp_units; k++) {
  2377. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2378. shader.cc:425:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  2379. 425 | for (unsigned k = 0; k < m_config->gpgpu_num_dp_units; k++) {
  2380. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2381. shader.cc:430:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  2382. 430 | for (unsigned k = 0; k < m_config->gpgpu_num_int_units; k++) {
  2383. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2384. shader.cc:436:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  2385. 436 | for (unsigned k = 0; k < m_config->gpgpu_num_sfu_units; k++) {
  2386. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2387. shader.cc:442:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  2388. 442 | for (unsigned k = 0; k < m_config->gpgpu_num_tensor_core_units; k++) {
  2389. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2390. In file included from gpu-sim.cc:43:
  2391. gpu-sim.cc: In member function ‘bool shader_core_ctx::occupy_shader_resource_1block(kernel_info_t&, bool)’:
  2392. shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2393. 41 | (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
  2394. | ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
  2395. shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
  2396. 47 | if (SHADER_DTRACE(x)) { \
  2397. | ^~~~~~~~~~~~~
  2398. gpu-sim.cc:1680:5: note: in expansion of macro ‘SHADER_DPRINTF’
  2399. 1680 | SHADER_DPRINTF(LIVENESS,
  2400. | ^~~~~~~~~~~~~~
  2401. shader.cc: In constructor ‘shader_core_ctx::shader_core_ctx(gpgpu_sim*, simt_core_cluster*, unsigned int, unsigned int, const shader_core_config*, const memory_config*, shader_core_stats*)’:
  2402. shader.cc:492:12: warning: unused variable ‘warp_size’ [-Wunused-variable]
  2403. 492 | unsigned warp_size = config->warp_size;
  2404. | ^~~~~~~~~
  2405. gpu-sim.cc: In member function ‘void shader_core_ctx::issue_block2core(kernel_info_t&)’:
  2406. shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2407. 41 | (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
  2408. | ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
  2409. shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
  2410. 47 | if (SHADER_DTRACE(x)) { \
  2411. | ^~~~~~~~~~~~~
  2412. gpu-sim.cc:1848:3: note: in expansion of macro ‘SHADER_DPRINTF’
  2413. 1848 | SHADER_DPRINTF(LIVENESS,
  2414. | ^~~~~~~~~~~~~~
  2415. In file included from gpu-sim.h:41,
  2416. from gpu-sim.cc:32:
  2417. gpu-sim.cc: In member function ‘void gpgpu_sim::cycle()’:
  2418. gpu-sim.cc:2203:18: warning: unknown conversion type character ‘[’ in format [-Wformat=]
  2419. 2203 | "uArch: inst.: %lld (ipc=%4.1f, occ=%0.4f\% [%llu / %llu]) "
  2420. | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2421. 2204 | "sim_rate=%u (inst/sec) elapsed = %u:%u:%02u:%02u / %s",
  2422. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2423. ../trace.h:76:14: note: in definition of macro ‘DPRINTFG’
  2424. 76 | printf(__VA_ARGS__); \
  2425. | ^~~~~~~~~~~
  2426. gpu-sim.cc:2203:62: note: format string is defined here
  2427. 2203 | "uArch: inst.: %lld (ipc=%4.1f, occ=%0.4f\% [%llu / %llu]) "
  2428. | ^
  2429. shader.cc: In member function ‘void shader_core_ctx::read_operands()’:
  2430. shader.cc:1656:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
  2431. 1656 | for (int i = 0; i < m_config->reg_file_port_throughput; ++i)
  2432. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2433. shader.cc: In member function ‘mem_stage_stall_type ldst_unit::process_memory_access_queue_l1cache(l1_cache*, warp_inst_t&)’:
  2434. shader.cc:1959:23: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2435. 1959 | for (int j = 0; j < m_config->m_L1D_config.l1_banks;
  2436. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2437. shader.cc: In member function ‘void ldst_unit::L1_latency_queue_cycle()’:
  2438. shader.cc:2012:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2439. 2012 | for (int j = 0; j < m_config->m_L1D_config.l1_banks; j++) {
  2440. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2441. In file included from shader.cc:48:
  2442. shader.cc: In member function ‘void shader_core_ctx::register_cta_thread_exit(unsigned int, kernel_info_t*)’:
  2443. shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2444. 41 | (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
  2445. | ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
  2446. shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
  2447. 47 | if (SHADER_DTRACE(x)) { \
  2448. | ^~~~~~~~~~~~~
  2449. shader.cc:2828:5: note: in expansion of macro ‘SHADER_DPRINTF’
  2450. 2828 | SHADER_DPRINTF(
  2451. | ^~~~~~~~~~~~~~
  2452. shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2453. 41 | (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
  2454. | ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
  2455. shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
  2456. 47 | if (SHADER_DTRACE(x)) { \
  2457. | ^~~~~~~~~~~~~
  2458. shader.cc:2835:7: note: in expansion of macro ‘SHADER_DPRINTF’
  2459. 2835 | SHADER_DPRINTF(
  2460. | ^~~~~~~~~~~~~~
  2461. shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2462. 41 | (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
  2463. | ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
  2464. shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
  2465. 47 | if (SHADER_DTRACE(x)) { \
  2466. | ^~~~~~~~~~~~~
  2467. shader.cc:2853:9: note: in expansion of macro ‘SHADER_DPRINTF’
  2468. 2853 | SHADER_DPRINTF(LIVENESS,
  2469. | ^~~~~~~~~~~~~~
  2470. shader.cc: In member function ‘void warp_inst_t::print(FILE*) const’:
  2471. shader.cc:3091:25: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2472. 3091 | fprintf(fout, "0x%04x ", pc);
  2473. | ~~~^ ~~
  2474. | | |
  2475. | | address_type {aka long long unsigned int}
  2476. | unsigned int
  2477. | %04llx
  2478. shader.cc: In member function ‘void shader_core_ctx::display_pipeline(FILE*, int, int) const’:
  2479. shader.cc:3277:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2480. 3277 | fprintf(fout, "w%2u : pc = 0x%x, nbytes = %u\n",
  2481. | ~^
  2482. | |
  2483. | unsigned int
  2484. | %llx
  2485. 3278 | m_inst_fetch_buffer.m_warp_id, m_inst_fetch_buffer.m_pc,
  2486. | ~~~~~~~~~~~~~~~~~~~~~~~~
  2487. | |
  2488. | address_type {aka long long unsigned int}
  2489. shader.cc: In member function ‘void shader_core_ctx::cycle()’:
  2490. shader.cc:3510:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
  2491. 3510 | for (int i = 0; i < m_config->inst_fetch_throughput; ++i) {
  2492. | ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2493. shader.cc: In member function ‘void shd_warp_t::print(FILE*) const’:
  2494. shader.cc:3943:36: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2495. 3943 | fprintf(fout, "w%02u npc: 0x%04x, done:%c%c%c%c:%2u i:%u s:%u a:%u (done: ",
  2496. | ~~~^
  2497. | |
  2498. | unsigned int
  2499. | %04llx
  2500. 3944 | m_warp_id, m_next_pc, (functional_done() ? 'f' : ' '),
  2501. | ~~~~~~~~~
  2502. | |
  2503. | address_type {aka long long unsigned int}
  2504. shader.cc: In member function ‘bool simt_core_cluster::icnt_injection_buffer_full(unsigned int, bool)’:
  2505. shader.cc:4425:12: warning: unused variable ‘source’ [-Wunused-variable]
  2506. 4425 | unsigned source = m_cluster_id / (m_config->n_simt_clusters/m_config->chiplet_num);
  2507. | ^~~~~~
  2508. cuda_runtime_api.cc: In function ‘int get_app_cuda_version()’:
  2509. cuda_runtime_api.cc:466:9: warning: ignoring return value of ‘int system(const char*)’, declared with attribute warn_unused_result [-Wunused-result]
  2510. 466 | system(app_cuda_version_command.c_str());
  2511. | ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2512. cuda_runtime_api.cc: In function ‘char* readfile(std::string)’:
  2513. cuda_runtime_api.cc:3237:8: warning: ignoring return value of ‘size_t fread(void*, size_t, size_t, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  2514. 3237 | fread(ret, 1, filesize, fp);
  2515. | ~~~~~^~~~~~~~~~~~~~~~~~~~~~
  2516. cuda_runtime_api.cc: In function ‘char* get_app_binary_name(std::string)’:
  2517. cuda_runtime_api.cc:451:25: warning: ‘self_exe_path’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2518. 451 | self_exe_path = strtok(self_exe_path, ".");
  2519. | ~~~~~~^~~~~~~~~~~~~~~~~~~~
  2520. cuda_runtime_api.cc: In member function ‘void cuda_runtime_api::extract_ptx_files_using_cuobjdump(CUctx_st*)’:
  2521. cuda_runtime_api.cc:3006:30: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 922 [-Wformat-truncation=]
  2522. 3006 | "awk '{$1=$1}1' > %s",
  2523. | ^~
  2524. 3007 | app_binary.c_str(), ptx_list_file_name);
  2525. | ~~~~~~~~~~~~~~~~~~
  2526. In file included from /usr/include/stdio.h:867,
  2527. from cuda_runtime_api.cc:107:
  2528. /usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 79 or more bytes (assuming 1102) into a destination of size 1000
  2529. 67 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
  2530. | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2531. 68 | __bos (__s), __fmt, __va_arg_pack ());
  2532. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2533. cuda_runtime_api.cc: In member function ‘void gpgpu_context::cuobjdumpParseBinary(unsigned int)’:
  2534. cuda_runtime_api.cc:3499:22: warning: ‘symtab’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2535. 3499 | api->load_constants(symtab, STATIC_ALLOC_LIMIT,
  2536. | ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2537. 3500 | context->get_device()->get_gpgpu());
  2538. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2539. shader.cc: In member function ‘void opndcoll_rfu_t::init(unsigned int, shader_core_ctx*)’:
  2540. shader.cc:4033:18: warning: ‘reg_id’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2541. 4033 | m_cu[j]->init(j, num_banks, m_bank_warp_shift, shader->get_config(), this,
  2542. | ~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2543. 4034 | sub_core_model, reg_id, m_num_banks_per_sched);
  2544. | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  2545. In file included from shader.h:50,
  2546. from shader.cc:32:
  2547. ../abstract_hardware_model.h: In member function ‘void opndcoll_rfu_t::allocate_cu(unsigned int)’:
  2548. ../abstract_hardware_model.h:1403:27: warning: ‘reg_id’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2549. 1403 | assert(not regs[reg_id]->empty());
  2550. | ^
  2551. ../abstract_hardware_model.h:1389:14: note: ‘reg_id’ was declared here
  2552. 1389 | unsigned reg_id;
  2553. | ^~~~~~
  2554. echo /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
  2555. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
  2556. ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/libcuda.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
  2557. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
  2558. g++ -m64 /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Ucache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/XML_Parse.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/arbiter.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/area.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/array.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/bank.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_circuit.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_components.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/component.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/core.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/crossbar.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/decoder.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/htree2.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/io.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/iocontrollers.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/logic.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/main.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/memoryctrl.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/noc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/nuca.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/parameter.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/processor.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/router.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/sharedcache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/subarray.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/technology.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/uca.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/wire.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/xmlParser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/gpgpu_sim_wrapper.o -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mcpat -lm -Wno-unknown-pragmas -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz -I/usr/lib/ -I/usr/lib64/ -pthread
  2559. make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  2560. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
  2561. ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/libgpu_uarch_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o
  2562. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  2563. make "CREATE_LIBRARY=1" "DEBUG=0" -C ./src/intersim2
  2564. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/intersim2'
  2565. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/Makefile.makedepend
  2566. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/Makefile.makedepend -I-I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/ config_utils.cpp booksim_config.cpp module.cpp buffer.cpp vc.cpp routefunc.cpp traffic.cpp flitchannel.cpp trafficmanager.cpp batchtrafficmanager.cpp packet_reply_info.cpp buffer_state.cpp stats.cpp credit.cpp outputset.cpp flit.cpp injection.cpp misc_utils.cpp rng_wrapper.cpp rng_double_wrapper.cpp power_module.cpp switch_monitor.cpp buffer_monitor.cpp main.cpp gputrafficmanager.cpp intersim_config.cpp interconnect_interface.cpp allocators/allocator.cpp allocators/islip.cpp allocators/loa.cpp allocators/maxsize.cpp allocators/pim.cpp allocators/selalloc.cpp allocators/separable.cpp allocators/separable_input_first.cpp allocators/separable_output_first.cpp allocators/wavefront.cpp arbiters/arbiter.cpp arbiters/matrix_arb.cpp arbiters/prio_arb.cpp arbiters/roundrobin_arb.cpp arbiters/tree_arb.cpp networks/anynet.cpp networks/cmesh.cpp networks/dragonfly.cpp networks/fattree.cpp networks/flatfly_onchip.cpp networks/fly.cpp networks/kncube.cpp networks/network.cpp networks/qtree.cpp networks/tree4.cpp power/buffer_monitor.cpp power/power_module.cpp power/switch_monitor.cpp routers/chaos_router.cpp routers/event_router.cpp routers/iq_router.cpp routers/router.cpp 2> /dev/null
  2567. flex -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c config.l
  2568. bison -y -d config.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y
  2569. bison -y -d config.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y
  2570. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c config_utils.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/config_utils.o
  2571. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c booksim_config.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/booksim_config.o
  2572. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c module.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/module.o
  2573. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c buffer.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer.o
  2574. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c vc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/vc.o
  2575. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routefunc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/routefunc.o
  2576. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c traffic.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/traffic.o
  2577. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c flitchannel.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flitchannel.o
  2578. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c trafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/trafficmanager.o
  2579. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c batchtrafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/batchtrafficmanager.o
  2580. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c packet_reply_info.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/packet_reply_info.o
  2581. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c buffer_state.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer_state.o
  2582. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c stats.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/stats.o
  2583. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c credit.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/credit.o
  2584. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c outputset.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/outputset.o
  2585. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c flit.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flit.o
  2586. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c injection.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/injection.o
  2587. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c misc_utils.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/misc_utils.o
  2588. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c rng_wrapper.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/rng_wrapper.o
  2589. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c rng_double_wrapper.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/rng_double_wrapper.o
  2590. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/power_module.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/power_module.o
  2591. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/switch_monitor.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/switch_monitor.o
  2592. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/buffer_monitor.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer_monitor.o
  2593. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c main.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/main.o
  2594. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c gputrafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/gputrafficmanager.o
  2595. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c intersim_config.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/intersim_config.o
  2596. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c interconnect_interface.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/interconnect_interface.o
  2597. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/fattree.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/fattree.o
  2598. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/cmesh.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/cmesh.o
  2599. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/flatfly_onchip.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flatfly_onchip.o
  2600. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/qtree.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/qtree.o
  2601. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/tree4.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/tree4.o
  2602. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/network.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/network.o
  2603. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/anynet.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/anynet.o
  2604. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/fly.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/fly.o
  2605. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/dragonfly.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/dragonfly.o
  2606. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/kncube.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/kncube.o
  2607. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/wavefront.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/wavefront.o
  2608. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/islip.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/islip.o
  2609. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/selalloc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/selalloc.o
  2610. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable_output_first.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable_output_first.o
  2611. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable.o
  2612. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/allocator.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/allocator.o
  2613. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable_input_first.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable_input_first.o
  2614. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/pim.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/pim.o
  2615. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/loa.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/loa.o
  2616. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/maxsize.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/maxsize.o
  2617. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/prio_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/prio_arb.o
  2618. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/matrix_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/matrix_arb.o
  2619. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/tree_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/tree_arb.o
  2620. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/roundrobin_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/roundrobin_arb.o
  2621. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/arbiter.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/arbiter.o
  2622. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/router.o
  2623. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/event_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/event_router.o
  2624. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/iq_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/iq_router.o
  2625. g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/chaos_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/chaos_router.o
  2626. gcc -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.o
  2627. gcc -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y.tab.o
  2628. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c:1194:16: warning: ‘input’ defined but not used [-Wunused-function]
  2629. 1194 | static int input (void)
  2630. | ^~~~~
  2631. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c:1151:17: warning: ‘yyunput’ defined but not used [-Wunused-function]
  2632. 1151 | static void yyunput (int c, char * yy_bp )
  2633. | ^~~~~~~
  2634. vc.cpp: In member function ‘void VC::AddFlit(Flit*)’:
  2635. vc.cpp:86:15: warning: comparison of integer expressions of different signedness: ‘long long unsigned int’ and ‘int’ [-Wsign-compare]
  2636. 86 | if(f->pid != _expected_pid) {
  2637. | ~~~~~~~^~~~~~~~~~~~~~~~
  2638. networks/qtree.cpp: In member function ‘virtual void QTree::_BuildNet(const Configuration&, int)’:
  2639. networks/qtree.cpp:143:12: warning: ‘r’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2640. 143 | _routers[r]->AddInputChannel( _chan[c],
  2641. | ^
  2642. networks/dragonfly.cpp: In function ‘int dragonfly_port(int, int, int)’:
  2643. networks/dragonfly.cpp:114:7: warning: variable ‘group_dest’ set but not used [-Wunused-but-set-variable]
  2644. 114 | int group_dest=-1;
  2645. | ^~~~~~~~~~
  2646. networks/dragonfly.cpp: In member function ‘virtual void DragonFlyNew::_BuildNet(const Configuration&, int)’:
  2647. networks/dragonfly.cpp:362:9: warning: variable ‘_grp_num_routers’ set but not used [-Wunused-but-set-variable]
  2648. 362 | int _grp_num_routers;
  2649. | ^~~~~~~~~~~~~~~~
  2650. networks/dragonfly.cpp:364:9: warning: variable ‘grp_ID2’ set but not used [-Wunused-but-set-variable]
  2651. 364 | int grp_ID2;
  2652. | ^~~~~~~
  2653. networks/dragonfly.cpp:227:7: warning: variable ‘_dim_size’ set but not used [-Wunused-but-set-variable]
  2654. 227 | int _dim_size=-1;
  2655. | ^~~~~~~~~
  2656. networks/dragonfly.cpp: In function ‘void ugal_dragonflynew(const Router*, const Flit*, int, OutputSet*, int, bool)’:
  2657. networks/dragonfly.cpp:501:23: warning: variable ‘min_hopcnt’ set but not used [-Wunused-but-set-variable]
  2658. 501 | int min_queue_size, min_hopcnt;
  2659. | ^~~~~~~~~~
  2660. networks/dragonfly.cpp:502:26: warning: variable ‘nonmin_hopcnt’ set but not used [-Wunused-but-set-variable]
  2661. 502 | int nonmin_queue_size, nonmin_hopcnt;
  2662. | ^~~~~~~~~~~~~
  2663. networks/kncube.cpp: In member function ‘virtual void KNCube::InsertRandomFaults(const Configuration&)’:
  2664. networks/kncube.cpp:305:22: warning: ‘chan’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2665. 305 | OutChannelFault( node, chan );
  2666. | ~~~~~~~~~~~~~~~^~~~~~~~~~~~~~
  2667. networks/flatfly_onchip.cpp: In function ‘int find_distance(int, int, int)’:
  2668. networks/flatfly_onchip.cpp:1212:7: warning: variable ‘_dim_size’ set but not used [-Wunused-but-set-variable]
  2669. 1212 | int _dim_size;
  2670. | ^~~~~~~~~
  2671. networks/anynet.cpp: In member function ‘void AnyNet::readFile()’:
  2672. networks/anynet.cpp:495:22: warning: comparison of integer expressions of different signedness: ‘__gnu_cxx::__alloc_traits<std::allocator<int>, int>::value_type’ {aka ‘int’} and ‘size_t’ {aka ‘long unsigned int’} [-Wsign-compare]
  2673. 495 | if(node_check[i] != i){
  2674. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:217,
  2675. from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
  2676. from interconnect_interface.cpp:41:
  2677. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  2678. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  2679. 67 | bool done() const { return m_updates == m_issued; }
  2680. | ~~~~~~~~~~^~~~~~~~~~~
  2681. main.cpp: In function ‘int GetSimTime(int)’:
  2682. main.cpp:92:1: warning: control reaches end of non-void function [-Wreturn-type]
  2683. 92 | }
  2684. | ^
  2685. main.cpp: In function ‘Stats* GetStats(const string&, int)’:
  2686. main.cpp:111:1: warning: control reaches end of non-void function [-Wreturn-type]
  2687. 111 | }
  2688. | ^
  2689. networks/tree4.cpp: In member function ‘int Tree4::_WireLatency(int, int, int, int)’:
  2690. networks/tree4.cpp:290:10: warning: ‘L’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2691. 290 | return L;
  2692. | ^
  2693. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
  2694. from interconnect_interface.cpp:41:
  2695. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2696. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2697. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2698. | ~~~^ ~~
  2699. | | |
  2700. | | address_type {aka long long unsigned int}
  2701. | unsigned int
  2702. | %04llx
  2703. In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
  2704. from interconnect_interface.cpp:41:
  2705. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2706. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2707. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2708. | ~~~^ ~~
  2709. | | |
  2710. | | address_type {aka long long unsigned int}
  2711. | unsigned int
  2712. | %04llx
  2713. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/intersim2'
  2714. make -C ./src/ depend
  2715. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
  2716. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend
  2717. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/ abstract_hardware_model.cc debug.cc gpgpusim_entrypoint.cc option_parser.cc statwrapper.cc stream_manager.cc trace.cc 2> /dev/null
  2718. make[2]: 'depend' is up to date.
  2719. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
  2720. make -C ./src/
  2721. make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
  2722. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend
  2723. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/ abstract_hardware_model.cc debug.cc gpgpusim_entrypoint.cc option_parser.cc statwrapper.cc stream_manager.cc trace.cc 2> /dev/null
  2724. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/abstract_hardware_model.o -c abstract_hardware_model.cc
  2725. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/debug.o -c debug.cc
  2726. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpusim_entrypoint.o -c gpgpusim_entrypoint.cc
  2727. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/option_parser.o -c option_parser.cc
  2728. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/statwrapper.o -c statwrapper.cc
  2729. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/stream_manager.o -c stream_manager.cc
  2730. g++ -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/trace.o -c trace.cc
  2731. make -C ./gpgpu-sim
  2732. make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  2733. touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
  2734. makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
  2735. ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/libgpu_uarch_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o
  2736. make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
  2737. In file included from abstract_hardware_model.h:217,
  2738. from abstract_hardware_model.cc:32:
  2739. stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  2740. stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  2741. 67 | bool done() const { return m_updates == m_issued; }
  2742. | ~~~~~~~~~~^~~~~~~~~~~
  2743. In file included from abstract_hardware_model.h:217,
  2744. from debug.h:32,
  2745. from debug.cc:29:
  2746. stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  2747. stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  2748. 67 | bool done() const { return m_updates == m_issued; }
  2749. | ~~~~~~~~~~^~~~~~~~~~~
  2750. In file included from abstract_hardware_model.cc:32:
  2751. abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2752. abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2753. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2754. | ~~~^ ~~
  2755. | | |
  2756. | | address_type {aka long long unsigned int}
  2757. | unsigned int
  2758. | %04llx
  2759. In file included from debug.h:32,
  2760. from debug.cc:29:
  2761. abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2762. abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2763. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2764. | ~~~^ ~~
  2765. | | |
  2766. | | address_type {aka long long unsigned int}
  2767. | unsigned int
  2768. | %04llx
  2769. In file included from abstract_hardware_model.cc:32:
  2770. abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2771. abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2772. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2773. | ~~~^ ~~
  2774. | | |
  2775. | | address_type {aka long long unsigned int}
  2776. | unsigned int
  2777. | %04llx
  2778. In file included from abstract_hardware_model.h:217,
  2779. from gpgpusim_entrypoint.h:35,
  2780. from gpgpusim_entrypoint.cc:29:
  2781. stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  2782. stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  2783. 67 | bool done() const { return m_updates == m_issued; }
  2784. | ~~~~~~~~~~^~~~~~~~~~~
  2785. In file included from debug.h:32,
  2786. from debug.cc:29:
  2787. abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2788. abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2789. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2790. | ~~~^ ~~
  2791. | | |
  2792. | | address_type {aka long long unsigned int}
  2793. | unsigned int
  2794. | %04llx
  2795. In file included from gpgpusim_entrypoint.h:35,
  2796. from gpgpusim_entrypoint.cc:29:
  2797. abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2798. abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2799. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2800. | ~~~^ ~~
  2801. | | |
  2802. | | address_type {aka long long unsigned int}
  2803. | unsigned int
  2804. | %04llx
  2805. In file included from gpgpusim_entrypoint.h:35,
  2806. from gpgpusim_entrypoint.cc:29:
  2807. abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2808. abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2809. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2810. | ~~~^ ~~
  2811. | | |
  2812. | | address_type {aka long long unsigned int}
  2813. | unsigned int
  2814. | %04llx
  2815. In file included from stream_manager.h:35,
  2816. from stream_manager.cc:29:
  2817. abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
  2818. abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2819. 966 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2820. | ~~~^ ~~
  2821. | | |
  2822. | | address_type {aka long long unsigned int}
  2823. | unsigned int
  2824. | %04llx
  2825. In file included from stream_manager.h:35,
  2826. from stream_manager.cc:29:
  2827. abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
  2828. abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2829. 1160 | fprintf(fp, " [inst @ pc=0x%04x] ", pc);
  2830. | ~~~^ ~~
  2831. | | |
  2832. | | address_type {aka long long unsigned int}
  2833. | unsigned int
  2834. | %04llx
  2835. In file included from stream_manager.cc:29:
  2836. stream_manager.h: In member function ‘bool CUevent_st::done() const’:
  2837. stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
  2838. 67 | bool done() const { return m_updates == m_issued; }
  2839. | ~~~~~~~~~~^~~~~~~~~~~
  2840. In file included from cuda-sim/cuda-sim.h:37,
  2841. from debug.cc:30:
  2842. cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2843. cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2844. 450 | std::vector<register_set *> &m_spec_cores_out;
  2845. | ^~~~~~~~~~~~~~~~
  2846. cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2847. 449 | register_set *m_mem_out;
  2848. | ^~~~~~~~~
  2849. cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2850. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2851. | ^~~~~~~~~~~~~~
  2852. cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2853. cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2854. 1298 | if (inst.op != m_supported_op) {
  2855. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2856. In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
  2857. from ../libcuda/gpgpu_context.h:3,
  2858. from gpgpusim_entrypoint.cc:32:
  2859. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2860. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2861. 450 | std::vector<register_set *> &m_spec_cores_out;
  2862. | ^~~~~~~~~~~~~~~~
  2863. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2864. 449 | register_set *m_mem_out;
  2865. | ^~~~~~~~~
  2866. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2867. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2868. | ^~~~~~~~~~~~~~
  2869. In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
  2870. from ../libcuda/gpgpu_context.h:3,
  2871. from abstract_hardware_model.cc:37:
  2872. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2873. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2874. 450 | std::vector<register_set *> &m_spec_cores_out;
  2875. | ^~~~~~~~~~~~~~~~
  2876. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2877. 449 | register_set *m_mem_out;
  2878. | ^~~~~~~~~
  2879. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2880. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2881. | ^~~~~~~~~~~~~~
  2882. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2883. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2884. 1298 | if (inst.op != m_supported_op) {
  2885. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2886. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2887. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2888. 1298 | if (inst.op != m_supported_op) {
  2889. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2890. In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
  2891. from ../libcuda/gpgpu_context.h:3,
  2892. from stream_manager.cc:30:
  2893. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
  2894. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  2895. 450 | std::vector<register_set *> &m_spec_cores_out;
  2896. | ^~~~~~~~~~~~~~~~
  2897. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning: ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  2898. 449 | register_set *m_mem_out;
  2899. | ^~~~~~~~~
  2900. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning: when initialized here [-Wreorder]
  2901. 344 | scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
  2902. | ^~~~~~~~~~~~~~
  2903. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
  2904. ../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
  2905. 1298 | if (inst.op != m_supported_op) {
  2906. | ~~~~~~~~^~~~~~~~~~~~~~~~~
  2907. abstract_hardware_model.cc: In member function ‘void simt_stack::print(FILE*) const’:
  2908. abstract_hardware_model.cc:1009:30: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2909. 1009 | fprintf(fout, " pc: 0x%03x", stack_entry.m_pc);
  2910. | ~~~^ ~~~~~~~~~~~~~~~~
  2911. | | |
  2912. | unsigned int address_type {aka long long unsigned int}
  2913. | %03llx
  2914. abstract_hardware_model.cc:1015:29: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2915. 1015 | fprintf(fout, " rp: %4u tp: %s cd: %2u ", stack_entry.m_recvg_pc,
  2916. | ~~^ ~~~~~~~~~~~~~~~~~~~~~~
  2917. | | |
  2918. | unsigned int address_type {aka long long unsigned int}
  2919. | %4llu
  2920. abstract_hardware_model.cc: In member function ‘void simt_stack::print_checkpoint(FILE*) const’:
  2921. abstract_hardware_model.cc:1035:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2922. 1035 | fprintf(fout, "%d %d %d %lld %d ", stack_entry.m_pc,
  2923. | ~^ ~~~~~~~~~~~~~~~~
  2924. | | |
  2925. | int address_type {aka long long unsigned int}
  2926. | %lld
  2927. abstract_hardware_model.cc:1035:27: warning: format ‘%d’ expects argument of type ‘int’, but argument 5 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  2928. 1035 | fprintf(fout, "%d %d %d %lld %d ", stack_entry.m_pc,
  2929. | ~^
  2930. | |
  2931. | int
  2932. | %lld
  2933. 1036 | stack_entry.m_calldepth, stack_entry.m_recvg_pc,
  2934. | ~~~~~~~~~~~~~~~~~~~~~~
  2935. | |
  2936. | address_type {aka long long unsigned int}
  2937. stream_manager.cc: In member function ‘void stream_operation::print(FILE*) const’:
  2938. stream_manager.cc:205:10: warning: enumeration value ‘stream_wait_event’ not handled in switch [-Wswitch]
  2939. 205 | switch (m_type) {
  2940. | ^
  2941. debug.cc: In member function ‘void gpgpu_sim::gpgpu_debug()’:
  2942. debug.cc:127:10: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  2943. 127 | fgets(line, 1024, stdin);
  2944. | ~~~~~^~~~~~~~~~~~~~~~~~~
  2945. debug.cc:139:12: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  2946. 139 | fgets(line, 1024, stdin);
  2947. | ~~~~~^~~~~~~~~~~~~~~~~~~
  2948. abstract_hardware_model.cc: In member function ‘void checkpoint::load_global_mem(memory_space*, char*)’:
  2949. abstract_hardware_model.cc:98:14: warning: ‘offset’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  2950. 98 | offset = offset + 4;
  2951. | ~~~~~~~^~~~~~~~~~~~
  2952. ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libgpgpusim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/abstract_hardware_model.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/debug.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpusim_entrypoint.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/option_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/statwrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/stream_manager.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/trace.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/*.o
  2953. make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
  2954. g++ -shared -Wl,-soname,libcudart.so -Wl,--version-script=linux-so-version.txt\
  2955. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/*.o \
  2956. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/*.o \
  2957. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/*.o \
  2958. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/*.o \
  2959. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/*.o \
  2960. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/*.o -lm -lz -lGL -pthread \
  2961. /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/*.o \
  2962. -o lib/gcc-9.4.0/cuda-11060/release/libcudart.so
  2963. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.2 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.2; fi
  2964. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.3 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.3; fi
  2965. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.4 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.4; fi
  2966. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.0; fi
  2967. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.5; fi
  2968. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.0; fi
  2969. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.5; fi
  2970. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.0; fi
  2971. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.5; fi
  2972. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.8.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.8.0; fi
  2973. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.0; fi
  2974. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.1 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.1; fi
  2975. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.2 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.2; fi
  2976. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.0; fi
  2977. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.1 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.1; fi
  2978. if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.11.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.11.0; fi
  2979. make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'
  2980. g++ -std=c++0x -o ./bin/release/accel-sim.out -L/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/lib/gcc-9.4.0/cuda-11060/release/ -lcudart -lm -lz -lGL -pthread ./build/release/*.o
  2981. make: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator'
  2982. mnaderan@rtx3080:accelsim-chiplet$ cd test/
  2983. mnaderan@rtx3080:test$ sh runme.sh
  2984. Accel-Sim [build accelsim-commit-9e19b6621cc94d89d8f7b18e65d802ad6979c195_modified_0.0]
  2985.  
  2986. *** GPGPU-Sim Simulator Version 4.2.0 [build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa_modified_0.0] ***
  2987.  
  2988.  
  2989. GPGPU-Sim: Configuration options:
  2990.  
  2991. -save_embedded_ptx 0 # saves ptx files embedded in binary as <n>.ptx
  2992. -keep 0 # keep intermediate files created by GPGPU-Sim when interfacing with external programs
  2993. -gpgpu_ptx_save_converted_ptxplus 0 # Saved converted ptxplus to a file
  2994. -gpgpu_occupancy_sm_number 86 # The SM number to pass to ptxas when getting register usage for computing GPU occupancy. This parameter is required in the config.
  2995. -ptx_opcode_latency_int 4,4,4,4,21 # Opcode latencies for integers <ADD,MAX,MUL,MAD,DIV,SHFL>Default 1,1,19,25,145,32
  2996. -ptx_opcode_latency_fp 4,4,4,4,39 # Opcode latencies for single precision floating points <ADD,MAX,MUL,MAD,DIV>Default 1,1,1,1,30
  2997. -ptx_opcode_latency_dp 64,64,64,64,330 # Opcode latencies for double precision floating points <ADD,MAX,MUL,MAD,DIV>Default 8,8,8,8,335
  2998. -ptx_opcode_latency_sfu 21 # Opcode latencies for SFU instructionsDefault 8
  2999. -ptx_opcode_latency_tesnor 64 # Opcode latencies for Tensor instructionsDefault 64
  3000. -ptx_opcode_initiation_int 2,2,2,2,2 # Opcode initiation intervals for integers <ADD,MAX,MUL,MAD,DIV,SHFL>Default 1,1,4,4,32,4
  3001. -ptx_opcode_initiation_fp 1,1,1,1,2 # Opcode initiation intervals for single precision floating points <ADD,MAX,MUL,MAD,DIV>Default 1,1,1,1,5
  3002. -ptx_opcode_initiation_dp 64,64,64,64,130 # Opcode initiation intervals for double precision floating points <ADD,MAX,MUL,MAD,DIV>Default 8,8,8,8,130
  3003. -ptx_opcode_initiation_sfu 8 # Opcode initiation intervals for sfu instructionsDefault 8
  3004. -ptx_opcode_initiation_tensor 64 # Opcode initiation intervals for tensor instructionsDefault 64
  3005. -cdp_latency 7200,8000,100,12000,1600 # CDP API latency <cudaStreamCreateWithFlags, cudaGetParameterBufferV2_init_perWarp, cudaGetParameterBufferV2_perKernel, cudaLaunchDeviceV2_init_perWarp, cudaLaunchDevicV2_perKernel>Default 7200,8000,100,12000,1600
  3006. -network_mode 1 # Interconnection network mode
  3007. -inter_config_file config_ampere_islip.icnt # Interconnection network config file
  3008. -inter_config_file_chLet config_ampere_islip_ch.icnt # Interconnection network config file
  3009. -icnt_in_buffer_limit 512 # in_buffer_limit
  3010. -icnt_out_buffer_limit 512 # out_buffer_limit
  3011. -icnt_subnets 2 # subnets
  3012. -icnt_arbiter_algo 1 # arbiter_algo
  3013. -icnt_verbose 0 # inct_verbose
  3014. -icnt_grant_cycles 1 # grant_cycles
  3015. -gpgpu_ptx_use_cuobjdump 1 # Use cuobjdump to extract ptx and sass from binaries
  3016. -gpgpu_experimental_lib_support 0 # Try to extract code from cuda libraries [Broken because of unknown cudaGetExportTable]
  3017. -checkpoint_option 0 # checkpointing flag (0 = no checkpoint)
  3018. -checkpoint_kernel 1 # checkpointing during execution of which kernel (1- 1st kernel)
  3019. -checkpoint_CTA 0 # checkpointing after # of CTA (< less than total CTA)
  3020. -resume_option 0 # resume flag (0 = no resume)
  3021. -resume_kernel 0 # Resume from which kernel (1= 1st kernel)
  3022. -resume_CTA 0 # resume from which CTA
  3023. -checkpoint_CTA_t 0 # resume from which CTA
  3024. -checkpoint_insn_Y 0 # resume from which CTA
  3025. -gpgpu_ptx_convert_to_ptxplus 0 # Convert SASS (native ISA) to ptxplus and run ptxplus
  3026. -gpgpu_ptx_force_max_capability 86 # Force maximum compute capability
  3027. -gpgpu_ptx_inst_debug_to_file 0 # Dump executed instructions' debug information to file
  3028. -gpgpu_ptx_inst_debug_file inst_debug.txt # Executed instructions' debug output file
  3029. -gpgpu_ptx_inst_debug_thread_uid 1 # Thread UID for executed instructions' debug output
  3030. -gpgpu_simd_model 1 # 1 = post-dominator
  3031. -gpgpu_shader_core_pipeline 1536:32 # shader core pipeline config, i.e., {<nthread>:<warpsize>}
  3032. -gpgpu_tex_cache:l1 N:4:128:256,L:R:m:N:L,T:512:8,128:2 # per-shader L1 texture cache (READ-ONLY) config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>:<rf>}
  3033. -gpgpu_const_cache:l1 N:128:64:8,L:R:f:N:L,S:2:64,4 # per-shader L1 constant memory cache (READ-ONLY) config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
  3034. -gpgpu_cache:il1 N:64:128:16,L:R:f:N:L,S:2:48,4 # shader L1 instruction cache config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
  3035. -gpgpu_cache:dl1 S:4:128:256,L:T:m:L:L,A:384:48,16:0,32 # per-shader L1 data cache config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
  3036. -gpgpu_l1_cache_write_ratio 25 # L1D write ratio
  3037. -gpgpu_l1_banks 4 # The number of L1 cache banks
  3038. -gpgpu_l1_banks_byte_interleaving 32 # l1 banks byte interleaving granularity
  3039. -gpgpu_l1_banks_hashing_function 0 # l1 banks hashing function
  3040. -gpgpu_l1_latency 39 # L1 Hit Latency
  3041. -gpgpu_smem_latency 29 # smem Latency
  3042. -gpgpu_cache:dl1PrefL1 none # per-shader L1 data cache config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
  3043. -gpgpu_cache:dl1PrefShared none # per-shader L1 data cache config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
  3044. -gpgpu_gmem_skip_L1D 0 # global memory access skip L1D cache (implements -Xptxas -dlcm=cg, default=no skip)
  3045. -gpgpu_perfect_mem 0 # enable perfect memory mode (no cache miss)
  3046. -n_regfile_gating_group 4 # group of lanes that should be read/written together)
  3047. -gpgpu_clock_gated_reg_file 0 # enable clock gated reg file for power calculations
  3048. -gpgpu_clock_gated_lanes 0 # enable clock gated lanes for power calculations
  3049. -gpgpu_shader_registers 65536 # Number of registers per shader core. Limits number of concurrent CTAs. (default 8192)
  3050. -gpgpu_registers_per_block 65536 # Maximum number of registers per CTA. (default 8192)
  3051. -gpgpu_ignore_resources_limitation 0 # gpgpu_ignore_resources_limitation (default 0)
  3052. -gpgpu_shader_cta 32 # Maximum number of concurrent CTAs in shader (default 32)
  3053. -gpgpu_num_cta_barriers 16 # Maximum number of named barriers per CTA (default 16)
  3054. -gpgpu_n_chiplets 16 # number of chiplets
  3055. -gpgpu_n_clusters 32 # number of processing clusters
  3056. -gpgpu_n_cores_per_cluster 1 # number of simd cores per cluster
  3057. -gpgpu_n_cluster_ejection_buffer_size 32 # number of packets in ejection buffer
  3058. -gpgpu_n_ldst_response_buffer_size 2 # number of response packets in ld/st unit ejection buffer
  3059. -gpgpu_shmem_per_block 49152 # Size of shared memory per thread block or CTA (default 48kB)
  3060. -gpgpu_shmem_size 102400 # Size of shared memory per shader core (default 16kB)
  3061. -gpgpu_shmem_option 0,8,16,32,64,100 # Option list of shared memory sizes
  3062. -gpgpu_unified_l1d_size 128 # Size of unified data cache(L1D + shared memory) in KB
  3063. -gpgpu_adaptive_cache_config 1 # adaptive_cache_config
  3064. -gpgpu_shmem_sizeDefault 102400 # Size of shared memory per shader core (default 16kB)
  3065. -gpgpu_shmem_size_PrefL1 16384 # Size of shared memory per shader core (default 16kB)
  3066. -gpgpu_shmem_size_PrefShared 16384 # Size of shared memory per shader core (default 16kB)
  3067. -gpgpu_shmem_num_banks 32 # Number of banks in the shared memory in each shader core (default 16)
  3068. -gpgpu_shmem_limited_broadcast 0 # Limit shared memory to do one broadcast per cycle (default on)
  3069. -gpgpu_shmem_warp_parts 1 # Number of portions a warp is divided into for shared memory bank conflict check
  3070. -gpgpu_mem_unit_ports 1 # The number of memory transactions allowed per core cycle
  3071. -gpgpu_shmem_warp_parts 1 # Number of portions a warp is divided into for shared memory bank conflict check
  3072. -gpgpu_warpdistro_shader -1 # Specify which shader core to collect the warp size distribution from
  3073. -gpgpu_warp_issue_shader 0 # Specify which shader core to collect the warp issue distribution from
  3074. -gpgpu_local_mem_map 1 # Mapping from local memory space address to simulated GPU physical address space (default = enabled)
  3075. -gpgpu_num_reg_banks 8 # Number of register banks (default = 8)
  3076. -gpgpu_reg_bank_use_warp_id 0 # Use warp ID in mapping registers to banks (default = off)
  3077. -gpgpu_sub_core_model 1 # Sub Core Volta/Pascal model (default = off)
  3078. -gpgpu_enable_specialized_operand_collector 0 # enable_specialized_operand_collector
  3079. -gpgpu_operand_collector_num_units_sp 4 # number of collector units (default = 4)
  3080. -gpgpu_operand_collector_num_units_dp 0 # number of collector units (default = 0)
  3081. -gpgpu_operand_collector_num_units_sfu 4 # number of collector units (default = 4)
  3082. -gpgpu_operand_collector_num_units_int 0 # number of collector units (default = 0)
  3083. -gpgpu_operand_collector_num_units_tensor_core 4 # number of collector units (default = 4)
  3084. -gpgpu_operand_collector_num_units_mem 2 # number of collector units (default = 2)
  3085. -gpgpu_operand_collector_num_units_gen 8 # number of collector units (default = 0)
  3086. -gpgpu_operand_collector_num_in_ports_sp 1 # number of collector unit in ports (default = 1)
  3087. -gpgpu_operand_collector_num_in_ports_dp 0 # number of collector unit in ports (default = 0)
  3088. -gpgpu_operand_collector_num_in_ports_sfu 1 # number of collector unit in ports (default = 1)
  3089. -gpgpu_operand_collector_num_in_ports_int 0 # number of collector unit in ports (default = 0)
  3090. -gpgpu_operand_collector_num_in_ports_tensor_core 1 # number of collector unit in ports (default = 1)
  3091. -gpgpu_operand_collector_num_in_ports_mem 1 # number of collector unit in ports (default = 1)
  3092. -gpgpu_operand_collector_num_in_ports_gen 8 # number of collector unit in ports (default = 0)
  3093. -gpgpu_operand_collector_num_out_ports_sp 1 # number of collector unit in ports (default = 1)
  3094. -gpgpu_operand_collector_num_out_ports_dp 0 # number of collector unit in ports (default = 0)
  3095. -gpgpu_operand_collector_num_out_ports_sfu 1 # number of collector unit in ports (default = 1)
  3096. -gpgpu_operand_collector_num_out_ports_int 0 # number of collector unit in ports (default = 0)
  3097. -gpgpu_operand_collector_num_out_ports_tensor_core 1 # number of collector unit in ports (default = 1)
  3098. -gpgpu_operand_collector_num_out_ports_mem 1 # number of collector unit in ports (default = 1)
  3099. -gpgpu_operand_collector_num_out_ports_gen 8 # number of collector unit in ports (default = 0)
  3100. -gpgpu_coalesce_arch 86 # Coalescing arch (GT200 = 13, Fermi = 20)
  3101. -gpgpu_num_sched_per_core 4 # Number of warp schedulers per core
  3102. -gpgpu_max_insn_issue_per_warp 1 # Max number of instructions that can be issued per warp in one cycle by scheduler (either 1 or 2)
  3103. -gpgpu_dual_issue_diff_exec_units 1 # should dual issue use two different execution unit resources (Default = 1)
  3104. -gpgpu_simt_core_sim_order 1 # Select the simulation order of cores in a cluster (0=Fix, 1=Round-Robin)
  3105. -gpgpu_pipeline_widths 4,4,4,4,4,4,4,4,4,4,8,4,4 # Pipeline widths ID_OC_SP,ID_OC_DP,ID_OC_INT,ID_OC_SFU,ID_OC_MEM,OC_EX_SP,OC_EX_DP,OC_EX_INT,OC_EX_SFU,OC_EX_MEM,EX_WB,ID_OC_TENSOR_CORE,OC_EX_TENSOR_CORE
  3106. -gpgpu_tensor_core_avail 1 # Tensor Core Available (default=0)
  3107. -gpgpu_num_sp_units 4 # Number of SP units (default=1)
  3108. -gpgpu_num_dp_units 4 # Number of DP units (default=0)
  3109. -gpgpu_num_int_units 4 # Number of INT units (default=0)
  3110. -gpgpu_num_sfu_units 4 # Number of SF units (default=1)
  3111. -gpgpu_num_tensor_core_units 4 # Number of tensor_core units (default=1)
  3112. -gpgpu_num_mem_units 1 # Number if ldst units (default=1) WARNING: not hooked up to anything
  3113. -gpgpu_scheduler lrr # Scheduler configuration: < lrr | gto | two_level_active > If two_level_active:<num_active_warps>:<inner_prioritization>:<outer_prioritization>For complete list of prioritization values see shader.h enum scheduler_prioritization_typeDefault: gto
  3114. -gpgpu_concurrent_kernel_sm 0 # Support concurrent kernels on a SM (default = disabled)
  3115. -gpgpu_perfect_inst_const_cache 1 # perfect inst and const cache mode, so all inst and const hits in the cache(default = disabled)
  3116. -gpgpu_inst_fetch_throughput 4 # the number of fetched intruction per warp each cycle
  3117. -gpgpu_reg_file_port_throughput 2 # the number ports of the register file
  3118. -specialized_unit_1 1,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3119. -specialized_unit_2 1,4,200,4,4,TEX # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3120. -specialized_unit_3 1,4,32,4,4,TENSOR # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3121. -specialized_unit_4 1,4,4,4,4,UDP # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3122. -specialized_unit_5 0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3123. -specialized_unit_6 0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3124. -specialized_unit_7 0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3125. -specialized_unit_8 0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
  3126. -gpgpu_perf_sim_memcpy 1 # Fill the L2 cache on memcpy
  3127. -gpgpu_simple_dram_model 0 # simple_dram_model with fixed latency and BW
  3128. -gpgpu_dram_scheduler 1 # 0 = fifo, 1 = FR-FCFS (defaul)
  3129. -gpgpu_dram_partition_queues 64:64:64:64 # i2$:$2d:d2$:$2i
  3130. -l2_ideal 0 # Use a ideal L2 cache that always hit
  3131. -gpgpu_cache:dl2 S:64:128:16,L:B:m:L:P,A:192:4,32:0,32 # unified banked L2 data cache config {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
  3132. -gpgpu_cache:dl2_texture_only 0 # L2 cache used for texture only
  3133. -gpgpu_n_mem 16 # number of memory modules (e.g. memory controllers) in gpu
  3134. -gpgpu_n_sub_partition_per_mchannel 2 # number of memory subpartition in each memory module
  3135. -gpgpu_n_mem_per_ctrlr 1 # number of memory chips per memory controller
  3136. -gpgpu_memlatency_stat 14 # track and display latency statistics 0x2 enables MC, 0x4 enables queue logs
  3137. -gpgpu_frfcfs_dram_sched_queue_size 64 # 0 = unlimited (default); # entries per chip
  3138. -gpgpu_dram_return_queue_size 192 # 0 = unlimited (default); # entries per chip
  3139. -gpgpu_dram_buswidth 2 # default = 4 bytes (8 bytes per cycle at DDR)
  3140. -gpgpu_dram_burst_length 16 # Burst length of each DRAM request (default = 4 data bus cycle)
  3141. -dram_data_command_freq_ratio 4 # Frequency ratio between DRAM data bus and command bus (default = 2 times, i.e. DDR)
  3142. -gpgpu_dram_timing_opt nbk=16:CCD=4:RRD=12:RCD=24:RAS=55:RP=24:RC=78:CL=24:WL=8:CDLR=10:WR=24:nbkgrp=4:CCDL=6:RTPL=4 # DRAM timing parameters = {nbk:tCCD:tRRD:tRCD:tRAS:tRP:tRC:CL:WL:tCDLR:tWR:nbkgrp:tCCDL:tRTPL}
  3143. -gpgpu_l2_rop_latency 187 # ROP queue latency (default 85)
  3144. -dram_latency 254 # DRAM latency (default 30)
  3145. -dram_dual_bus_interface 0 # dual_bus_interface (default = 0)
  3146. -dram_bnk_indexing_policy 0 # dram_bnk_indexing_policy (0 = normal indexing, 1 = Xoring with the higher bits) (Default = 0)
  3147. -dram_bnkgrp_indexing_policy 1 # dram_bnkgrp_indexing_policy (0 = take higher bits, 1 = take lower bits) (Default = 0)
  3148. -dram_seperate_write_queue_enable 0 # Seperate_Write_Queue_Enable
  3149. -dram_write_queue_size 32:28:16 # Write_Queue_Size
  3150. -dram_elimnate_rw_turnaround 0 # elimnate_rw_turnaround i.e set tWTR and tRTW = 0
  3151. -icnt_flit_size 40 # icnt_flit_size
  3152. -gpgpu_mem_addr_mapping dramid@8;00000000.00000000.00000000.00000000.0000RRRR.RRRRRRRR.RBBBCCCC.BCCSSSSS # mapping memory address to dram model {dramid@<start bit>;<memory address map>}
  3153. -gpgpu_mem_addr_test 0 # run sweep test to check address mapping for aliased address
  3154. -gpgpu_mem_address_mask 1 # 0 = old addressing mask, 1 = new addressing mask, 2 = new add. mask + flipped bank sel and chip sel bits
  3155. -gpgpu_memory_partition_indexing 2 # 0 = no indexing, 1 = bitwise xoring, 2 = IPoly, 3 = custom indexing
  3156. -accelwattch_xml_file accelwattch_sass_sim.xml # AccelWattch XML file
  3157. -power_simulation_enabled 0 # Turn on power simulator (1=On, 0=Off)
  3158. -power_per_cycle_dump 0 # Dump detailed power output each cycle
  3159. -hw_perf_file_name hw_perf.csv # Hardware Performance Statistics file
  3160. -hw_perf_bench_name # Kernel Name in Hardware Performance Statistics file
  3161. -power_simulation_mode 0 # Switch performance counter input for power simulation (0=Sim, 1=HW, 2=HW-Sim Hybrid)
  3162. -dvfs_enabled 0 # Turn on DVFS for power model
  3163. -aggregate_power_stats 0 # Accumulate power across all kernels
  3164. -accelwattch_hybrid_perfsim_L1_RH 0 # Get L1 Read Hits for Accelwattch-Hybrid from Accel-Sim
  3165. -accelwattch_hybrid_perfsim_L1_RM 0 # Get L1 Read Misses for Accelwattch-Hybrid from Accel-Sim
  3166. -accelwattch_hybrid_perfsim_L1_WH 0 # Get L1 Write Hits for Accelwattch-Hybrid from Accel-Sim
  3167. -accelwattch_hybrid_perfsim_L1_WM 0 # Get L1 Write Misses for Accelwattch-Hybrid from Accel-Sim
  3168. -accelwattch_hybrid_perfsim_L2_RH 0 # Get L2 Read Hits for Accelwattch-Hybrid from Accel-Sim
  3169. -accelwattch_hybrid_perfsim_L2_RM 0 # Get L2 Read Misses for Accelwattch-Hybrid from Accel-Sim
  3170. -accelwattch_hybrid_perfsim_L2_WH 0 # Get L2 Write Hits for Accelwattch-Hybrid from Accel-Sim
  3171. -accelwattch_hybrid_perfsim_L2_WM 0 # Get L2 Write Misses for Accelwattch-Hybrid from Accel-Sim
  3172. -accelwattch_hybrid_perfsim_CC_ACC 0 # Get Constant Cache Acesses for Accelwattch-Hybrid from Accel-Sim
  3173. -accelwattch_hybrid_perfsim_SHARED_ACC 0 # Get Shared Memory Acesses for Accelwattch-Hybrid from Accel-Sim
  3174. -accelwattch_hybrid_perfsim_DRAM_RD 0 # Get DRAM Reads for Accelwattch-Hybrid from Accel-Sim
  3175. -accelwattch_hybrid_perfsim_DRAM_WR 0 # Get DRAM Writes for Accelwattch-Hybrid from Accel-Sim
  3176. -accelwattch_hybrid_perfsim_NOC 0 # Get Interconnect Acesses for Accelwattch-Hybrid from Accel-Sim
  3177. -accelwattch_hybrid_perfsim_PIPE_DUTY 0 # Get Pipeline Duty Cycle Acesses for Accelwattch-Hybrid from Accel-Sim
  3178. -accelwattch_hybrid_perfsim_NUM_SM_IDLE 0 # Get Number of Idle SMs for Accelwattch-Hybrid from Accel-Sim
  3179. -accelwattch_hybrid_perfsim_CYCLES 0 # Get Executed Cycles for Accelwattch-Hybrid from Accel-Sim
  3180. -accelwattch_hybrid_perfsim_VOLTAGE 0 # Get Chip Voltage for Accelwattch-Hybrid from Accel-Sim
  3181. -power_trace_enabled 0 # produce a file for the power trace (1=On, 0=Off)
  3182. -power_trace_zlevel 6 # Compression level of the power trace output log (0=no comp, 9=highest)
  3183. -steady_power_levels_enabled 0 # produce a file for the steady power levels (1=On, 0=Off)
  3184. -steady_state_definition 8:4 # allowed deviation:number of samples
  3185. -gpgpu_max_cycle 0 # terminates gpu simulation early (0 = no limit)
  3186. -gpgpu_max_insn 0 # terminates gpu simulation early (0 = no limit)
  3187. -gpgpu_max_cta 0 # terminates gpu simulation early (0 = no limit)
  3188. -gpgpu_max_completed_cta 0 # terminates gpu simulation early (0 = no limit)
  3189. -gpgpu_runtime_stat 500 # display runtime statistics such as dram utilization {<freq>:<flag>}
  3190. -liveness_message_freq 1 # Minimum number of seconds between simulation liveness messages (0 = always print)
  3191. -gpgpu_compute_capability_major 8 # Major compute capability version number
  3192. -gpgpu_compute_capability_minor 6 # Minor compute capability version number
  3193. -gpgpu_flush_l1_cache 1 # Flush L1 cache at the end of each kernel call
  3194. -gpgpu_flush_l2_cache 0 # Flush L2 cache at the end of each kernel call
  3195. -gpgpu_deadlock_detect 1 # Stop the simulation at deadlock (1=on (default), 0=off)
  3196. -gpgpu_ptx_instruction_classification 0 # if enabled will classify ptx instruction types per kernel (Max 255 kernels now)
  3197. -gpgpu_ptx_sim_mode 0 # Select between Performance (default) or Functional simulation (1)
  3198. -gpgpu_clock_domains 1132:113200:1132:1132:3500.5 # Clock Domain Frequencies in MhZ {<Core Clock>:<ICNT Clock>:<Inter-chiplet ICNT Clock>:<L2 Clock>:<DRAM Clock>}
  3199. -gpgpu_max_concurrent_kernel 128 # maximum kernels that can run concurrently on GPU, set this value according to max resident grids for your compute capability
  3200. -gpgpu_cflog_interval 0 # Interval between each snapshot in control flow logger
  3201. -visualizer_enabled 0 # Turn on visualizer output (1=On, 0=Off)
  3202. -visualizer_outputfile NULL # Specifies the output log file for visualizer
  3203. -visualizer_zlevel 6 # Compression level of the visualizer output log (0=no comp, 9=highest)
  3204. -gpgpu_stack_size_limit 1024 # GPU thread stack size
  3205. -gpgpu_heap_size_limit 8388608 # GPU malloc heap size
  3206. -gpgpu_runtime_sync_depth_limit 2 # GPU device runtime synchronize depth
  3207. -gpgpu_runtime_pending_launch_count_limit 2048 # GPU device runtime pending launch count
  3208. -trace_enabled 0 # Turn on traces
  3209. -trace_components none # comma seperated list of traces to enable. Complete list found in trace_streams.tup. Default none
  3210. -trace_sampling_core 0 # The core which is printed using CORE_DPRINTF. Default 0
  3211. -trace_sampling_memory_partition -1 # The memory partition which is printed using MEMPART_DPRINTF. Default -1 (i.e. all)
  3212. -enable_ptx_file_line_stats 1 # Turn on PTX source line statistic profiling. (1 = On)
  3213. -ptx_line_stats_filename gpgpu_inst_stats.txt # Output file for PTX source line statistics.
  3214. -gpgpu_kernel_launch_latency 5000 # Kernel launch latency in cycles. Default: 0
  3215. -gpgpu_cdp_enabled 0 # Turn on CDP
  3216. -gpgpu_TB_launch_latency 0 # thread block launch latency in cycles. Default: 0
  3217. -trace ./kernelslist.g # traces kernel filetraces kernel file directory
  3218. -trace_opcode_latency_initiation_int 2,2 # Opcode latencies and initiation for integers in trace driven mode <latency,initiation>
  3219. -trace_opcode_latency_initiation_sp 2,1 # Opcode latencies and initiation for sp in trace driven mode <latency,initiation>
  3220. -trace_opcode_latency_initiation_dp 64,64 # Opcode latencies and initiation for dp in trace driven mode <latency,initiation>
  3221. -trace_opcode_latency_initiation_sfu 21,8 # Opcode latencies and initiation for sfu in trace driven mode <latency,initiation>
  3222. -trace_opcode_latency_initiation_tensor 32,32 # Opcode latencies and initiation for tensor in trace driven mode <latency,initiation>
  3223. -trace_opcode_latency_initiation_spec_op_1 4,4 # specialized unit config <latency,initiation>
  3224. -trace_opcode_latency_initiation_spec_op_2 200,4 # specialized unit config <latency,initiation>
  3225. -trace_opcode_latency_initiation_spec_op_3 32,32 # specialized unit config <latency,initiation>
  3226. -trace_opcode_latency_initiation_spec_op_4 4,1 # specialized unit config <latency,initiation>
  3227. -trace_opcode_latency_initiation_spec_op_5 4,4 # specialized unit config <latency,initiation>
  3228. -trace_opcode_latency_initiation_spec_op_6 4,4 # specialized unit config <latency,initiation>
  3229. -trace_opcode_latency_initiation_spec_op_7 4,4 # specialized unit config <latency,initiation>
  3230. -trace_opcode_latency_initiation_spec_op_8 4,4 # specialized unit config <latency,initiation>
  3231. DRAM Timing Options:
  3232. nbk 16 # number of banks
  3233. CCD 4 # column to column delay
  3234. RRD 12 # minimal delay between activation of rows in different banks
  3235. RCD 24 # row to column delay
  3236. RAS 55 # time needed to activate row
  3237. RP 24 # time needed to precharge (deactivate) row
  3238. RC 78 # row cycle time
  3239. CDLR 10 # switching from write to read (changes tWTR)
  3240. WR 24 # last data-in to row precharge
  3241. CL 24 # CAS latency
  3242. WL 8 # Write latency
  3243. nbkgrp 4 # number of bank groups
  3244. CCDL 6 # column to column delay between accesses to different bank groups
  3245. RTPL 4 # read to precharge delay between accesses to different bank groups
  3246. Total number of memory sub partition = 32
  3247. addr_dec_mask[CHIP] = 0000000000000f00 high:12 low:8
  3248. addr_dec_mask[BK] = 0000000000070080 high:19 low:7
  3249. addr_dec_mask[ROW] = 00000000fff80000 high:32 low:19
  3250. addr_dec_mask[COL] = 000000000000f07f high:16 low:0
  3251. addr_dec_mask[BURST] = 000000000000001f high:5 low:0
  3252. sub_partition_id_mask = 0000000000000080
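A minimal sketch of how the `addr_dec_mask` values above can be reproduced from the `-gpgpu_mem_addr_mapping` string. It is not the simulator's own code. It assumes that the channel (CHIP) bits are spliced in at the `dramid@` start bit, that their count is log2 of `-gpgpu_n_mem`, that `S` bits count toward both COL and BURST, and that the sub-partition mask takes the lowest bank bit(s). The function name `decode_masks` is hypothetical.

```python
def decode_masks(mapping, n_channels, n_sub_partitions):
    """Derive address-decode bit masks from a 'dramid@<bit>;<pattern>' string.

    Assumptions (not confirmed by the log): n_channels and n_sub_partitions
    are powers of two, and channel bits are inserted at the start bit,
    shifting every higher pattern bit up by log2(n_channels).
    """
    head, pattern = mapping.split(";")
    start = int(head.split("@")[1])
    bits = pattern.replace(".", "")[::-1]      # index 0 is the least significant bit
    chip_bits = n_channels.bit_length() - 1    # log2 for a power of two

    masks = {"CHIP": ((1 << chip_bits) - 1) << start,
             "BK": 0, "ROW": 0, "COL": 0, "BURST": 0}
    field = {"B": "BK", "R": "ROW", "C": "COL", "S": "COL"}

    for i, letter in enumerate(bits):
        if letter not in field:
            continue                           # '0' marks an unused bit
        pos = i + chip_bits if i >= start else i
        masks[field[letter]] |= 1 << pos
        if letter == "S":                      # burst bits are also column bits
            masks["BURST"] |= 1 << pos

    # Assumed rule: sub-partition ID uses the lowest bank bit(s).
    sub, bank = 0, masks["BK"]
    for _ in range(n_sub_partitions.bit_length() - 1):
        lowest = bank & -bank
        sub |= lowest
        bank ^= lowest
    masks["SUB_PARTITION"] = sub
    return masks


MAPPING = ("dramid@8;00000000.00000000.00000000.00000000."
           "0000RRRR.RRRRRRRR.RBBBCCCC.BCCSSSSS")
masks = decode_masks(MAPPING, n_channels=16, n_sub_partitions=2)
for name, value in masks.items():
    print(f"{name:14s} = {value:016x}")
```

With `-gpgpu_n_mem 16` and `-gpgpu_n_sub_partition_per_mchannel 2` from the option dump, the sketch yields the same hexadecimal masks as the log lines above.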
  3253. GPGPU-Sim uArch: clock freqs: 1132000000.000000:113200000000.000000:1132000000.000000:1132000000.000000:3500500000.000000
  3254. GPGPU-Sim uArch: clock periods: 0.00000000088339222615:0.00000000088339222615:0.00000000000000000000:0.00000000088339222615:0.00000000028567347522
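A small sketch relating the `-gpgpu_clock_domains` string to the clock periods printed above, assuming each period is simply 1/f with f given in MHz. The helper name `clock_periods` is hypothetical. Under that assumption the core, L2 and DRAM periods agree with the log. The second and third printed periods do not equal 1/f of the corresponding printed frequencies, and the sketch does not try to explain that.

```python
def clock_periods(domains):
    """Convert a ':'-separated list of frequencies in MHz to periods in seconds."""
    freqs_mhz = [float(f) for f in domains.split(":")]
    return [1.0 / (f * 1e6) for f in freqs_mhz]


# Field order as documented in the option dump:
# core, ICNT, inter-chiplet ICNT, L2, DRAM
NAMES = ["core", "icnt", "inter_chiplet_icnt", "l2", "dram"]
periods = clock_periods("1132:113200:1132:1132:3500.5")
for name, period in zip(NAMES, periods):
    print(f"{name:20s} {period:.20f} s")
```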
  3255. *** Initializing Memory Statistics ***
  3256. GPGPU-Sim uArch: interconnect node map (shaderID+MemID to icntID)
  3257. GPGPU-Sim uArch: Memory nodes ID start from index: 32
  3258. GPGPU-Sim uArch: 0 1 2 3 4 5 6 7
  3259. GPGPU-Sim uArch: 8 9 10 11 12 13 14 15
  3260. GPGPU-Sim uArch: 16 17 18 19 20 21 22 23
  3261. GPGPU-Sim uArch: 24 25 26 27 28 29 30 31
  3262. GPGPU-Sim uArch: 32 33 34 35 36 37 38 39
  3263. GPGPU-Sim uArch: 40 41 42 43 44 45 46 47
  3264. GPGPU-Sim uArch: 48 49 50 51 52 53 54 55
  3265. GPGPU-Sim uArch: 56 57 58 59 60 61 62 63
  3266. GPGPU-Sim uArch: interconnect node reverse map (icntID to shaderID+MemID)
  3267. GPGPU-Sim uArch: Memory nodes start from ID: 32
  3268. GPGPU-Sim uArch: 0 1 2 3 4 5 6 7
  3269. GPGPU-Sim uArch: 8 9 10 11 12 13 14 15
  3270. GPGPU-Sim uArch: 16 17 18 19 20 21 22 23
  3271. GPGPU-Sim uArch: 24 25 26 27 28 29 30 31
  3272. GPGPU-Sim uArch: 32 33 34 35 36 37 38 39
  3273. GPGPU-Sim uArch: 40 41 42 43 44 45 46 47
  3274. GPGPU-Sim uArch: 48 49 50 51 52 53 54 55
  3275. GPGPU-Sim uArch: 56 57 58 59 60 61 62 63
  3276. GPGPU-Sim uArch: interconnect node map (shaderID+MemID to icntID)
  3277. GPGPU-Sim uArch: Memory nodes ID start from index: 8
  3278. GPGPU-Sim uArch: 0 1 2 3
  3279. GPGPU-Sim uArch: 4 5 6 7
  3280. GPGPU-Sim uArch: 8 9 10 11
  3281. GPGPU-Sim uArch: 12 13 14 15
  3282. GPGPU-Sim uArch: interconnect node reverse map (icntID to shaderID+MemID)
  3283. GPGPU-Sim uArch: Memory nodes start from ID: 8
  3284. GPGPU-Sim uArch: 0 1 2 3
  3285. GPGPU-Sim uArch: 4 5 6 7
  3286. GPGPU-Sim uArch: 8 9 10 11
  3287. GPGPU-Sim uArch: 12 13 14 15
  3288. GPGPU-Sim uArch: performance model initialization complete.
  3289. launching memcpy command : MemcpyHtoD,0x00007f9d94000000,85191092
  3290. launching memcpy command : MemcpyHtoD,0x00007f9d14000000,2120204360
  3291. launching memcpy command : MemcpyHtoD,0x00007f9d0e000000,85191092
  3292. launching memcpy command : MemcpyHtoD,0x00007f9b00000000,2120204360
  3293. launching memcpy command : MemcpyHtoD,0x00007f9b7e600000,21297772
  3294. launching memcpy command : MemcpyHtoD,0x00007f9cee000000,530051090
  3295. launching memcpy command : MemcpyHtoD,0x00007f9dd6101000,28
  3296. launching memcpy command : MemcpyHtoD,0x00007f9dd6101600,491520
  3297. launching memcpy command : MemcpyHtoD,0x00007f9dd6100000,688
  3298. launching memcpy command : MemcpyHtoD,0x00007f9dd6100600,4
  3299. launching memcpy command : MemcpyHtoD,0x00007f9dd6100800,4
  3300. launching memcpy command : MemcpyHtoD,0x00007f9dd6101000,28
  3301. Processing kernel ./kernel-10.traceg
  3302. -kernel name = _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_
  3303. -kernel id = 10
  3304. -grid dim = (544,1,1)
  3305. -block dim = (256,1,1)
  3306. -shmem = 12160
  3307. -nregs = 34
  3308. -binary version = 86
  3309. -cuda stream id = -685638256
  3310. -shmem base_addr = 0x00007f9e58000000
  3311. -local mem base_addr = 0x00007f9e56000000
  3312. -nvbit version = 1.5.3
  3313. -accelsim tracer version = 3
  3314. Header info loaded for kernel command : ./kernel-10.traceg
  3315. launching kernel name: _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_ uid: 1
  3316. GPGPU-Sim uArch: Shader 0 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3317. GPGPU-Sim uArch: CTA/core = 6, limited by: threads
  3318. GPGPU-Sim: Reconfigure L1 cache to 28KB
  3319. thread block = 0,0,0
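The `CTA/core = 6, limited by: threads` and `Reconfigure L1 cache to 28KB` lines above are consistent with the following back-of-the-envelope calculation. This is a sketch, not the simulator's implementation. It assumes the per-core CTA count is the minimum over the thread, register, shared-memory and CTA-slot limits, and that the adaptive cache picks the smallest `-gpgpu_shmem_option` entry that fits the resident CTAs and gives the rest of `-gpgpu_unified_l1d_size` to L1. The function names are hypothetical and register allocation granularity is ignored.

```python
def ctas_per_core(threads_per_cta, regs_per_thread, shmem_per_cta,
                  max_threads=1536, max_regs=65536,
                  max_shmem=102400, max_ctas=32):
    """Return (CTAs per core, name of the limiting resource)."""
    limits = {
        "threads": max_threads // threads_per_cta,
        "regs": max_regs // (threads_per_cta * regs_per_thread),
        "shmem": max_shmem // shmem_per_cta if shmem_per_cta else max_ctas,
        "cta_slots": max_ctas,
    }
    limiter = min(limits, key=limits.get)
    return limits[limiter], limiter


def l1_size_kb(ctas, shmem_per_cta, unified_kb=128,
               options_kb=(0, 8, 16, 32, 64, 100)):
    """Assumed rule: smallest shared-memory option that fits, rest goes to L1."""
    needed_bytes = ctas * shmem_per_cta
    for option in sorted(options_kb):
        if option * 1024 >= needed_bytes:
            return unified_kb - option
    raise ValueError("shared memory demand exceeds the largest option")


# Values from the kernel-10 header: block dim 256, nregs 34, shmem 12160.
ctas, limiter = ctas_per_core(256, 34, 12160)
l1_kb = l1_size_kb(ctas, 12160)
print(f"CTA/core = {ctas}, limited by: {limiter}")
print(f"L1 cache = {l1_kb} KB")
```

For this kernel the individual limits work out to 6 (threads), 7 (registers), 8 (shared memory) and 32 (CTA slots), so threads are the binding resource.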
  3320. GPGPU-Sim uArch: Shader 1 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3321. thread block = 1,0,0
  3322. GPGPU-Sim uArch: Shader 2 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3323. thread block = 2,0,0
  3324. GPGPU-Sim uArch: Shader 3 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3325. thread block = 3,0,0
  3326. GPGPU-Sim uArch: Shader 4 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3327. thread block = 4,0,0
  3328. GPGPU-Sim uArch: Shader 5 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3329. thread block = 5,0,0
  3330. GPGPU-Sim uArch: Shader 6 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3331. thread block = 6,0,0
  3332. GPGPU-Sim uArch: Shader 7 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3333. thread block = 7,0,0
  3334. GPGPU-Sim uArch: Shader 8 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3335. thread block = 8,0,0
  3336. GPGPU-Sim uArch: Shader 9 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3337. thread block = 9,0,0
  3338. GPGPU-Sim uArch: Shader 10 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3339. thread block = 10,0,0
  3340. GPGPU-Sim uArch: Shader 11 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3341. thread block = 11,0,0
  3342. GPGPU-Sim uArch: Shader 12 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3343. thread block = 12,0,0
  3344. GPGPU-Sim uArch: Shader 13 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3345. thread block = 13,0,0
  3346. GPGPU-Sim uArch: Shader 14 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3347. thread block = 14,0,0
  3348. GPGPU-Sim uArch: Shader 15 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3349. thread block = 15,0,0
  3350. GPGPU-Sim uArch: Shader 16 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3351. thread block = 16,0,0
  3352. GPGPU-Sim uArch: Shader 17 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3353. thread block = 17,0,0
  3354. GPGPU-Sim uArch: Shader 18 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3355. thread block = 18,0,0
  3356. GPGPU-Sim uArch: Shader 19 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3357. thread block = 19,0,0
  3358. GPGPU-Sim uArch: Shader 20 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3359. thread block = 20,0,0
  3360. GPGPU-Sim uArch: Shader 21 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3361. thread block = 21,0,0
  3362. GPGPU-Sim uArch: Shader 22 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3363. thread block = 22,0,0
  3364. GPGPU-Sim uArch: Shader 23 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3365. thread block = 23,0,0
  3366. GPGPU-Sim uArch: Shader 24 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3367. thread block = 24,0,0
  3368. GPGPU-Sim uArch: Shader 25 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3369. thread block = 25,0,0
  3370. GPGPU-Sim uArch: Shader 26 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3371. thread block = 26,0,0
  3372. GPGPU-Sim uArch: Shader 27 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3373. thread block = 27,0,0
  3374. GPGPU-Sim uArch: Shader 28 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3375. thread block = 28,0,0
  3376. GPGPU-Sim uArch: Shader 29 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3377. thread block = 29,0,0
  3378. GPGPU-Sim uArch: Shader 30 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3379. thread block = 30,0,0
  3380. GPGPU-Sim uArch: Shader 31 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
  3381. thread block = 31,0,0
  3382. thread block = 32,0,0
  3383. thread block = 33,0,0
  3384. thread block = 34,0,0
  3385. thread block = 35,0,0
  3386. thread block = 36,0,0
  3387. thread block = 37,0,0
  3388. thread block = 38,0,0
  3389. thread block = 39,0,0
  3390. thread block = 40,0,0
  3391. thread block = 41,0,0
  3392. thread block = 42,0,0
  3393. thread block = 43,0,0
  3394. thread block = 44,0,0
  3395. thread block = 45,0,0
  3396. thread block = 46,0,0
  3397. thread block = 47,0,0
  3398. thread block = 48,0,0
  3399. thread block = 49,0,0
  3400. thread block = 50,0,0
  3401. thread block = 51,0,0
  3402. thread block = 52,0,0
  3403. thread block = 53,0,0
  3404. thread block = 54,0,0
  3405. thread block = 55,0,0
  3406. thread block = 56,0,0
  3407. thread block = 57,0,0
  3408. thread block = 58,0,0
  3409. thread block = 59,0,0
  3410. thread block = 60,0,0
  3411. thread block = 61,0,0
  3412. thread block = 62,0,0
  3413. thread block = 63,0,0
  3414. thread block = 64,0,0
  3415. thread block = 65,0,0
  3416. thread block = 66,0,0
  3417. thread block = 67,0,0
  3418. thread block = 68,0,0
  3419. thread block = 69,0,0
  3420. thread block = 70,0,0
  3421. thread block = 71,0,0
  3422. thread block = 72,0,0
  3423. thread block = 73,0,0
  3424. thread block = 74,0,0
  3425. thread block = 75,0,0
  3426. thread block = 76,0,0
  3427. thread block = 77,0,0
  3428. thread block = 78,0,0
  3429. thread block = 79,0,0
  3430. thread block = 80,0,0
  3431. thread block = 81,0,0
  3432. thread block = 82,0,0
  3433. thread block = 83,0,0
  3434. thread block = 84,0,0
  3435. thread block = 85,0,0
  3436. thread block = 86,0,0
  3437. thread block = 87,0,0
  3438. thread block = 88,0,0
  3439. thread block = 89,0,0
  3440. thread block = 90,0,0
  3441. thread block = 91,0,0
  3442. thread block = 92,0,0
  3443. thread block = 93,0,0
  3444. thread block = 94,0,0
  3445. thread block = 95,0,0
  3446. thread block = 96,0,0
  3447. thread block = 97,0,0
  3448. thread block = 98,0,0
  3449. thread block = 99,0,0
  3450. thread block = 100,0,0
  3451. thread block = 101,0,0
  3452. thread block = 102,0,0
  3453. thread block = 103,0,0
  3454. thread block = 104,0,0
  3455. thread block = 105,0,0
  3456. thread block = 106,0,0
  3457. thread block = 107,0,0
  3458. thread block = 108,0,0
  3459. thread block = 109,0,0
  3460. thread block = 110,0,0
  3461. thread block = 111,0,0
  3462. thread block = 112,0,0
  3463. thread block = 113,0,0
  3464. thread block = 114,0,0
  3465. thread block = 115,0,0
  3466. thread block = 116,0,0
  3467. thread block = 117,0,0
  3468. thread block = 118,0,0
  3469. thread block = 119,0,0
  3470. thread block = 120,0,0
  3471. thread block = 121,0,0
  3472. thread block = 122,0,0
  3473. thread block = 123,0,0
  3474. thread block = 124,0,0
  3475. thread block = 125,0,0
  3476. thread block = 126,0,0
  3477. thread block = 127,0,0
  3478. thread block = 128,0,0
  3479. thread block = 129,0,0
  3480. thread block = 130,0,0
  3481. thread block = 131,0,0
  3482. thread block = 132,0,0
  3483. thread block = 133,0,0
  3484. thread block = 134,0,0
  3485. thread block = 135,0,0
  3486. thread block = 136,0,0
  3487. thread block = 137,0,0
  3488. thread block = 138,0,0
  3489. thread block = 139,0,0
  3490. thread block = 140,0,0
  3491. thread block = 141,0,0
  3492. thread block = 142,0,0
  3493. thread block = 143,0,0
  3494. thread block = 144,0,0
  3495. thread block = 145,0,0
  3496. thread block = 146,0,0
  3497. thread block = 147,0,0
  3498. thread block = 148,0,0
  3499. thread block = 149,0,0
  3500. thread block = 150,0,0
  3501. thread block = 151,0,0
  3502. thread block = 152,0,0
  3503. thread block = 153,0,0
  3504. thread block = 154,0,0
  3505. thread block = 155,0,0
  3506. thread block = 156,0,0
  3507. thread block = 157,0,0
  3508. thread block = 158,0,0
  3509. thread block = 159,0,0
  3510. thread block = 160,0,0
  3511. thread block = 161,0,0
  3512. thread block = 162,0,0
  3513. thread block = 163,0,0
  3514. thread block = 164,0,0
  3515. thread block = 165,0,0
  3516. thread block = 166,0,0
  3517. thread block = 167,0,0
  3518. thread block = 168,0,0
  3519. thread block = 169,0,0
  3520. thread block = 170,0,0
  3521. thread block = 171,0,0
  3522. thread block = 172,0,0
  3523. thread block = 173,0,0
  3524. thread block = 174,0,0
  3525. thread block = 175,0,0
  3526. thread block = 176,0,0
  3527. thread block = 177,0,0
  3528. thread block = 178,0,0
  3529. thread block = 179,0,0
  3530. thread block = 180,0,0
  3531. thread block = 181,0,0
  3532. thread block = 182,0,0
  3533. thread block = 183,0,0
  3534. thread block = 184,0,0
  3535. thread block = 185,0,0
  3536. thread block = 186,0,0
  3537. thread block = 187,0,0
  3538. thread block = 188,0,0
  3539. thread block = 189,0,0
  3540. thread block = 190,0,0
  3541. thread block = 191,0,0
  3542. thread block = 192,0,0
  3543. thread block = 193,0,0
  3544. thread block = 194,0,0
  3545. thread block = 195,0,0
  3546. thread block = 196,0,0
  3547. thread block = 197,0,0
  3548. thread block = 198,0,0
  3549. thread block = 199,0,0
  3550. thread block = 200,0,0
  3551. thread block = 201,0,0
  3552. thread block = 202,0,0
  3553. thread block = 203,0,0
  3554. thread block = 204,0,0
  3555. thread block = 205,0,0
  3556. thread block = 206,0,0
  3557. thread block = 207,0,0
  3558. thread block = 208,0,0
  3559. thread block = 209,0,0
  3560. thread block = 210,0,0
  3561. thread block = 211,0,0
  3562. thread block = 212,0,0
  3563. thread block = 213,0,0
  3564. thread block = 214,0,0
  3565. thread block = 215,0,0
  3566. thread block = 216,0,0
  3567. thread block = 217,0,0
  3568. thread block = 218,0,0
  3569. thread block = 219,0,0
  3570. thread block = 220,0,0
  3571. thread block = 221,0,0
  3572. thread block = 222,0,0
  3573. thread block = 223,0,0
  3574. thread block = 224,0,0
  3575. thread block = 225,0,0
  3576. thread block = 226,0,0
  3577. thread block = 227,0,0
  3578. thread block = 228,0,0
  3579. thread block = 229,0,0
  3580. thread block = 230,0,0
  3581. thread block = 231,0,0
  3582. thread block = 232,0,0
  3583. thread block = 233,0,0
  3584. thread block = 234,0,0
  3585. thread block = 235,0,0
  3586. thread block = 236,0,0
  3587. thread block = 237,0,0
  3588. thread block = 238,0,0
  3589. thread block = 239,0,0
  3590. thread block = 240,0,0
  3591. thread block = 241,0,0
  3592. thread block = 242,0,0
  3593. thread block = 243,0,0
  3594. thread block = 244,0,0
  3595. thread block = 245,0,0
  3596. thread block = 246,0,0
  3597. thread block = 247,0,0
  3598. thread block = 248,0,0
  3599. thread block = 249,0,0
  3600. thread block = 250,0,0
  3601. thread block = 251,0,0
  3602. thread block = 252,0,0
  3603. thread block = 253,0,0
  3604. thread block = 254,0,0
  3605. thread block = 255,0,0
  3606. thread block = 256,0,0
  3607. thread block = 257,0,0
  3608. thread block = 258,0,0
  3609. thread block = 259,0,0
  3610. thread block = 260,0,0
  3611. thread block = 261,0,0
  3612. thread block = 262,0,0
  3613. thread block = 263,0,0
  3614. thread block = 264,0,0
  3615. thread block = 265,0,0
  3616. thread block = 266,0,0
  3617. thread block = 267,0,0
  3618. thread block = 268,0,0
  3619. thread block = 269,0,0
  3620. thread block = 270,0,0
  3621. thread block = 271,0,0
  3622. thread block = 272,0,0
  3623. thread block = 273,0,0
  3624. thread block = 274,0,0
  3625. thread block = 275,0,0
  3626. thread block = 276,0,0
  3627. thread block = 277,0,0
  3628. thread block = 278,0,0
  3629. thread block = 279,0,0
  3630. thread block = 280,0,0
  3631. thread block = 281,0,0
  3632. thread block = 282,0,0
  3633. thread block = 283,0,0
  3634. thread block = 284,0,0
  3635. thread block = 285,0,0
  3636. thread block = 286,0,0
  3637. thread block = 287,0,0
  3638. thread block = 288,0,0
  3639. thread block = 289,0,0
  3640. thread block = 290,0,0
  3641. thread block = 291,0,0
  3642. thread block = 292,0,0
  3643. thread block = 293,0,0
  3644. thread block = 294,0,0
  3645. thread block = 295,0,0
  3646. thread block = 296,0,0
  3647. thread block = 297,0,0
  3648. thread block = 298,0,0
  3649. thread block = 299,0,0
  3650. thread block = 300,0,0
  3651. thread block = 301,0,0
  3652. thread block = 302,0,0
  3653. thread block = 303,0,0
  3654. thread block = 304,0,0
  3655. thread block = 305,0,0
  3656. thread block = 306,0,0
  3657. thread block = 307,0,0
  3658. thread block = 308,0,0
  3659. thread block = 309,0,0
  3660. thread block = 310,0,0
  3661. thread block = 311,0,0
  3662. thread block = 312,0,0
  3663. thread block = 313,0,0
  3664. thread block = 314,0,0
  3665. thread block = 315,0,0
  3666. thread block = 316,0,0
  3667. thread block = 317,0,0
  3668. thread block = 318,0,0
  3669. thread block = 319,0,0
  3670. thread block = 320,0,0
  3671. thread block = 321,0,0
  3672. thread block = 322,0,0
  3673. thread block = 323,0,0
  3674. thread block = 324,0,0
  3675. thread block = 325,0,0
  3676. thread block = 326,0,0
  3677. thread block = 327,0,0
  3678. thread block = 328,0,0
  3679. thread block = 329,0,0
  3680. thread block = 330,0,0
  3681. thread block = 331,0,0
  3682. thread block = 332,0,0
  3683. thread block = 333,0,0
  3684. thread block = 334,0,0
  3685. thread block = 335,0,0
  3686. thread block = 336,0,0
  3687. thread block = 337,0,0
  3688. thread block = 338,0,0
  3689. thread block = 339,0,0
  3690. thread block = 340,0,0
  3691. thread block = 341,0,0
  3692. thread block = 342,0,0
  3693. thread block = 343,0,0
  3694. thread block = 344,0,0
  3695. thread block = 345,0,0
  3696. thread block = 346,0,0
  3697. thread block = 347,0,0
  3698. thread block = 348,0,0
  3699. thread block = 349,0,0
  3700. thread block = 350,0,0
  3701. thread block = 351,0,0
  3702. thread block = 352,0,0
  3703. thread block = 353,0,0
  3704. thread block = 354,0,0
  3705. thread block = 355,0,0
  3706. thread block = 356,0,0
  3707. thread block = 357,0,0
  3708. thread block = 358,0,0
  3709. thread block = 359,0,0
  3710. thread block = 360,0,0
  3711. thread block = 361,0,0
  3712. thread block = 362,0,0
  3713. thread block = 363,0,0
  3714. thread block = 364,0,0
  3715. thread block = 365,0,0
  3716. thread block = 366,0,0
  3717. thread block = 367,0,0
  3718. thread block = 368,0,0
  3719. thread block = 369,0,0
  3720. thread block = 370,0,0
  3721. thread block = 371,0,0
  3722. thread block = 372,0,0
  3723. thread block = 373,0,0
  3724. thread block = 374,0,0
  3725. thread block = 375,0,0
  3726. thread block = 376,0,0
  3727. thread block = 377,0,0
  3728. thread block = 378,0,0
  3729. thread block = 379,0,0
  3730. thread block = 380,0,0
  3731. thread block = 381,0,0
  3732. thread block = 382,0,0
  3733. thread block = 383,0,0
  3734. thread block = 384,0,0
  3735. thread block = 385,0,0
  3736. thread block = 386,0,0
  3737. thread block = 387,0,0
  3738. thread block = 388,0,0
  3739. thread block = 389,0,0
  3740. thread block = 390,0,0
  3741. thread block = 391,0,0
  3742. thread block = 392,0,0
  3743. thread block = 393,0,0
  3744. thread block = 394,0,0
  3745. thread block = 395,0,0
  3746. thread block = 396,0,0
  3747. thread block = 397,0,0
  3748. thread block = 398,0,0
  3749. thread block = 399,0,0
  3750. thread block = 400,0,0
  3751. thread block = 401,0,0
  3752. thread block = 402,0,0
  3753. thread block = 403,0,0
  3754. thread block = 404,0,0
  3755. thread block = 405,0,0
  3756. thread block = 406,0,0
  3757. thread block = 407,0,0
  3758. thread block = 408,0,0
  3759. thread block = 409,0,0
  3760. thread block = 410,0,0
  3761. thread block = 411,0,0
  3762. thread block = 412,0,0
  3763. thread block = 413,0,0
  3764. thread block = 414,0,0
  3765. thread block = 415,0,0
  3766. thread block = 416,0,0
  3767. thread block = 417,0,0
  3768. thread block = 418,0,0
  3769. thread block = 419,0,0
  3770. thread block = 420,0,0
  3771. thread block = 421,0,0
  3772. thread block = 422,0,0
  3773. thread block = 423,0,0
  3774. thread block = 424,0,0
  3775. thread block = 425,0,0
  3776. thread block = 426,0,0
  3777. thread block = 427,0,0
  3778. thread block = 428,0,0
  3779. thread block = 429,0,0
  3780. thread block = 430,0,0
  3781. thread block = 431,0,0
  3782. thread block = 432,0,0
  3783. thread block = 433,0,0
  3784. thread block = 434,0,0
  3785. thread block = 435,0,0
  3786. thread block = 436,0,0
  3787. thread block = 437,0,0
  3788. thread block = 438,0,0
  3789. thread block = 439,0,0
  3790. thread block = 440,0,0
  3791. thread block = 441,0,0
  3792. thread block = 442,0,0
  3793. thread block = 443,0,0
  3794. thread block = 444,0,0
  3795. thread block = 445,0,0
  3796. thread block = 446,0,0
  3797. thread block = 447,0,0
  3798. thread block = 448,0,0
  3799. thread block = 449,0,0
  3800. thread block = 450,0,0
  3801. thread block = 451,0,0
  3802. thread block = 452,0,0
  3803. thread block = 453,0,0
  3804. thread block = 454,0,0
  3805. thread block = 455,0,0
  3806. thread block = 456,0,0
  3807. thread block = 457,0,0
  3808. thread block = 458,0,0
  3809. thread block = 459,0,0
  3810. thread block = 460,0,0
  3811. thread block = 461,0,0
  3812. thread block = 462,0,0
  3813. thread block = 463,0,0
  3814. thread block = 464,0,0
  3815. thread block = 465,0,0
  3816. thread block = 466,0,0
  3817. thread block = 467,0,0
  3818. thread block = 468,0,0
  3819. thread block = 469,0,0
  3820. thread block = 470,0,0
  3821. thread block = 471,0,0
  3822. thread block = 472,0,0
  3823. thread block = 473,0,0
  3824. thread block = 474,0,0
  3825. thread block = 475,0,0
  3826. thread block = 476,0,0
  3827. thread block = 477,0,0
  3828. thread block = 478,0,0
  3829. thread block = 479,0,0
  3830. thread block = 480,0,0
  3831. thread block = 481,0,0
  3832. thread block = 482,0,0
  3833. thread block = 483,0,0
  3834. thread block = 484,0,0
  3835. thread block = 485,0,0
  3836. thread block = 486,0,0
  3837. thread block = 487,0,0
  3838. thread block = 488,0,0
  3839. thread block = 489,0,0
  3840. thread block = 490,0,0
  3841. thread block = 491,0,0
  3842. thread block = 492,0,0
  3843. thread block = 493,0,0
  3844. thread block = 494,0,0
  3845. thread block = 495,0,0
  3846. thread block = 496,0,0
  3847. thread block = 497,0,0
  3848. thread block = 498,0,0
  3849. thread block = 499,0,0
  3850. thread block = 500,0,0
  3851. thread block = 501,0,0
  3852. thread block = 502,0,0
  3853. thread block = 503,0,0
  3854. thread block = 504,0,0
  3855. thread block = 505,0,0
  3856. thread block = 506,0,0
  3857. thread block = 507,0,0
  3858. thread block = 508,0,0
  3859. thread block = 509,0,0
  3860. thread block = 510,0,0
  3861. thread block = 511,0,0
  3862. thread block = 512,0,0
  3863. thread block = 513,0,0
  3864. thread block = 514,0,0
  3865. thread block = 515,0,0
  3866. thread block = 516,0,0
  3867. thread block = 517,0,0
  3868. thread block = 518,0,0
  3869. thread block = 519,0,0
  3870. thread block = 520,0,0
  3871. thread block = 521,0,0
  3872. thread block = 522,0,0
  3873. thread block = 523,0,0
  3874. thread block = 524,0,0
  3875. thread block = 525,0,0
  3876. thread block = 526,0,0
  3877. thread block = 527,0,0
  3878. thread block = 528,0,0
  3879. thread block = 529,0,0
  3880. thread block = 530,0,0
  3881. thread block = 531,0,0
  3882. thread block = 532,0,0
  3883. thread block = 533,0,0
  3884. thread block = 534,0,0
  3885. thread block = 535,0,0
  3886. thread block = 536,0,0
  3887. thread block = 537,0,0
  3888. thread block = 538,0,0
  3889. thread block = 539,0,0
  3890. thread block = 540,0,0
  3891. thread block = 541,0,0
  3892. thread block = 542,0,0
  3893. thread block = 543,0,0
  3894. Destroy streams for kernel 1: size 0
  3895. kernel_name = _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_
  3896. kernel_launch_uid = 1
  3897. Hossein: Number of Local Requests: 35
  3898. Hossein: Number of Remote Requests: 513
  3899. gpu_sim_cycle = 8748
  3900. gpu_sim_insn = 4086966
  3901. gpu_ipc = 467.1886
  3902. gpu_tot_sim_cycle = 8748
  3903. gpu_tot_sim_insn = 4086966
  3904. gpu_tot_ipc = 467.1886
  3905. gpu_tot_issued_cta = 544
  3906. gpu_occupancy = 87.7916%
  3907. gpu_tot_occupancy = 87.7916%
  3908. max_total_param_size = 0
  3909. gpu_stall_dramfull = 336
  3910. gpu_stall_icnt2sh = 0
  3911. partiton_level_parallism = 0.0626
  3912. partiton_level_parallism_total = 0.0626
  3913. partiton_level_parallism_util = 1.0000
  3914. partiton_level_parallism_util_total = 1.0000
  3915. L2_BW = 2.2692 GB/Sec
  3916. L2_BW_total = 2.2692 GB/Sec
  3917. gpu_total_sim_rate=85145
  3918.  
  3919. ========= Core cache stats =========
  3920. L1I_cache:
  3921. L1I_total_cache_accesses = 0
  3922. L1I_total_cache_misses = 0
  3923. L1I_total_cache_pending_hits = 0
  3924. L1I_total_cache_reservation_fails = 0
  3925. L1D_cache:
  3926. L1D_cache_core[0]: Access = 2, Miss = 2, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
  3927. L1D_cache_core[1]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3928. L1D_cache_core[2]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3929. L1D_cache_core[3]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3930. L1D_cache_core[4]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3931. L1D_cache_core[5]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3932. L1D_cache_core[6]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3933. L1D_cache_core[7]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3934. L1D_cache_core[8]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3935. L1D_cache_core[9]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3936. L1D_cache_core[10]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3937. L1D_cache_core[11]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3938. L1D_cache_core[12]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3939. L1D_cache_core[13]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3940. L1D_cache_core[14]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3941. L1D_cache_core[15]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3942. L1D_cache_core[16]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3943. L1D_cache_core[17]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3944. L1D_cache_core[18]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3945. L1D_cache_core[19]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3946. L1D_cache_core[20]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3947. L1D_cache_core[21]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3948. L1D_cache_core[22]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3949. L1D_cache_core[23]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3950. L1D_cache_core[24]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3951. L1D_cache_core[25]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3952. L1D_cache_core[26]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3953. L1D_cache_core[27]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3954. L1D_cache_core[28]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3955. L1D_cache_core[29]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3956. L1D_cache_core[30]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3957. L1D_cache_core[31]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  3958. L1D_total_cache_accesses = 2
  3959. L1D_total_cache_misses = 2
  3960. L1D_total_cache_miss_rate = 1.0000
  3961. L1D_total_cache_pending_hits = 0
  3962. L1D_total_cache_reservation_fails = 0
  3963. L1D_cache_data_port_util = 0.000
  3964. L1D_cache_fill_port_util = 0.000
  3965. L1C_cache:
  3966. L1C_total_cache_accesses = 0
  3967. L1C_total_cache_misses = 0
  3968. L1C_total_cache_pending_hits = 0
  3969. L1C_total_cache_reservation_fails = 0
  3970. L1T_cache:
  3971. L1T_total_cache_accesses = 0
  3972. L1T_total_cache_misses = 0
  3973. L1T_total_cache_pending_hits = 0
  3974. L1T_total_cache_reservation_fails = 0
  3975.  
  3976. Total_core_cache_stats:
  3977. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][HIT] = 0
  3978. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][HIT_RESERVED] = 0
  3979. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][MISS] = 0
  3980. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][RESERVATION_FAIL] = 0
  3981. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][SECTOR_MISS] = 0
  3982. Total_core_cache_stats_breakdown[GLOBAL_ACC_R][MSHR_HIT] = 0
  3983. Total_core_cache_stats_breakdown[LOCAL_ACC_R][HIT] = 0
  3984. Total_core_cache_stats_breakdown[LOCAL_ACC_R][HIT_RESERVED] = 0
  3985. Total_core_cache_stats_breakdown[LOCAL_ACC_R][MISS] = 0
  3986. Total_core_cache_stats_breakdown[LOCAL_ACC_R][RESERVATION_FAIL] = 0
  3987. Total_core_cache_stats_breakdown[LOCAL_ACC_R][SECTOR_MISS] = 0
  3988. Total_core_cache_stats_breakdown[LOCAL_ACC_R][MSHR_HIT] = 0
  3989. Total_core_cache_stats_breakdown[CONST_ACC_R][HIT] = 0
  3990. Total_core_cache_stats_breakdown[CONST_ACC_R][HIT_RESERVED] = 0
  3991. Total_core_cache_stats_breakdown[CONST_ACC_R][MISS] = 0
  3992. Total_core_cache_stats_breakdown[CONST_ACC_R][RESERVATION_FAIL] = 0
  3993. Total_core_cache_stats_breakdown[CONST_ACC_R][SECTOR_MISS] = 0
  3994. Total_core_cache_stats_breakdown[CONST_ACC_R][MSHR_HIT] = 0
  3995. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][HIT] = 0
  3996. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][HIT_RESERVED] = 0
  3997. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][MISS] = 0
  3998. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][RESERVATION_FAIL] = 0
  3999. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][SECTOR_MISS] = 0
  4000. Total_core_cache_stats_breakdown[TEXTURE_ACC_R][MSHR_HIT] = 0
  4001. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][HIT] = 0
  4002. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][HIT_RESERVED] = 0
  4003. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][MISS] = 2
  4004. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][RESERVATION_FAIL] = 0
  4005. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][SECTOR_MISS] = 0
  4006. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][MSHR_HIT] = 0
  4007. Total_core_cache_stats_breakdown[LOCAL_ACC_W][HIT] = 0
  4008. Total_core_cache_stats_breakdown[LOCAL_ACC_W][HIT_RESERVED] = 0
  4009. Total_core_cache_stats_breakdown[LOCAL_ACC_W][MISS] = 0
  4010. Total_core_cache_stats_breakdown[LOCAL_ACC_W][RESERVATION_FAIL] = 0
  4011. Total_core_cache_stats_breakdown[LOCAL_ACC_W][SECTOR_MISS] = 0
  4012. Total_core_cache_stats_breakdown[LOCAL_ACC_W][MSHR_HIT] = 0
  4013. Total_core_cache_stats_breakdown[L1_WRBK_ACC][HIT] = 0
  4014. Total_core_cache_stats_breakdown[L1_WRBK_ACC][HIT_RESERVED] = 0
  4015. Total_core_cache_stats_breakdown[L1_WRBK_ACC][MISS] = 0
  4016. Total_core_cache_stats_breakdown[L1_WRBK_ACC][RESERVATION_FAIL] = 0
  4017. Total_core_cache_stats_breakdown[L1_WRBK_ACC][SECTOR_MISS] = 0
  4018. Total_core_cache_stats_breakdown[L1_WRBK_ACC][MSHR_HIT] = 0
  4019. Total_core_cache_stats_breakdown[L2_WRBK_ACC][HIT] = 0
  4020. Total_core_cache_stats_breakdown[L2_WRBK_ACC][HIT_RESERVED] = 0
  4021. Total_core_cache_stats_breakdown[L2_WRBK_ACC][MISS] = 0
  4022. Total_core_cache_stats_breakdown[L2_WRBK_ACC][RESERVATION_FAIL] = 0
  4023. Total_core_cache_stats_breakdown[L2_WRBK_ACC][SECTOR_MISS] = 0
  4024. Total_core_cache_stats_breakdown[L2_WRBK_ACC][MSHR_HIT] = 0
  4025. Total_core_cache_stats_breakdown[INST_ACC_R][HIT] = 0
  4026. Total_core_cache_stats_breakdown[INST_ACC_R][HIT_RESERVED] = 0
  4027. Total_core_cache_stats_breakdown[INST_ACC_R][MISS] = 0
  4028. Total_core_cache_stats_breakdown[INST_ACC_R][RESERVATION_FAIL] = 0
  4029. Total_core_cache_stats_breakdown[INST_ACC_R][SECTOR_MISS] = 0
  4030. Total_core_cache_stats_breakdown[INST_ACC_R][MSHR_HIT] = 0
  4031. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][HIT] = 0
  4032. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][HIT_RESERVED] = 0
  4033. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][MISS] = 0
  4034. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][RESERVATION_FAIL] = 0
  4035. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][SECTOR_MISS] = 0
  4036. Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][MSHR_HIT] = 0
  4037. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][HIT] = 0
  4038. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][HIT_RESERVED] = 0
  4039. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][MISS] = 0
  4040. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][RESERVATION_FAIL] = 0
  4041. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][SECTOR_MISS] = 0
  4042. Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][MSHR_HIT] = 0
  4043. Total_core_cache_stats_breakdown[GLOBAL_ACC_W][TOTAL_ACCESS] = 2
  4044.  
  4045. Total_core_cache_fail_stats:
  4046. ctas_completed 544, Shader 0 warp_id issue ditsribution:
  4047. warp_id:
  4048. 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47,
  4049. distro:
  4050. 277, 159, 159, 159, 188, 174, 174, 174, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 160, 64, 64, 64, 64, 64, 64, 64,
  4051. gpgpu_n_tot_thrd_icount = 5329152
  4052. gpgpu_n_tot_w_icount = 166536
  4053. gpgpu_n_stall_shd_mem = 0
  4054. gpgpu_n_mem_read_local = 0
  4055. gpgpu_n_mem_write_local = 0
  4056. gpgpu_n_mem_read_global = 546
  4057. gpgpu_n_mem_write_global = 2
  4058. gpgpu_n_mem_texture = 0
  4059. gpgpu_n_mem_const = 0
  4060. gpgpu_n_load_insn = 546
  4061. gpgpu_n_store_insn = 2
  4062. gpgpu_n_shmem_insn = 424700
  4063. gpgpu_n_sstarr_insn = 0
  4064. gpgpu_n_tex_insn = 0
  4065. gpgpu_n_const_mem_insn = 0
  4066. gpgpu_n_param_mem_insn = 0
  4067. gpgpu_n_shmem_bkconflict = 0
  4068. gpgpu_n_cache_bkconflict = 0
  4069. gpgpu_n_intrawarp_mshr_merge = 0
  4070. gpgpu_n_cmem_portconflict = 0
  4071. gpgpu_stall_shd_mem[c_mem][resource_stall] = 0
  4072. gpgpu_stall_shd_mem[s_mem][bk_conf] = 0
  4073. gpgpu_stall_shd_mem[gl_mem][resource_stall] = 0
  4074. gpgpu_stall_shd_mem[gl_mem][coal_stall] = 0
  4075. gpgpu_stall_shd_mem[gl_mem][data_port_stall] = 0
  4076. gpu_reg_bank_conflict_stalls = 0
  4077. Warp Occupancy Distribution:
  4078. Stall:91835 W0_Idle:89655 W0_Scoreboard:65779 W1:20697 W2:0 W3:0 W4:0 W5:34 W6:0 W7:0 W8:0 W9:0 W10:0 W11:0 W12:0 W13:0 W14:0 W15:0 W16:0 W17:0 W18:0 W19:0 W20:0 W21:0 W22:0 W23:0 W24:0 W25:0 W26:0 W27:15 W28:0 W29:0 W30:0 W31:2178 W32:124943
  4079. single_issue_nums: WS0:61281 WS1:35085 WS2:35085 WS3:35085
  4080. dual_issue_nums: WS0:0 WS1:0 WS2:0 WS3:0
  4081. traffic_breakdown_coretomem[GLOBAL_ACC_R] = 4360 {8:545,}
  4082. traffic_breakdown_coretomem[GLOBAL_ACC_W] = 80 {40:2,}
  4083. traffic_breakdown_coretomem[GLOBAL_ATOMIC] = 40 {40:1,}
  4084. traffic_breakdown_memtocore[GLOBAL_ACC_R] = 21800 {40:545,}
  4085. traffic_breakdown_memtocore[GLOBAL_ACC_W] = 16 {8:2,}
  4086. traffic_breakdown_memtocore[GLOBAL_ATOMIC] = 40 {40:1,}
  4087. maxmflatency = 549
  4088. max_icnt2mem_latency = 85
  4089. maxmrqlatency = 0
  4090. max_icnt2sh_latency = 7
  4091. averagemflatency = 314
  4092. avg_icnt2mem_latency = 23
  4093. avg_mrq_latency = 0
  4094. avg_icnt2sh_latency = 7
  4095. mrq_lat_table:2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4096. dq_lat_table:0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4097. mf_lat_table:0 0 0 0 0 0 0 355 65 128 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4098. icnt2mem_lat_table:0 0 267 104 14 95 68 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4099. icnt2sh_lat_table:0 0 548 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4100. mf_lat_pw_table:0 0 0 0 0 0 0 5 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4101. maximum concurrent accesses to same row:
  4102. dram[0]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4103. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4104. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4105. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4106. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4107. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4108. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4109. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4110. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4111. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4112. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4113. dram[11]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4114. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4115. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4116. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4117. dram[15]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4118. maximum service time to same row:
  4119. dram[0]: 7506 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4120. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4121. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4122. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4123. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4124. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4125. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4126. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4127. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4128. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4129. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4130. dram[11]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4131. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4132. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4133. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4134. dram[15]: 5680 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4135. average row accesses per activate:
  4136. dram[0]: 1.000000 -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4137. dram[1]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4138. dram[2]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4139. dram[3]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4140. dram[4]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4141. dram[5]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4142. dram[6]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4143. dram[7]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4144. dram[8]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4145. dram[9]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4146. dram[10]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4147. dram[11]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4148. dram[12]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4149. dram[13]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4150. dram[14]: -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4151. dram[15]: 1.000000 -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan -nan
  4152. average row locality = 2/2 = 1.000000
  4153. number of total memory accesses made:
  4154. dram[0]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4155. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4156. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4157. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4158. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4159. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4160. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4161. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4162. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4163. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4164. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4165. dram[11]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4166. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4167. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4168. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4169. dram[15]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4170. total accesses: 0
  4171. min_bank_accesses = 0!
  4172. min_chip_accesses = 0!
  4173. number of total read accesses:
  4174. dram[0]: 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4175. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4176. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4177. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4178. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4179. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4180. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4181. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4182. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4183. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4184. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4185. dram[11]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4186. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4187. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4188. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4189. dram[15]: 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4190. total dram reads = 2
  4191. min_bank_accesses = 0!
  4192. min_chip_accesses = 0!
  4193. number of total write accesses:
  4194. dram[0]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4195. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4196. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4197. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4198. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4199. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4200. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4201. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4202. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4203. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4204. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4205. dram[11]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4206. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4207. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4208. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4209. dram[15]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4210. total dram writes = 0
  4211. min_bank_accesses = 0!
  4212. min_chip_accesses = 0!
  4213. average mf latency per bank:
  4214. dram[0]: 476 none none none none none none none none none none none none none none none
  4215. dram[1]: none none none none none none none none none none none none none none none none
  4216. dram[2]: none none none none none none none none none none none none none none none none
  4217. dram[3]: none none none none none none none none none none none none none none none none
  4218. dram[4]: none none none none none none none none none none none none none none none none
  4219. dram[5]: none none none none none none none none none none none none none none none none
  4220. dram[6]: none none none none none none none none none none none none none none none none
  4221. dram[7]: none none none none none none none none none none none none none none none none
  4222. dram[8]: none none none none none none none none none none none none none none none none
  4223. dram[9]: none none none none none none none none none none none none none none none none
  4224. dram[10]: none none none none none none none none none none none none none none none none
  4225. dram[11]: none none none none none none none none none none none none none none none none
  4226. dram[12]: none none none none none none none none none none none none none none none none
  4227. dram[13]: none none none none none none none none none none none none none none none none
  4228. dram[14]: none none none none none none none none none none none none none none none none
  4229. dram[15]: 171358 none none none none none none none none none none none none none none none
  4230. maximum mf latency per bank:
  4231. dram[0]: 476 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4232. dram[1]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4233. dram[2]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4234. dram[3]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4235. dram[4]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4236. dram[5]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4237. dram[6]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4238. dram[7]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4239. dram[8]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4240. dram[9]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4241. dram[10]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4242. dram[11]: 243 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4243. dram[12]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4244. dram[13]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4245. dram[14]: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4246. dram[15]: 549 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  4247. Memory Partition 0:
  4248. Cache L2_bank_000:
  4249. MSHR contents
  4250.  
  4251. Cache L2_bank_001:
  4252. MSHR contents
  4253.  
  4254. In Dram Latency Queue (total = 0):
  4255. DRAM[0]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4256. n_cmd=27049 n_nop=27047 n_act=1 n_pre=0 n_ref_event=0 n_req=1 n_rd=1 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0.0001479
  4257. n_activity=78 dram_eff=0.05128
  4258. bk0: 1a 27025i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4259.  
  4260. ------------------------------------------------------------------------
  4261.  
  4262. Row_Buffer_Locality = 0.000000
  4263. Row_Buffer_Locality_read = 0.000000
  4264. Row_Buffer_Locality_write = -nan
  4265. Bank_Level_Parallism = 1.000000
  4266. Bank_Level_Parallism_Col = 1.000000
  4267. Bank_Level_Parallism_Ready = 1.000000
  4268. write_to_read_ratio_blp_rw_average = 0.000000
  4269. GrpLevelPara = 1.000000
  4270.  
  4271. BW Util details:
  4272. bwutil = 0.000148
  4273. total_CMD = 27049
  4274. util_bw = 4
  4275. Wasted_Col = 24
  4276. Wasted_Row = 0
  4277. Idle = 27021
  4278.  
  4279. BW Util Bottlenecks:
  4280. RCDc_limit = 24
  4281. RCDWRc_limit = 0
  4282. WTRc_limit = 0
  4283. RTWc_limit = 0
  4284. CCDLc_limit = 0
  4285. rwq = 0
  4286. CCDLc_limit_alone = 0
  4287. WTRc_limit_alone = 0
  4288. RTWc_limit_alone = 0
  4289.  
  4290. Commands details:
  4291. total_CMD = 27049
  4292. n_nop = 27047
  4293. Read = 1
  4294. Write = 0
  4295. L2_Alloc = 0
  4296. L2_WB = 0
  4297. n_act = 1
  4298. n_pre = 0
  4299. n_ref = 0
  4300. n_req = 1
  4301. total_req = 1
  4302.  
  4303. Dual Bus Interface Util:
  4304. issued_total_row = 1
  4305. issued_total_col = 1
  4306. Row_Bus_Util = 0.000037
  4307. CoL_Bus_Util = 0.000037
  4308. Either_Row_CoL_Bus_Util = 0.000074
  4309. Issued_on_Two_Bus_Simul_Util = 0.000000
  4310. issued_two_Eff = 0.000000
  4311. queue_avg = 0.000000
  4312.  
  4313.  
  4314. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4315. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4316. mrqq: max=0 avg=0
  4317. Memory Partition 1:
  4318. Cache L2_bank_002:
  4319. MSHR contents
  4320.  
  4321. Cache L2_bank_003:
  4322. MSHR contents
  4323.  
  4324. In Dram Latency Queue (total = 0):
  4325. DRAM[1]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4326. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4327. n_activity=0 dram_eff=-nan
  4328. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4329.  
  4330. ------------------------------------------------------------------------
  4331.  
  4332. Row_Buffer_Locality = -nan
  4333. Row_Buffer_Locality_read = -nan
  4334. Row_Buffer_Locality_write = -nan
  4335. Bank_Level_Parallism = -nan
  4336. Bank_Level_Parallism_Col = -nan
  4337. Bank_Level_Parallism_Ready = -nan
  4338. write_to_read_ratio_blp_rw_average = -nan
  4339. GrpLevelPara = -nan
  4340.  
  4341. BW Util details:
  4342. bwutil = 0.000000
  4343. total_CMD = 27049
  4344. util_bw = 0
  4345. Wasted_Col = 0
  4346. Wasted_Row = 0
  4347. Idle = 27049
  4348.  
  4349. BW Util Bottlenecks:
  4350. RCDc_limit = 0
  4351. RCDWRc_limit = 0
  4352. WTRc_limit = 0
  4353. RTWc_limit = 0
  4354. CCDLc_limit = 0
  4355. rwq = 0
  4356. CCDLc_limit_alone = 0
  4357. WTRc_limit_alone = 0
  4358. RTWc_limit_alone = 0
  4359.  
  4360. Commands details:
  4361. total_CMD = 27049
  4362. n_nop = 27049
  4363. Read = 0
  4364. Write = 0
  4365. L2_Alloc = 0
  4366. L2_WB = 0
  4367. n_act = 0
  4368. n_pre = 0
  4369. n_ref = 0
  4370. n_req = 0
  4371. total_req = 0
  4372.  
  4373. Dual Bus Interface Util:
  4374. issued_total_row = 0
  4375. issued_total_col = 0
  4376. Row_Bus_Util = 0.000000
  4377. CoL_Bus_Util = 0.000000
  4378. Either_Row_CoL_Bus_Util = 0.000000
  4379. Issued_on_Two_Bus_Simul_Util = 0.000000
  4380. issued_two_Eff = -nan
  4381. queue_avg = 0.000000
  4382.  
  4383.  
  4384. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4385. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4386. mrqq: max=0 avg=0
  4387. Memory Partition 2:
  4388. Cache L2_bank_004:
  4389. MSHR contents
  4390.  
  4391. Cache L2_bank_005:
  4392. MSHR contents
  4393.  
  4394. In Dram Latency Queue (total = 0):
  4395. DRAM[2]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4396. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4397. n_activity=0 dram_eff=-nan
  4398. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4399.  
  4400. ------------------------------------------------------------------------
  4401.  
  4402. Row_Buffer_Locality = -nan
  4403. Row_Buffer_Locality_read = -nan
  4404. Row_Buffer_Locality_write = -nan
  4405. Bank_Level_Parallism = -nan
  4406. Bank_Level_Parallism_Col = -nan
  4407. Bank_Level_Parallism_Ready = -nan
  4408. write_to_read_ratio_blp_rw_average = -nan
  4409. GrpLevelPara = -nan
  4410.  
  4411. BW Util details:
  4412. bwutil = 0.000000
  4413. total_CMD = 27049
  4414. util_bw = 0
  4415. Wasted_Col = 0
  4416. Wasted_Row = 0
  4417. Idle = 27049
  4418.  
  4419. BW Util Bottlenecks:
  4420. RCDc_limit = 0
  4421. RCDWRc_limit = 0
  4422. WTRc_limit = 0
  4423. RTWc_limit = 0
  4424. CCDLc_limit = 0
  4425. rwq = 0
  4426. CCDLc_limit_alone = 0
  4427. WTRc_limit_alone = 0
  4428. RTWc_limit_alone = 0
  4429.  
  4430. Commands details:
  4431. total_CMD = 27049
  4432. n_nop = 27049
  4433. Read = 0
  4434. Write = 0
  4435. L2_Alloc = 0
  4436. L2_WB = 0
  4437. n_act = 0
  4438. n_pre = 0
  4439. n_ref = 0
  4440. n_req = 0
  4441. total_req = 0
  4442.  
  4443. Dual Bus Interface Util:
  4444. issued_total_row = 0
  4445. issued_total_col = 0
  4446. Row_Bus_Util = 0.000000
  4447. CoL_Bus_Util = 0.000000
  4448. Either_Row_CoL_Bus_Util = 0.000000
  4449. Issued_on_Two_Bus_Simul_Util = 0.000000
  4450. issued_two_Eff = -nan
  4451. queue_avg = 0.000000
  4452.  
  4453.  
  4454. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4455. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4456. mrqq: max=0 avg=0
  4457. Memory Partition 3:
  4458. Cache L2_bank_006:
  4459. MSHR contents
  4460.  
  4461. Cache L2_bank_007:
  4462. MSHR contents
  4463.  
  4464. In Dram Latency Queue (total = 0):
  4465. DRAM[3]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4466. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4467. n_activity=0 dram_eff=-nan
  4468. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4469.  
  4470. ------------------------------------------------------------------------
  4471.  
  4472. Row_Buffer_Locality = -nan
  4473. Row_Buffer_Locality_read = -nan
  4474. Row_Buffer_Locality_write = -nan
  4475. Bank_Level_Parallism = -nan
  4476. Bank_Level_Parallism_Col = -nan
  4477. Bank_Level_Parallism_Ready = -nan
  4478. write_to_read_ratio_blp_rw_average = -nan
  4479. GrpLevelPara = -nan
  4480.  
  4481. BW Util details:
  4482. bwutil = 0.000000
  4483. total_CMD = 27049
  4484. util_bw = 0
  4485. Wasted_Col = 0
  4486. Wasted_Row = 0
  4487. Idle = 27049
  4488.  
  4489. BW Util Bottlenecks:
  4490. RCDc_limit = 0
  4491. RCDWRc_limit = 0
  4492. WTRc_limit = 0
  4493. RTWc_limit = 0
  4494. CCDLc_limit = 0
  4495. rwq = 0
  4496. CCDLc_limit_alone = 0
  4497. WTRc_limit_alone = 0
  4498. RTWc_limit_alone = 0
  4499.  
  4500. Commands details:
  4501. total_CMD = 27049
  4502. n_nop = 27049
  4503. Read = 0
  4504. Write = 0
  4505. L2_Alloc = 0
  4506. L2_WB = 0
  4507. n_act = 0
  4508. n_pre = 0
  4509. n_ref = 0
  4510. n_req = 0
  4511. total_req = 0
  4512.  
  4513. Dual Bus Interface Util:
  4514. issued_total_row = 0
  4515. issued_total_col = 0
  4516. Row_Bus_Util = 0.000000
  4517. CoL_Bus_Util = 0.000000
  4518. Either_Row_CoL_Bus_Util = 0.000000
  4519. Issued_on_Two_Bus_Simul_Util = 0.000000
  4520. issued_two_Eff = -nan
  4521. queue_avg = 0.000000
  4522.  
  4523.  
  4524. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4525. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4526. mrqq: max=0 avg=0
  4527. Memory Partition 4:
  4528. Cache L2_bank_008:
  4529. MSHR contents
  4530.  
  4531. Cache L2_bank_009:
  4532. MSHR contents
  4533.  
  4534. In Dram Latency Queue (total = 0):
  4535. DRAM[4]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4536. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4537. n_activity=0 dram_eff=-nan
  4538. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4539.  
  4540. ------------------------------------------------------------------------
  4541.  
  4542. Row_Buffer_Locality = -nan
  4543. Row_Buffer_Locality_read = -nan
  4544. Row_Buffer_Locality_write = -nan
  4545. Bank_Level_Parallism = -nan
  4546. Bank_Level_Parallism_Col = -nan
  4547. Bank_Level_Parallism_Ready = -nan
  4548. write_to_read_ratio_blp_rw_average = -nan
  4549. GrpLevelPara = -nan
  4550.  
  4551. BW Util details:
  4552. bwutil = 0.000000
  4553. total_CMD = 27049
  4554. util_bw = 0
  4555. Wasted_Col = 0
  4556. Wasted_Row = 0
  4557. Idle = 27049
  4558.  
  4559. BW Util Bottlenecks:
  4560. RCDc_limit = 0
  4561. RCDWRc_limit = 0
  4562. WTRc_limit = 0
  4563. RTWc_limit = 0
  4564. CCDLc_limit = 0
  4565. rwq = 0
  4566. CCDLc_limit_alone = 0
  4567. WTRc_limit_alone = 0
  4568. RTWc_limit_alone = 0
  4569.  
  4570. Commands details:
  4571. total_CMD = 27049
  4572. n_nop = 27049
  4573. Read = 0
  4574. Write = 0
  4575. L2_Alloc = 0
  4576. L2_WB = 0
  4577. n_act = 0
  4578. n_pre = 0
  4579. n_ref = 0
  4580. n_req = 0
  4581. total_req = 0
  4582.  
  4583. Dual Bus Interface Util:
  4584. issued_total_row = 0
  4585. issued_total_col = 0
  4586. Row_Bus_Util = 0.000000
  4587. CoL_Bus_Util = 0.000000
  4588. Either_Row_CoL_Bus_Util = 0.000000
  4589. Issued_on_Two_Bus_Simul_Util = 0.000000
  4590. issued_two_Eff = -nan
  4591. queue_avg = 0.000000
  4592.  
  4593.  
  4594. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4595. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4596. mrqq: max=0 avg=0
  4597. Memory Partition 5:
  4598. Cache L2_bank_010:
  4599. MSHR contents
  4600.  
  4601. Cache L2_bank_011:
  4602. MSHR contents
  4603.  
  4604. In Dram Latency Queue (total = 0):
  4605. DRAM[5]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4606. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4607. n_activity=0 dram_eff=-nan
  4608. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4609.  
  4610. ------------------------------------------------------------------------
  4611.  
  4612. Row_Buffer_Locality = -nan
  4613. Row_Buffer_Locality_read = -nan
  4614. Row_Buffer_Locality_write = -nan
  4615. Bank_Level_Parallism = -nan
  4616. Bank_Level_Parallism_Col = -nan
  4617. Bank_Level_Parallism_Ready = -nan
  4618. write_to_read_ratio_blp_rw_average = -nan
  4619. GrpLevelPara = -nan
  4620.  
  4621. BW Util details:
  4622. bwutil = 0.000000
  4623. total_CMD = 27049
  4624. util_bw = 0
  4625. Wasted_Col = 0
  4626. Wasted_Row = 0
  4627. Idle = 27049
  4628.  
  4629. BW Util Bottlenecks:
  4630. RCDc_limit = 0
  4631. RCDWRc_limit = 0
  4632. WTRc_limit = 0
  4633. RTWc_limit = 0
  4634. CCDLc_limit = 0
  4635. rwq = 0
  4636. CCDLc_limit_alone = 0
  4637. WTRc_limit_alone = 0
  4638. RTWc_limit_alone = 0
  4639.  
  4640. Commands details:
  4641. total_CMD = 27049
  4642. n_nop = 27049
  4643. Read = 0
  4644. Write = 0
  4645. L2_Alloc = 0
  4646. L2_WB = 0
  4647. n_act = 0
  4648. n_pre = 0
  4649. n_ref = 0
  4650. n_req = 0
  4651. total_req = 0
  4652.  
  4653. Dual Bus Interface Util:
  4654. issued_total_row = 0
  4655. issued_total_col = 0
  4656. Row_Bus_Util = 0.000000
  4657. CoL_Bus_Util = 0.000000
  4658. Either_Row_CoL_Bus_Util = 0.000000
  4659. Issued_on_Two_Bus_Simul_Util = 0.000000
  4660. issued_two_Eff = -nan
  4661. queue_avg = 0.000000
  4662.  
  4663.  
  4664. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4665. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4666. mrqq: max=0 avg=0
  4667. Memory Partition 6:
  4668. Cache L2_bank_012:
  4669. MSHR contents
  4670.  
  4671. Cache L2_bank_013:
  4672. MSHR contents
  4673.  
  4674. In Dram Latency Queue (total = 0):
  4675. DRAM[6]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4676. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4677. n_activity=0 dram_eff=-nan
  4678. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4679.  
  4680. ------------------------------------------------------------------------
  4681.  
  4682. Row_Buffer_Locality = -nan
  4683. Row_Buffer_Locality_read = -nan
  4684. Row_Buffer_Locality_write = -nan
  4685. Bank_Level_Parallism = -nan
  4686. Bank_Level_Parallism_Col = -nan
  4687. Bank_Level_Parallism_Ready = -nan
  4688. write_to_read_ratio_blp_rw_average = -nan
  4689. GrpLevelPara = -nan
  4690.  
  4691. BW Util details:
  4692. bwutil = 0.000000
  4693. total_CMD = 27049
  4694. util_bw = 0
  4695. Wasted_Col = 0
  4696. Wasted_Row = 0
  4697. Idle = 27049
  4698.  
  4699. BW Util Bottlenecks:
  4700. RCDc_limit = 0
  4701. RCDWRc_limit = 0
  4702. WTRc_limit = 0
  4703. RTWc_limit = 0
  4704. CCDLc_limit = 0
  4705. rwq = 0
  4706. CCDLc_limit_alone = 0
  4707. WTRc_limit_alone = 0
  4708. RTWc_limit_alone = 0
  4709.  
  4710. Commands details:
  4711. total_CMD = 27049
  4712. n_nop = 27049
  4713. Read = 0
  4714. Write = 0
  4715. L2_Alloc = 0
  4716. L2_WB = 0
  4717. n_act = 0
  4718. n_pre = 0
  4719. n_ref = 0
  4720. n_req = 0
  4721. total_req = 0
  4722.  
  4723. Dual Bus Interface Util:
  4724. issued_total_row = 0
  4725. issued_total_col = 0
  4726. Row_Bus_Util = 0.000000
  4727. CoL_Bus_Util = 0.000000
  4728. Either_Row_CoL_Bus_Util = 0.000000
  4729. Issued_on_Two_Bus_Simul_Util = 0.000000
  4730. issued_two_Eff = -nan
  4731. queue_avg = 0.000000
  4732.  
  4733.  
  4734. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4735. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4736. mrqq: max=0 avg=0
  4737. Memory Partition 7:
  4738. Cache L2_bank_014:
  4739. MSHR contents
  4740.  
  4741. Cache L2_bank_015:
  4742. MSHR contents
  4743.  
  4744. In Dram Latency Queue (total = 0):
  4745. DRAM[7]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4746. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4747. n_activity=0 dram_eff=-nan
  4748. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4749.  
  4750. ------------------------------------------------------------------------
  4751.  
  4752. Row_Buffer_Locality = -nan
  4753. Row_Buffer_Locality_read = -nan
  4754. Row_Buffer_Locality_write = -nan
  4755. Bank_Level_Parallism = -nan
  4756. Bank_Level_Parallism_Col = -nan
  4757. Bank_Level_Parallism_Ready = -nan
  4758. write_to_read_ratio_blp_rw_average = -nan
  4759. GrpLevelPara = -nan
  4760.  
  4761. BW Util details:
  4762. bwutil = 0.000000
  4763. total_CMD = 27049
  4764. util_bw = 0
  4765. Wasted_Col = 0
  4766. Wasted_Row = 0
  4767. Idle = 27049
  4768.  
  4769. BW Util Bottlenecks:
  4770. RCDc_limit = 0
  4771. RCDWRc_limit = 0
  4772. WTRc_limit = 0
  4773. RTWc_limit = 0
  4774. CCDLc_limit = 0
  4775. rwq = 0
  4776. CCDLc_limit_alone = 0
  4777. WTRc_limit_alone = 0
  4778. RTWc_limit_alone = 0
  4779.  
  4780. Commands details:
  4781. total_CMD = 27049
  4782. n_nop = 27049
  4783. Read = 0
  4784. Write = 0
  4785. L2_Alloc = 0
  4786. L2_WB = 0
  4787. n_act = 0
  4788. n_pre = 0
  4789. n_ref = 0
  4790. n_req = 0
  4791. total_req = 0
  4792.  
  4793. Dual Bus Interface Util:
  4794. issued_total_row = 0
  4795. issued_total_col = 0
  4796. Row_Bus_Util = 0.000000
  4797. CoL_Bus_Util = 0.000000
  4798. Either_Row_CoL_Bus_Util = 0.000000
  4799. Issued_on_Two_Bus_Simul_Util = 0.000000
  4800. issued_two_Eff = -nan
  4801. queue_avg = 0.000000
  4802.  
  4803.  
  4804. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4805. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4806. mrqq: max=0 avg=0
  4807. Memory Partition 8:
  4808. Cache L2_bank_016:
  4809. MSHR contents
  4810.  
  4811. Cache L2_bank_017:
  4812. MSHR contents
  4813.  
  4814. In Dram Latency Queue (total = 0):
  4815. DRAM[8]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4816. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4817. n_activity=0 dram_eff=-nan
  4818. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4819.  
  4820. ------------------------------------------------------------------------
  4821.  
  4822. Row_Buffer_Locality = -nan
  4823. Row_Buffer_Locality_read = -nan
  4824. Row_Buffer_Locality_write = -nan
  4825. Bank_Level_Parallism = -nan
  4826. Bank_Level_Parallism_Col = -nan
  4827. Bank_Level_Parallism_Ready = -nan
  4828. write_to_read_ratio_blp_rw_average = -nan
  4829. GrpLevelPara = -nan
  4830.  
  4831. BW Util details:
  4832. bwutil = 0.000000
  4833. total_CMD = 27049
  4834. util_bw = 0
  4835. Wasted_Col = 0
  4836. Wasted_Row = 0
  4837. Idle = 27049
  4838.  
  4839. BW Util Bottlenecks:
  4840. RCDc_limit = 0
  4841. RCDWRc_limit = 0
  4842. WTRc_limit = 0
  4843. RTWc_limit = 0
  4844. CCDLc_limit = 0
  4845. rwq = 0
  4846. CCDLc_limit_alone = 0
  4847. WTRc_limit_alone = 0
  4848. RTWc_limit_alone = 0
  4849.  
  4850. Commands details:
  4851. total_CMD = 27049
  4852. n_nop = 27049
  4853. Read = 0
  4854. Write = 0
  4855. L2_Alloc = 0
  4856. L2_WB = 0
  4857. n_act = 0
  4858. n_pre = 0
  4859. n_ref = 0
  4860. n_req = 0
  4861. total_req = 0
  4862.  
  4863. Dual Bus Interface Util:
  4864. issued_total_row = 0
  4865. issued_total_col = 0
  4866. Row_Bus_Util = 0.000000
  4867. CoL_Bus_Util = 0.000000
  4868. Either_Row_CoL_Bus_Util = 0.000000
  4869. Issued_on_Two_Bus_Simul_Util = 0.000000
  4870. issued_two_Eff = -nan
  4871. queue_avg = 0.000000
  4872.  
  4873.  
  4874. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4875. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4876. mrqq: max=0 avg=0
  4877. Memory Partition 9:
  4878. Cache L2_bank_018:
  4879. MSHR contents
  4880.  
  4881. Cache L2_bank_019:
  4882. MSHR contents
  4883.  
  4884. In Dram Latency Queue (total = 0):
  4885. DRAM[9]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4886. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4887. n_activity=0 dram_eff=-nan
  4888. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4889.  
  4890. ------------------------------------------------------------------------
  4891.  
  4892. Row_Buffer_Locality = -nan
  4893. Row_Buffer_Locality_read = -nan
  4894. Row_Buffer_Locality_write = -nan
  4895. Bank_Level_Parallism = -nan
  4896. Bank_Level_Parallism_Col = -nan
  4897. Bank_Level_Parallism_Ready = -nan
  4898. write_to_read_ratio_blp_rw_average = -nan
  4899. GrpLevelPara = -nan
  4900.  
  4901. BW Util details:
  4902. bwutil = 0.000000
  4903. total_CMD = 27049
  4904. util_bw = 0
  4905. Wasted_Col = 0
  4906. Wasted_Row = 0
  4907. Idle = 27049
  4908.  
  4909. BW Util Bottlenecks:
  4910. RCDc_limit = 0
  4911. RCDWRc_limit = 0
  4912. WTRc_limit = 0
  4913. RTWc_limit = 0
  4914. CCDLc_limit = 0
  4915. rwq = 0
  4916. CCDLc_limit_alone = 0
  4917. WTRc_limit_alone = 0
  4918. RTWc_limit_alone = 0
  4919.  
  4920. Commands details:
  4921. total_CMD = 27049
  4922. n_nop = 27049
  4923. Read = 0
  4924. Write = 0
  4925. L2_Alloc = 0
  4926. L2_WB = 0
  4927. n_act = 0
  4928. n_pre = 0
  4929. n_ref = 0
  4930. n_req = 0
  4931. total_req = 0
  4932.  
  4933. Dual Bus Interface Util:
  4934. issued_total_row = 0
  4935. issued_total_col = 0
  4936. Row_Bus_Util = 0.000000
  4937. CoL_Bus_Util = 0.000000
  4938. Either_Row_CoL_Bus_Util = 0.000000
  4939. Issued_on_Two_Bus_Simul_Util = 0.000000
  4940. issued_two_Eff = -nan
  4941. queue_avg = 0.000000
  4942.  
  4943.  
  4944. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  4945. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  4946. mrqq: max=0 avg=0
  4947. Memory Partition 10:
  4948. Cache L2_bank_020:
  4949. MSHR contents
  4950.  
  4951. Cache L2_bank_021:
  4952. MSHR contents
  4953.  
  4954. In Dram Latency Queue (total = 0):
  4955. DRAM[10]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  4956. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  4957. n_activity=0 dram_eff=-nan
  4958. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  4959.  
  4960. ------------------------------------------------------------------------
  4961.  
  4962. Row_Buffer_Locality = -nan
  4963. Row_Buffer_Locality_read = -nan
  4964. Row_Buffer_Locality_write = -nan
  4965. Bank_Level_Parallism = -nan
  4966. Bank_Level_Parallism_Col = -nan
  4967. Bank_Level_Parallism_Ready = -nan
  4968. write_to_read_ratio_blp_rw_average = -nan
  4969. GrpLevelPara = -nan
  4970.  
  4971. BW Util details:
  4972. bwutil = 0.000000
  4973. total_CMD = 27049
  4974. util_bw = 0
  4975. Wasted_Col = 0
  4976. Wasted_Row = 0
  4977. Idle = 27049
  4978.  
  4979. BW Util Bottlenecks:
  4980. RCDc_limit = 0
  4981. RCDWRc_limit = 0
  4982. WTRc_limit = 0
  4983. RTWc_limit = 0
  4984. CCDLc_limit = 0
  4985. rwq = 0
  4986. CCDLc_limit_alone = 0
  4987. WTRc_limit_alone = 0
  4988. RTWc_limit_alone = 0
  4989.  
  4990. Commands details:
  4991. total_CMD = 27049
  4992. n_nop = 27049
  4993. Read = 0
  4994. Write = 0
  4995. L2_Alloc = 0
  4996. L2_WB = 0
  4997. n_act = 0
  4998. n_pre = 0
  4999. n_ref = 0
  5000. n_req = 0
  5001. total_req = 0
  5002.  
  5003. Dual Bus Interface Util:
  5004. issued_total_row = 0
  5005. issued_total_col = 0
  5006. Row_Bus_Util = 0.000000
  5007. CoL_Bus_Util = 0.000000
  5008. Either_Row_CoL_Bus_Util = 0.000000
  5009. Issued_on_Two_Bus_Simul_Util = 0.000000
  5010. issued_two_Eff = -nan
  5011. queue_avg = 0.000000
  5012.  
  5013.  
  5014. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5015. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5016. mrqq: max=0 avg=0
  5017. Memory Partition 11:
  5018. Cache L2_bank_022:
  5019. MSHR contents
  5020.  
  5021. Cache L2_bank_023:
  5022. MSHR contents
  5023.  
  5024. In Dram Latency Queue (total = 0):
  5025. DRAM[11]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  5026. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  5027. n_activity=0 dram_eff=-nan
  5028. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  5029.  
  5030. ------------------------------------------------------------------------
  5031.  
  5032. Row_Buffer_Locality = -nan
  5033. Row_Buffer_Locality_read = -nan
  5034. Row_Buffer_Locality_write = -nan
  5035. Bank_Level_Parallism = -nan
  5036. Bank_Level_Parallism_Col = -nan
  5037. Bank_Level_Parallism_Ready = -nan
  5038. write_to_read_ratio_blp_rw_average = -nan
  5039. GrpLevelPara = -nan
  5040.  
  5041. BW Util details:
  5042. bwutil = 0.000000
  5043. total_CMD = 27049
  5044. util_bw = 0
  5045. Wasted_Col = 0
  5046. Wasted_Row = 0
  5047. Idle = 27049
  5048.  
  5049. BW Util Bottlenecks:
  5050. RCDc_limit = 0
  5051. RCDWRc_limit = 0
  5052. WTRc_limit = 0
  5053. RTWc_limit = 0
  5054. CCDLc_limit = 0
  5055. rwq = 0
  5056. CCDLc_limit_alone = 0
  5057. WTRc_limit_alone = 0
  5058. RTWc_limit_alone = 0
  5059.  
  5060. Commands details:
  5061. total_CMD = 27049
  5062. n_nop = 27049
  5063. Read = 0
  5064. Write = 0
  5065. L2_Alloc = 0
  5066. L2_WB = 0
  5067. n_act = 0
  5068. n_pre = 0
  5069. n_ref = 0
  5070. n_req = 0
  5071. total_req = 0
  5072.  
  5073. Dual Bus Interface Util:
  5074. issued_total_row = 0
  5075. issued_total_col = 0
  5076. Row_Bus_Util = 0.000000
  5077. CoL_Bus_Util = 0.000000
  5078. Either_Row_CoL_Bus_Util = 0.000000
  5079. Issued_on_Two_Bus_Simul_Util = 0.000000
  5080. issued_two_Eff = -nan
  5081. queue_avg = 0.000000
  5082.  
  5083.  
  5084. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5085. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5086. mrqq: max=0 avg=0
  5087. Memory Partition 12:
  5088. Cache L2_bank_024:
  5089. MSHR contents
  5090.  
  5091. Cache L2_bank_025:
  5092. MSHR contents
  5093.  
  5094. In Dram Latency Queue (total = 0):
  5095. DRAM[12]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  5096. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  5097. n_activity=0 dram_eff=-nan
  5098. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  5099.  
  5100. ------------------------------------------------------------------------
  5101.  
  5102. Row_Buffer_Locality = -nan
  5103. Row_Buffer_Locality_read = -nan
  5104. Row_Buffer_Locality_write = -nan
  5105. Bank_Level_Parallism = -nan
  5106. Bank_Level_Parallism_Col = -nan
  5107. Bank_Level_Parallism_Ready = -nan
  5108. write_to_read_ratio_blp_rw_average = -nan
  5109. GrpLevelPara = -nan
  5110.  
  5111. BW Util details:
  5112. bwutil = 0.000000
  5113. total_CMD = 27049
  5114. util_bw = 0
  5115. Wasted_Col = 0
  5116. Wasted_Row = 0
  5117. Idle = 27049
  5118.  
  5119. BW Util Bottlenecks:
  5120. RCDc_limit = 0
  5121. RCDWRc_limit = 0
  5122. WTRc_limit = 0
  5123. RTWc_limit = 0
  5124. CCDLc_limit = 0
  5125. rwq = 0
  5126. CCDLc_limit_alone = 0
  5127. WTRc_limit_alone = 0
  5128. RTWc_limit_alone = 0
  5129.  
  5130. Commands details:
  5131. total_CMD = 27049
  5132. n_nop = 27049
  5133. Read = 0
  5134. Write = 0
  5135. L2_Alloc = 0
  5136. L2_WB = 0
  5137. n_act = 0
  5138. n_pre = 0
  5139. n_ref = 0
  5140. n_req = 0
  5141. total_req = 0
  5142.  
  5143. Dual Bus Interface Util:
  5144. issued_total_row = 0
  5145. issued_total_col = 0
  5146. Row_Bus_Util = 0.000000
  5147. CoL_Bus_Util = 0.000000
  5148. Either_Row_CoL_Bus_Util = 0.000000
  5149. Issued_on_Two_Bus_Simul_Util = 0.000000
  5150. issued_two_Eff = -nan
  5151. queue_avg = 0.000000
  5152.  
  5153.  
  5154. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5155. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5156. mrqq: max=0 avg=0
  5157. Memory Partition 13:
  5158. Cache L2_bank_026:
  5159. MSHR contents
  5160.  
  5161. Cache L2_bank_027:
  5162. MSHR contents
  5163.  
  5164. In Dram Latency Queue (total = 0):
  5165. DRAM[13]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  5166. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  5167. n_activity=0 dram_eff=-nan
  5168. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  5169.  
  5170. ------------------------------------------------------------------------
  5171.  
  5172. Row_Buffer_Locality = -nan
  5173. Row_Buffer_Locality_read = -nan
  5174. Row_Buffer_Locality_write = -nan
  5175. Bank_Level_Parallism = -nan
  5176. Bank_Level_Parallism_Col = -nan
  5177. Bank_Level_Parallism_Ready = -nan
  5178. write_to_read_ratio_blp_rw_average = -nan
  5179. GrpLevelPara = -nan
  5180.  
  5181. BW Util details:
  5182. bwutil = 0.000000
  5183. total_CMD = 27049
  5184. util_bw = 0
  5185. Wasted_Col = 0
  5186. Wasted_Row = 0
  5187. Idle = 27049
  5188.  
  5189. BW Util Bottlenecks:
  5190. RCDc_limit = 0
  5191. RCDWRc_limit = 0
  5192. WTRc_limit = 0
  5193. RTWc_limit = 0
  5194. CCDLc_limit = 0
  5195. rwq = 0
  5196. CCDLc_limit_alone = 0
  5197. WTRc_limit_alone = 0
  5198. RTWc_limit_alone = 0
  5199.  
  5200. Commands details:
  5201. total_CMD = 27049
  5202. n_nop = 27049
  5203. Read = 0
  5204. Write = 0
  5205. L2_Alloc = 0
  5206. L2_WB = 0
  5207. n_act = 0
  5208. n_pre = 0
  5209. n_ref = 0
  5210. n_req = 0
  5211. total_req = 0
  5212.  
  5213. Dual Bus Interface Util:
  5214. issued_total_row = 0
  5215. issued_total_col = 0
  5216. Row_Bus_Util = 0.000000
  5217. CoL_Bus_Util = 0.000000
  5218. Either_Row_CoL_Bus_Util = 0.000000
  5219. Issued_on_Two_Bus_Simul_Util = 0.000000
  5220. issued_two_Eff = -nan
  5221. queue_avg = 0.000000
  5222.  
  5223.  
  5224. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5225. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5226. mrqq: max=0 avg=0
  5227. Memory Partition 14:
  5228. Cache L2_bank_028:
  5229. MSHR contents
  5230.  
  5231. Cache L2_bank_029:
  5232. MSHR contents
  5233.  
  5234. In Dram Latency Queue (total = 0):
  5235. DRAM[14]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  5236. n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
  5237. n_activity=0 dram_eff=-nan
  5238. bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  5239.  
  5240. ------------------------------------------------------------------------
  5241.  
  5242. Row_Buffer_Locality = -nan
  5243. Row_Buffer_Locality_read = -nan
  5244. Row_Buffer_Locality_write = -nan
  5245. Bank_Level_Parallism = -nan
  5246. Bank_Level_Parallism_Col = -nan
  5247. Bank_Level_Parallism_Ready = -nan
  5248. write_to_read_ratio_blp_rw_average = -nan
  5249. GrpLevelPara = -nan
  5250.  
  5251. BW Util details:
  5252. bwutil = 0.000000
  5253. total_CMD = 27049
  5254. util_bw = 0
  5255. Wasted_Col = 0
  5256. Wasted_Row = 0
  5257. Idle = 27049
  5258.  
  5259. BW Util Bottlenecks:
  5260. RCDc_limit = 0
  5261. RCDWRc_limit = 0
  5262. WTRc_limit = 0
  5263. RTWc_limit = 0
  5264. CCDLc_limit = 0
  5265. rwq = 0
  5266. CCDLc_limit_alone = 0
  5267. WTRc_limit_alone = 0
  5268. RTWc_limit_alone = 0
  5269.  
  5270. Commands details:
  5271. total_CMD = 27049
  5272. n_nop = 27049
  5273. Read = 0
  5274. Write = 0
  5275. L2_Alloc = 0
  5276. L2_WB = 0
  5277. n_act = 0
  5278. n_pre = 0
  5279. n_ref = 0
  5280. n_req = 0
  5281. total_req = 0
  5282.  
  5283. Dual Bus Interface Util:
  5284. issued_total_row = 0
  5285. issued_total_col = 0
  5286. Row_Bus_Util = 0.000000
  5287. CoL_Bus_Util = 0.000000
  5288. Either_Row_CoL_Bus_Util = 0.000000
  5289. Issued_on_Two_Bus_Simul_Util = 0.000000
  5290. issued_two_Eff = -nan
  5291. queue_avg = 0.000000
  5292.  
  5293.  
  5294. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5295. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5296. mrqq: max=0 avg=0
  5297. Memory Partition 15:
  5298. Cache L2_bank_030:
  5299. MSHR contents
  5300.  
  5301. Cache L2_bank_031:
  5302. MSHR contents
  5303.  
  5304. In Dram Latency Queue (total = 0):
  5305. DRAM[15]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
  5306. n_cmd=27049 n_nop=27047 n_act=1 n_pre=0 n_ref_event=0 n_req=1 n_rd=1 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0.0001479
  5307. n_activity=78 dram_eff=0.05128
  5308. bk0: 1a 27025i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i
  5309.  
  5310. ------------------------------------------------------------------------
  5311.  
  5312. Row_Buffer_Locality = 0.000000
  5313. Row_Buffer_Locality_read = 0.000000
  5314. Row_Buffer_Locality_write = -nan
  5315. Bank_Level_Parallism = 1.000000
  5316. Bank_Level_Parallism_Col = 1.000000
  5317. Bank_Level_Parallism_Ready = 1.000000
  5318. write_to_read_ratio_blp_rw_average = 0.000000
  5319. GrpLevelPara = 1.000000
  5320.  
  5321. BW Util details:
  5322. bwutil = 0.000148
  5323. total_CMD = 27049
  5324. util_bw = 4
  5325. Wasted_Col = 24
  5326. Wasted_Row = 0
  5327. Idle = 27021
  5328.  
  5329. BW Util Bottlenecks:
  5330. RCDc_limit = 24
  5331. RCDWRc_limit = 0
  5332. WTRc_limit = 0
  5333. RTWc_limit = 0
  5334. CCDLc_limit = 0
  5335. rwq = 0
  5336. CCDLc_limit_alone = 0
  5337. WTRc_limit_alone = 0
  5338. RTWc_limit_alone = 0
  5339.  
  5340. Commands details:
  5341. total_CMD = 27049
  5342. n_nop = 27047
  5343. Read = 1
  5344. Write = 0
  5345. L2_Alloc = 0
  5346. L2_WB = 0
  5347. n_act = 1
  5348. n_pre = 0
  5349. n_ref = 0
  5350. n_req = 1
  5351. total_req = 1
  5352.  
  5353. Dual Bus Interface Util:
  5354. issued_total_row = 1
  5355. issued_total_col = 1
  5356. Row_Bus_Util = 0.000037
  5357. CoL_Bus_Util = 0.000037
  5358. Either_Row_CoL_Bus_Util = 0.000074
  5359. Issued_on_Two_Bus_Simul_Util = 0.000000
  5360. issued_two_Eff = 0.000000
  5361. queue_avg = 0.000000
  5362.  
  5363.  
  5364. dram_util_bins: 0 0 0 0 0 0 0 0 0 0
  5365. dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
  5366. mrqq: max=0 avg=0
  5367.  
  5368. ========= L2 cache stats =========
  5369. L2_cache_bank[0]: Access = 1, Miss = 1, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
  5370. L2_cache_bank[1]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5371. L2_cache_bank[2]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5372. L2_cache_bank[3]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5373. L2_cache_bank[4]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5374. L2_cache_bank[5]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5375. L2_cache_bank[6]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5376. L2_cache_bank[7]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5377. L2_cache_bank[8]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5378. L2_cache_bank[9]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5379. L2_cache_bank[10]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5380. L2_cache_bank[11]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5381. L2_cache_bank[12]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5382. L2_cache_bank[13]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5383. L2_cache_bank[14]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5384. L2_cache_bank[15]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5385. L2_cache_bank[16]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5386. L2_cache_bank[17]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5387. L2_cache_bank[18]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5388. L2_cache_bank[19]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5389. L2_cache_bank[20]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5390. L2_cache_bank[21]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5391. L2_cache_bank[22]: Access = 1, Miss = 1, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
  5392. L2_cache_bank[23]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5393. L2_cache_bank[24]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5394. L2_cache_bank[25]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5395. L2_cache_bank[26]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5396. L2_cache_bank[27]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5397. L2_cache_bank[28]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5398. L2_cache_bank[29]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5399. L2_cache_bank[30]: Access = 546, Miss = 1, Miss_rate = 0.002, Pending_hits = 3, Reservation_fails = 268
  5400. L2_cache_bank[31]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
  5401. L2_total_cache_accesses = 548
  5402. L2_total_cache_misses = 3
  5403. L2_total_cache_miss_rate = 0.0055
  5404. L2_total_cache_pending_hits = 3
  5405. L2_total_cache_reservation_fails = 268
  5406. L2_total_cache_breakdown:
  5407. L2_cache_stats_breakdown[GLOBAL_ACC_R][HIT] = 541
  5408. L2_cache_stats_breakdown[GLOBAL_ACC_R][HIT_RESERVED] = 3
  5409. L2_cache_stats_breakdown[GLOBAL_ACC_R][MISS] = 2
  5410. L2_cache_stats_breakdown[GLOBAL_ACC_R][RESERVATION_FAIL] = 268
  5411. L2_cache_stats_breakdown[GLOBAL_ACC_R][SECTOR_MISS] = 0
  5412. L2_cache_stats_breakdown[GLOBAL_ACC_R][MSHR_HIT] = 3
  5413. L2_cache_stats_breakdown[LOCAL_ACC_R][HIT] = 0
  5414. L2_cache_stats_breakdown[LOCAL_ACC_R][HIT_RESERVED] = 0
  5415. L2_cache_stats_breakdown[LOCAL_ACC_R][MISS] = 0
  5416. L2_cache_stats_breakdown[LOCAL_ACC_R][RESERVATION_FAIL] = 0
  5417. L2_cache_stats_breakdown[LOCAL_ACC_R][SECTOR_MISS] = 0
  5418. L2_cache_stats_breakdown[LOCAL_ACC_R][MSHR_HIT] = 0
  5419. L2_cache_stats_breakdown[CONST_ACC_R][HIT] = 0
  5420. L2_cache_stats_breakdown[CONST_ACC_R][HIT_RESERVED] = 0
  5421. L2_cache_stats_breakdown[CONST_ACC_R][MISS] = 0
  5422. L2_cache_stats_breakdown[CONST_ACC_R][RESERVATION_FAIL] = 0
  5423. L2_cache_stats_breakdown[CONST_ACC_R][SECTOR_MISS] = 0
  5424. L2_cache_stats_breakdown[CONST_ACC_R][MSHR_HIT] = 0
  5425. L2_cache_stats_breakdown[TEXTURE_ACC_R][HIT] = 0
  5426. L2_cache_stats_breakdown[TEXTURE_ACC_R][HIT_RESERVED] = 0
  5427. L2_cache_stats_breakdown[TEXTURE_ACC_R][MISS] = 0
  5428. L2_cache_stats_breakdown[TEXTURE_ACC_R][RESERVATION_FAIL] = 0
  5429. L2_cache_stats_breakdown[TEXTURE_ACC_R][SECTOR_MISS] = 0
  5430. L2_cache_stats_breakdown[TEXTURE_ACC_R][MSHR_HIT] = 0
  5431. L2_cache_stats_breakdown[GLOBAL_ACC_W][HIT] = 1
  5432. L2_cache_stats_breakdown[GLOBAL_ACC_W][HIT_RESERVED] = 0
  5433. L2_cache_stats_breakdown[GLOBAL_ACC_W][MISS] = 1
  5434. L2_cache_stats_breakdown[GLOBAL_ACC_W][RESERVATION_FAIL] = 0
  5435. L2_cache_stats_breakdown[GLOBAL_ACC_W][SECTOR_MISS] = 0
  5436. L2_cache_stats_breakdown[GLOBAL_ACC_W][MSHR_HIT] = 0
  5437. L2_cache_stats_breakdown[LOCAL_ACC_W][HIT] = 0
  5438. L2_cache_stats_breakdown[LOCAL_ACC_W][HIT_RESERVED] = 0
  5439. L2_cache_stats_breakdown[LOCAL_ACC_W][MISS] = 0
  5440. L2_cache_stats_breakdown[LOCAL_ACC_W][RESERVATION_FAIL] = 0
  5441. L2_cache_stats_breakdown[LOCAL_ACC_W][SECTOR_MISS] = 0
  5442. L2_cache_stats_breakdown[LOCAL_ACC_W][MSHR_HIT] = 0
  5443. L2_cache_stats_breakdown[L1_WRBK_ACC][HIT] = 0
  5444. L2_cache_stats_breakdown[L1_WRBK_ACC][HIT_RESERVED] = 0
  5445. L2_cache_stats_breakdown[L1_WRBK_ACC][MISS] = 0
  5446. L2_cache_stats_breakdown[L1_WRBK_ACC][RESERVATION_FAIL] = 0
  5447. L2_cache_stats_breakdown[L1_WRBK_ACC][SECTOR_MISS] = 0
  5448. L2_cache_stats_breakdown[L1_WRBK_ACC][MSHR_HIT] = 0
  5449. L2_cache_stats_breakdown[L2_WRBK_ACC][HIT] = 0
  5450. L2_cache_stats_breakdown[L2_WRBK_ACC][HIT_RESERVED] = 0
  5451. L2_cache_stats_breakdown[L2_WRBK_ACC][MISS] = 0
  5452. L2_cache_stats_breakdown[L2_WRBK_ACC][RESERVATION_FAIL] = 0
  5453. L2_cache_stats_breakdown[L2_WRBK_ACC][SECTOR_MISS] = 0
  5454. L2_cache_stats_breakdown[L2_WRBK_ACC][MSHR_HIT] = 0
  5455. L2_cache_stats_breakdown[INST_ACC_R][HIT] = 0
  5456. L2_cache_stats_breakdown[INST_ACC_R][HIT_RESERVED] = 0
  5457. L2_cache_stats_breakdown[INST_ACC_R][MISS] = 0
  5458. L2_cache_stats_breakdown[INST_ACC_R][RESERVATION_FAIL] = 0
  5459. L2_cache_stats_breakdown[INST_ACC_R][SECTOR_MISS] = 0
  5460. L2_cache_stats_breakdown[INST_ACC_R][MSHR_HIT] = 0
  5461. L2_cache_stats_breakdown[L1_WR_ALLOC_R][HIT] = 0
  5462. L2_cache_stats_breakdown[L1_WR_ALLOC_R][HIT_RESERVED] = 0
  5463. L2_cache_stats_breakdown[L1_WR_ALLOC_R][MISS] = 0
  5464. L2_cache_stats_breakdown[L1_WR_ALLOC_R][RESERVATION_FAIL] = 0
  5465. L2_cache_stats_breakdown[L1_WR_ALLOC_R][SECTOR_MISS] = 0
  5466. L2_cache_stats_breakdown[L1_WR_ALLOC_R][MSHR_HIT] = 0
  5467. L2_cache_stats_breakdown[L2_WR_ALLOC_R][HIT] = 0
  5468. L2_cache_stats_breakdown[L2_WR_ALLOC_R][HIT_RESERVED] = 0
  5469. L2_cache_stats_breakdown[L2_WR_ALLOC_R][MISS] = 0
  5470. L2_cache_stats_breakdown[L2_WR_ALLOC_R][RESERVATION_FAIL] = 0
  5471. L2_cache_stats_breakdown[L2_WR_ALLOC_R][SECTOR_MISS] = 0
  5472. L2_cache_stats_breakdown[L2_WR_ALLOC_R][MSHR_HIT] = 0
  5473. L2_cache_stats_breakdown[GLOBAL_ACC_R][TOTAL_ACCESS] = 546
  5474. L2_cache_stats_breakdown[GLOBAL_ACC_W][TOTAL_ACCESS] = 2
  5475. L2_total_cache_reservation_fail_breakdown:
  5476. L2_cache_stats_fail_breakdown[GLOBAL_ACC_R][MSHR_MERGE_ENRTY_FAIL] = 268
  5477. L2_cache_data_port_util = 0.002
  5478. L2_cache_fill_port_util = 0.000
  5479.  
  5480. icnt_total_pkts_mem_to_simt=548
  5481. icnt_total_pkts_simt_to_mem=548
  5482. LD_mem_lat_dist 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  5483. ST_mem_lat_dist 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
  5484. ----------------------------Interconnect-DETAILS--------------------------------
  5485. ----------------------------NOC-DETAILS--------------------------------
  5486. Class 0:
  5487. Packet latency average = 5.11429
  5488. minimum = 5
  5489. maximum = 7
  5490. Network latency average = 5.11429
  5491. minimum = 5
  5492. maximum = 7
  5493. Slowest packet = 3
  5494. Flit latency average = 5.11429
  5495. minimum = 5
  5496. maximum = 7
  5497. Slowest flit = 3
  5498. Fragmentation average = 0
  5499. minimum = 0
  5500. maximum = 0
  5501. Injected packet rate average = 0.000125029
  5502. minimum = 0 (at node 1)
  5503. maximum = 0.0038866 (at node 62)
  5504. Accepted packet rate average = 0.000125029
  5505. minimum = 0 (at node 1)
  5506. maximum = 0.0038866 (at node 62)
  5507. Injected flit rate average = 0.000125029
  5508. minimum = 0 (at node 1)
  5509. maximum = 0.0038866 (at node 62)
  5510. Accepted flit rate average= 0.000125029
  5511. minimum = 0 (at node 1)
  5512. maximum = 0.0038866 (at node 62)
  5513. Injected packet length average = 1
  5514. Accepted packet length average = 1
  5515. Total in-flight flits = 0 (0 measured)
  5516. ====== Overall Traffic Statistics ======
  5517. ====== Traffic class 0 ======
  5518. Packet latency average = 5.11429 (1 samples)
  5519. minimum = 5 (1 samples)
  5520. maximum = 7 (1 samples)
  5521. Network latency average = 5.11429 (1 samples)
  5522. minimum = 5 (1 samples)
  5523. maximum = 7 (1 samples)
  5524. Flit latency average = 5.11429 (1 samples)
  5525. minimum = 5 (1 samples)
  5526. maximum = 7 (1 samples)
  5527. Fragmentation average = 0 (1 samples)
  5528. minimum = 0 (1 samples)
  5529. maximum = 0 (1 samples)
  5530. Injected packet rate average = 0.000125029 (1 samples)
  5531. minimum = 0 (1 samples)
  5532. maximum = 0.0038866 (1 samples)
  5533. Accepted packet rate average = 0.000125029 (1 samples)
  5534. minimum = 0 (1 samples)
  5535. maximum = 0.0038866 (1 samples)
  5536. Injected flit rate average = 0.000125029 (1 samples)
  5537. minimum = 0 (1 samples)
  5538. maximum = 0.0038866 (1 samples)
  5539. Accepted flit rate average = 0.000125029 (1 samples)
  5540. minimum = 0 (1 samples)
  5541. maximum = 0.0038866 (1 samples)
  5542. Injected packet size average = 1 (1 samples)
  5543. Accepted packet size average = 1 (1 samples)
  5544. Hops average = 1 (1 samples)
  5545. ----------------------------chLet-DETAILS--------------------------------
  5546. Class 0:
  5547. Packet latency average = 12.1462
  5548. minimum = 5
  5549. maximum = 75
  5550. Network latency average = 12.0273
  5551. minimum = 5
  5552. maximum = 74
  5553. Slowest packet = 28
  5554. Flit latency average = 12.0273
  5555. minimum = 5
  5556. maximum = 74
  5557. Slowest flit = 89
  5558. Fragmentation average = 0
  5559. minimum = 0
  5560. maximum = 0
  5561. Injected packet rate average = 0.00733025
  5562. minimum = 0.00377229 (at node 4)
  5563. maximum = 0.0585277 (at node 15)
  5564. Accepted packet rate average = 0.00733025
  5565. minimum = 0.00377229 (at node 4)
  5566. maximum = 0.0585277 (at node 15)
  5567. Injected flit rate average = 0.00733025
  5568. minimum = 0.00377229 (at node 4)
  5569. maximum = 0.0585277 (at node 15)
  5570. Accepted flit rate average= 0.00733025
  5571. minimum = 0.00377229 (at node 4)
  5572. maximum = 0.0585277 (at node 15)
  5573. Injected packet length average = 1
  5574. Accepted packet length average = 1
  5575. Total in-flight flits = 0 (0 measured)
  5576. ====== Overall Traffic Statistics ======
  5577. ====== Traffic class 0 ======
  5578. Packet latency average = 12.1462 (1 samples)
  5579. minimum = 5 (1 samples)
  5580. maximum = 75 (1 samples)
  5581. Network latency average = 12.0273 (1 samples)
  5582. minimum = 5 (1 samples)
  5583. maximum = 74 (1 samples)
  5584. Flit latency average = 12.0273 (1 samples)
  5585. minimum = 5 (1 samples)
  5586. maximum = 74 (1 samples)
  5587. Fragmentation average = 0 (1 samples)
  5588. minimum = 0 (1 samples)
  5589. maximum = 0 (1 samples)
  5590. Injected packet rate average = 0.00733025 (1 samples)
  5591. minimum = 0.00377229 (1 samples)
  5592. maximum = 0.0585277 (1 samples)
  5593. Accepted packet rate average = 0.00733025 (1 samples)
  5594. minimum = 0.00377229 (1 samples)
  5595. maximum = 0.0585277 (1 samples)
  5596. Injected flit rate average = 0.00733025 (1 samples)
  5597. minimum = 0.00377229 (1 samples)
  5598. maximum = 0.0585277 (1 samples)
  5599. Accepted flit rate average = 0.00733025 (1 samples)
  5600. minimum = 0.00377229 (1 samples)
  5601. maximum = 0.0585277 (1 samples)
  5602. Injected packet size average = 1 (1 samples)
  5603. Accepted packet size average = 1 (1 samples)
  5604. Hops average = 1 (1 samples)
  5605. ----------------------------END-of-Interconnect-DETAILS-------------------------
  5606.  
  5607.  
  5608. gpgpu_simulation_time = 0 days, 0 hrs, 0 min, 48 sec (48 sec)
  5609. gpgpu_simulation_rate = 85145 (inst/sec)
  5610. gpgpu_simulation_rate = 182 (cycle/sec)
  5611. gpgpu_silicon_slowdown = 6219780x
  5612. GPGPU-Sim: *** simulation thread exiting ***
  5613. GPGPU-Sim: *** exit detected ***
  5614. mnaderan@rtx3080:test$
  5615.  