accelsim-chiplet

mnaderan@rtx3080:~$ git clone -b dev https://github.ugent.be/mnaderan/accelsim-chiplet
Cloning into 'accelsim-chiplet'...
Username for 'https://github.ugent.be': mnaderan
Password for 'https://mnaderan@github.ugent.be':
remote: Enumerating objects: 5949, done.
remote: Counting objects: 100% (534/534), done.
remote: Compressing objects: 100% (261/261), done.
remote: Total 5949 (delta 328), reused 424 (delta 269), pack-reused 5415
Receiving objects: 100% (5949/5949), 1.52 MiB | 11.17 MiB/s, done.
Resolving deltas: 100% (3721/3721), done.
mnaderan@rtx3080:~$ cd accelsim-chiplet/
mnaderan@rtx3080:accelsim-chiplet$ source gpu-simulator/setup_environment.sh
Cloning into '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'...
Username for 'https://github.ugent.be': mnaderan
Password for 'https://mnaderan@github.ugent.be':
remote: Enumerating objects: 15686, done.
remote: Counting objects: 100% (237/237), done.
remote: Compressing objects: 100% (177/177), done.
remote: Total 15686 (delta 115), reused 92 (delta 60), pack-reused 15449
Receiving objects: 100% (15686/15686), 34.79 MiB | 11.03 MiB/s, done.
Resolving deltas: 100% (11729/11729), done.
Already on 'dev'
Your branch is up to date with 'origin/dev'.
GPGPU-Sim version 4.2.0 (build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa-modified_594.0) configured with AccelWattch.

----------------------------------------------------------------------------
INFO - If you only care about PTX execution, ignore this message. GPGPU-Sim supports PTX execution in modern CUDA.
If you want to run PTXPLUS (sm_1x SASS) with a modern card configuration - set the environment variable
$PTXAS_CUDA_INSTALL_PATH to point to a CUDA version compatible with your card configurations (i.e. 8+ for PASCAL, 9+ for VOLTA etc..)
For example: "export PTXAS_CUDA_INSTALL_PATH=/usr/local/cuda-9.1"

The following text describes why:
If you are using PTXPLUS, only sm_1x is supported and it requires that the app and simulator binaries are compiled in CUDA 4.2 or less.
The simulator requires it since CUDA headers describe struct sizes in the exec which change from gen to gen.
The apps require 4.2 because new versions of CUDA tools have dropped parsing support for generating sm_1x
When running using modern config (i.e. volta) and PTXPLUS with CUDA 4.2, the $PTXAS_CUDA_INSTALL_PATH env variable is required to get proper register usage
(and hence occupancy) using a version of CUDA that knows the register usage on the real card.

----------------------------------------------------------------------------
setup_environment succeeded
mnaderan@rtx3080:accelsim-chiplet$ make -j -C gpu-simulator/
make: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator'
if [ ! -d ./bin/release ]; then mkdir -p ./bin/release; fi;
if [ ! -d ./build/release ]; then mkdir -p ./build/release; fi;
touch ./build/release/main.makedepend
makedepend -f./build/release/main.makedepend -p./build/release/ main.cc 2> /dev/null
make -C trace-driven depend
make -C trace-parser depend
make -C gpgpu-sim
echo "const char *g_accelsim_version=\"accelsim-commit-9e19b6621cc94d89d8f7b18e65d802ad6979c195_modified_0.0\";" > ./build/release/accelsim_version.h
make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
g++ -Wall -O3 -g3 -fPIC -std=c++11  -I./build/release -I./trace-driven -I./trace-parser -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/usr/local/cuda-11.6/include -c main.cc -o ./build/release/main.o
make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'
touch ../build/release/trace-driven.Makefile.makedepend
touch ../build/release/trace-parser.Makefile.makedepend
makedepend -f../build/release/trace-driven.Makefile.makedepend -p../build/release/ trace_driven.cc 2> /dev/null
makedepend -f../build/release/trace-parser.Makefile.makedepend -p../build/release/ trace_parser.cc 2> /dev/null
make[1]: 'depend' is up to date.
make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
make -C trace-parser
make[1]: 'depend' is up to date.
make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
make -C trace-driven
make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
make[1]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
touch ../build/release/trace-parser.Makefile.makedepend
makedepend -f../build/release/trace-parser.Makefile.makedepend -p../build/release/ trace_parser.cc 2> /dev/null
touch ../build/release/trace-driven.Makefile.makedepend
makedepend -f../build/release/trace-driven.Makefile.makedepend -p../build/release/ trace_driven.cc 2> /dev/null
g++ -O3 -g3 -fPIC -std=c++11 -Wall -I/usr/local/cuda-11.6/include -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -c trace_driven.cc -o ../build/release/trace_driven.o
g++ -O3 -g3 -fPIC -std=c++11 -Wall -I/usr/local/cuda-11.6/include -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -c trace_parser.cc -o ../build/release/trace_parser.o

        Building GPGPU-Sim version 4.2.0 (build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa_modified_0.0) with CUDA version 11.6

if [ ! -d lib/gcc-9.4.0/cuda-11060/release ]; then mkdir -p lib/gcc-9.4.0/cuda-11060/release; fi;
Warning: gpgpu-sim is building without opencl support. Make sure NVOPENCL_LIBDIR and NVOPENCL_INCDIR are set
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl/bin ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libopencl/bin; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2 ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch; fi;
if [ ! -d /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti ]; then mkdir -p /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti; fi;
make -C ./src/cuda-sim/ depend
make -C /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ depend
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/ Ucache.cc XML_Parse.cc arbiter.cc area.cc array.cc bank.cc basic_circuit.cc basic_components.cc cacti_interface.cc component.cc core.cc crossbar.cc decoder.cc htree2.cc interconnect.cc io.cc iocontrollers.cc logic.cc main.cc mat.cc memoryctrl.cc noc.cc nuca.cc parameter.cc processor.cc router.cc sharedcache.cc subarray.cc technology.cc uca.cc wire.cc xmlParser.cc gpgpu_sim_wrapper.cc  2> /dev/null
make -C ./cacti/ depend
make[4]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
make[5]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/ area.cc bank.cc mat.cc main.cc Ucache.cc io.cc technology.cc basic_circuit.cc parameter.cc decoder.cc component.cc uca.cc subarray.cc wire.cc htree2.cc cacti_interface.cc router.cc nuca.cc crossbar.cc arbiter.cc  2> /dev/null
make[5]: Nothing to be done for 'depend'.
make[5]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
make[4]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
make[3]: Nothing to be done for 'depend'.
make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
make -C /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
mkdir -p obj_opt
make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/ Ucache.cc XML_Parse.cc arbiter.cc area.cc array.cc bank.cc basic_circuit.cc basic_components.cc cacti_interface.cc component.cc core.cc crossbar.cc decoder.cc htree2.cc interconnect.cc io.cc iocontrollers.cc logic.cc main.cc mat.cc memoryctrl.cc noc.cc nuca.cc parameter.cc processor.cc router.cc sharedcache.cc subarray.cc technology.cc uca.cc wire.cc xmlParser.cc gpgpu_sim_wrapper.cc  2> /dev/null
make -C ./cacti/ depend
make[4]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
make[5]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti/ area.cc bank.cc mat.cc main.cc Ucache.cc io.cc technology.cc basic_circuit.cc parameter.cc decoder.cc component.cc uca.cc subarray.cc wire.cc htree2.cc cacti_interface.cc router.cc nuca.cc crossbar.cc arbiter.cc  2> /dev/null
make[5]: Nothing to be done for 'depend'.
make[5]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
make[4]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti'
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/Ucache.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Ucache.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c XML_Parse.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/XML_Parse.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/arbiter.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/arbiter.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/area.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/area.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c array.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/array.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/bank.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/bank.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/basic_circuit.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_circuit.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c basic_components.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_components.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/cacti_interface.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti_interface.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/component.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/component.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c core.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/core.o
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/crossbar.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/crossbar.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/decoder.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/decoder.o
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ cuda_device_printf.cc cuda_device_runtime.cc cuda-sim.cc instructions.cc memory.cc ptx_ir.cc ptx_loader.cc ptx_parser.cc ptx_sim.cc ptx-stats.cc 2> /dev/null
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/htree2.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/htree2.o
make[2]: 'depend' is up to date.
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
make -C ./src/cuda-sim/
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c interconnect.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/interconnect.o
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/io.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/io.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c iocontrollers.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/iocontrollers.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c logic.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/logic.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c main.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/main.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/mat.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mat.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c memoryctrl.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/memoryctrl.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c noc.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/noc.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/nuca.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/nuca.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/parameter.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/parameter.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c processor.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/processor.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/router.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/router.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c sharedcache.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/sharedcache.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/subarray.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/subarray.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/technology.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/technology.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/uca.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/uca.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c cacti/wire.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/wire.o
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c xmlParser.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/xmlParser.o
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:217,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
                 from main.cc:13:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
g++ -m64 -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti -lz  -c gpgpu_sim_wrapper.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/gpgpu_sim_wrapper.o
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
                 from main.cc:13:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
                 from main.cc:13:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ cuda_device_printf.cc cuda_device_runtime.cc cuda-sim.cc instructions.cc memory.cc ptx_ir.cc ptx_loader.cc ptx_parser.cc ptx_sim.cc ptx-stats.cc 2> /dev/null
bison --name-prefix=ptx_ -v -d ptx.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx
ptx.y: warning: 1 nonterminal useless in grammar [-Wother]
ptx.y: warning: 2 rules useless in grammar [-Wother]
ptx.y:573.1-7: warning: nonterminal useless in grammar: vp_spec [-Wother]
  573 | vp_spec: WMMA_DIRECTIVE LAYOUT CONFIGURATION{recognizer->add_space_spec(global_space,0);recognizer->add_ptr_spec(global_space);recognizer->add_wmma_option($1);recognizer...
      | ^~~~~~~
ptx.y: warning: 57 reduce/reduce conflicts [-Wconflicts-rr]
ptx.y:277.11-112: warning: rule useless in parser due to conflicts [-Wother]
  277 |         | WEAK_DIRECTIVE FUNC_DIRECTIVE { $$ = 0; recognizer->g_func_decl=1; recognizer->func_header(".func"); }
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x memory.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/memory.o
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/gpgpu_context.h:3,
                 from main.cc:13:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x decuda_pred_table/decuda_pred_table.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/decuda_pred_table.o
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
flex --outfile=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c ptx.l
ptx.l:173: undeclared start condition IN_INST
ptx.l:237: warning, rule cannot be matched
bison --name-prefix=ptxinfo_ -v -d ptxinfo.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo
ptxinfo.y: warning: 2 reduce/reduce conflicts [-Wconflicts-rr]
flex --outfile=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c ptxinfo.l
created /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h
In file included from memory.h:32,
                 from memory.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from memory.h:32,
                 from memory.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
cat /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.h | grep "=" | sed 's/^[ ]\+//' | sed 's/[=,]//g' | sed 's/\([_A-Z1-9]\+\)[ ]\+\([0-9]\+\)/\1 \1/' | sed 's/^/DEF(/' | sed 's/ /,"/' | sed 's/$/")/' > /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser_decode.def
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda_device_printf.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_printf.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x instructions.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.o
main.cc: In function ‘int main(int, const char**)’:
main.cc:157:29: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<long unsigned int>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  157 |           for (int l = 0; l < busy_streams.size(); l++) {
      |                           ~~^~~~~~~~~~~~~~~~~~~~~
main.cc:164:27: warning: deleting object of polymorphic class type ‘function_info’ which has non-virtual destructor might cause undefined behavior [-Wdelete-non-virtual-dtor]
  164 |           delete k->entry();
      |                           ^
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda-sim.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda-sim.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_ir.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_ir.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_sim.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_sim.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx-stats.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx-stats.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x -DYYDEBUG /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x -DYYDEBUG /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x cuda_device_runtime.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_runtime.o
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from memory.cc:31:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from cuda-math.h:354,
                 from instructions.cc:53:
/usr/local/cuda-11.6/include/math_functions.h:54:2: warning: #warning "math_functions.h is an internal header file and must not be used directly.  This file will be removed in a future CUDA release.  Please use cuda_runtime_api.h or cuda_runtime.h instead." [-Wcpp]
   54 | #warning "math_functions.h is an internal header file and must not be used directly.  This file will be removed in a future CUDA release.  Please use cuda_runtime_api.h or cuda_runtime.h instead."
      |  ^~~~~~~
In file included from cuda-sim.h:36,
                 from cuda-sim.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from cuda-sim.h:36,
                 from cuda-sim.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx-stats.cc:32:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx-stats.cc:32:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 32; FILE = _IO_FILE]’:
memory.cc:182:16:   required from here
memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  172 |     fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
      |                       ~~~^                    ~~~~~~~~~~~~~
      |                          |                            |
      |                          unsigned int                 long long unsigned int
      |                       %08llx
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 32; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
memory.cc:182:16:   required from here
memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  112 |         "addr=0x%x, length=%zu\n",
      |                 ~^
      |                  |
      |                  unsigned int
      |                 %llx
  113 |         m_name.c_str(), addr, length);
      |                         ~~~~
      |                         |
      |                         mem_addr_t {aka long long unsigned int}
memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                         ~~^
      |                                           |
      |                                           long unsigned int
      |                                         %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |         ~~~~~~~~~~~~~~~
      |               |
      |               long long unsigned int
memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                                 ~^
      |                                                  |
      |                                                  unsigned int
      |                                                 %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                          ~~~~~~~~~~~~~~~~~~~~~
      |                                        |
      |                                        mem_addr_t {aka long long unsigned int}
memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  116 |         "index=0x%x, BSIZE=0x%x\n",
      |                  ~^
      |                   |
      |                   unsigned int
      |                  %llx
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                                                 ~~~~~~~
      |                                                 |
      |                                                 mem_addr_t {aka long long unsigned int}
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 64; FILE = _IO_FILE]’:
memory.cc:183:16:   required from here
memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  172 |     fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
      |                       ~~~^                    ~~~~~~~~~~~~~
      |                          |                            |
      |                          unsigned int                 long long unsigned int
      |                       %08llx
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 64; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
memory.cc:183:16:   required from here
memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  112 |         "addr=0x%x, length=%zu\n",
      |                 ~^
      |                  |
      |                  unsigned int
      |                 %llx
  113 |         m_name.c_str(), addr, length);
      |                         ~~~~
      |                         |
      |                         mem_addr_t {aka long long unsigned int}
memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                         ~~^
      |                                           |
      |                                           long unsigned int
      |                                         %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |         ~~~~~~~~~~~~~~~
      |               |
      |               long long unsigned int
memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                                 ~^
      |                                                  |
      |                                                  unsigned int
      |                                                 %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                          ~~~~~~~~~~~~~~~~~~~~~
      |                                        |
      |                                        mem_addr_t {aka long long unsigned int}
memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  116 |         "index=0x%x, BSIZE=0x%x\n",
      |                  ~^
      |                   |
      |                   unsigned int
      |                  %llx
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                                                 ~~~~~~~
      |                                                 |
      |                                                 mem_addr_t {aka long long unsigned int}
In file included from cuda-sim.h:37,
                 from cuda-sim.cc:32:
../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 8192; FILE = _IO_FILE]’:
memory.cc:184:16:   required from here
memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  172 |     fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
      |                       ~~~^                    ~~~~~~~~~~~~~
      |                          |                            |
      |                          unsigned int                 long long unsigned int
      |                       %08llx
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 8192; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
memory.cc:184:16:   required from here
memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  112 |         "addr=0x%x, length=%zu\n",
      |                 ~^
      |                  |
      |                  unsigned int
      |                 %llx
  113 |         m_name.c_str(), addr, length);
      |                         ~~~~
      |                         |
      |                         mem_addr_t {aka long long unsigned int}
memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                         ~~^
      |                                           |
      |                                           long unsigned int
      |                                         %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |         ~~~~~~~~~~~~~~~
      |               |
      |               long long unsigned int
memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                                 ~^
      |                                                  |
      |                                                  unsigned int
      |                                                 %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                          ~~~~~~~~~~~~~~~~~~~~~
      |                                        |
      |                                        mem_addr_t {aka long long unsigned int}
memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  116 |         "index=0x%x, BSIZE=0x%x\n",
      |                  ~^
      |                   |
      |                   unsigned int
      |                  %llx
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                                                 ~~~~~~~
      |                                                 |
      |                                                 mem_addr_t {aka long long unsigned int}
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::print(const char*, FILE*) const [with unsigned int BSIZE = 16384; FILE = _IO_FILE]’:
memory.cc:185:16:   required from here
memory.cc:172:26: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
  172 |     fprintf(fout, "%s %08x:", m_name.c_str(), i_page->first);
      |                       ~~~^                    ~~~~~~~~~~~~~
      |                          |                            |
      |                          unsigned int                 long long unsigned int
      |                       %08llx
memory.cc: In instantiation of ‘void memory_space_impl<BSIZE>::read_single_block(mem_addr_t, mem_addr_t, size_t, void*) const [with unsigned int BSIZE = 16384; mem_addr_t = long long unsigned int; size_t = long unsigned int]’:
memory.cc:185:16:   required from here
memory.cc:112:18: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  112 |         "addr=0x%x, length=%zu\n",
      |                 ~^
      |                  |
      |                  unsigned int
      |                 %llx
  113 |         m_name.c_str(), addr, length);
      |                         ~~~~
      |                         |
      |                         mem_addr_t {aka long long unsigned int}
memory.cc:115:43: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 2 has type ‘long long unsigned int’ [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                         ~~^
      |                                           |
      |                                           long unsigned int
      |                                         %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |         ~~~~~~~~~~~~~~~
      |               |
      |               long long unsigned int
memory.cc:115:50: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  115 |         "GPGPU-Sim PTX: (addr+length)=0x%lx > 0x%x=(index+1)*BSIZE, "
      |                                                 ~^
      |                                                  |
      |                                                  unsigned int
      |                                                 %llx
  116 |         "index=0x%x, BSIZE=0x%x\n",
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                          ~~~~~~~~~~~~~~~~~~~~~
      |                                        |
      |                                        mem_addr_t {aka long long unsigned int}
memory.cc:116:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘mem_addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  116 |         "index=0x%x, BSIZE=0x%x\n",
      |                  ~^
      |                   |
      |                   unsigned int
      |                  %llx
  117 |         (addr + length), (blk_idx + 1) * BSIZE, blk_idx, BSIZE);
      |                                                 ~~~~~~~
      |                                                 |
      |                                                 mem_addr_t {aka long long unsigned int}
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx-stats.cc:32:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:217,
                 from ../ISA_Def/ampere_opcode.h:9,
                 from trace_driven.cc:40:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/stream_manager.h: In member function ‘bool CUevent_st::done() const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ptx_ir.h:32,
                 from cuda_device_printf.cc:30:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
cuda-sim.cc: In member function ‘void cuda_sim::ptx_print_insn(address_type, FILE*)’:
cuda-sim.cc:548:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  548 |     fprintf(fp, "<no instruction at address 0x%x>", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                                     |
      |                                                     address_type {aka long long unsigned int}
cuda-sim.cc: In member function ‘std::string cuda_sim::ptx_get_insn_str(address_type)’:
cuda-sim.cc:562:30: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  562 |     snprintf(buff, STR_SIZE, "<no instruction at address 0x%x>", pc);
      |                              ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                                                  |
      |                                                                  address_type {aka long long unsigned int}
cuda-sim.cc: In member function ‘void function_info::add_param_data(unsigned int, gpgpu_ptx_sim_arg*)’:
cuda-sim.cc:1374:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
 1374 |           "GPGPU-Sim PTX: deferred allocation of shared region for \"%s\" from "
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 1375 |           "0x%x to 0x%x (shared memory space)\n",
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 1376 |           p->name().c_str(), m_symtab->get_shared_next(),
      |                              ~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                                                       |
      |                                                       addr_t {aka long long unsigned int}
cuda-sim.cc:1374:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
cuda-sim.cc: In member function ‘void function_info::list_param(FILE*) const’:
cuda-sim.cc:1506:19: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
 1506 |     fprintf(fout, "%s: %#08x\n", name.c_str(), param_addr);
      |                   ^~~~~~~~~~~~~                ~~~~~~~~~~
      |                                                |
      |                                                addr_t {aka long long unsigned int}
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_parser.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser.o
cuda-sim.cc: In member function ‘void function_info::ptx_jit_config(std::map<long long unsigned int, long unsigned int>, memory_space*, gpgpu_t*, dim3, dim3)’:
cuda-sim.cc:1534:3: warning: NULL used in arithmetic [-Wpointer-arith]
 1534 |   assert(system(buff) != NULL);
      |   ^~~~~~
g++  -c -O3 -g3 -Wall -Wno-unused-function -Wno-sign-compare -I/usr/local/cuda-11.6/include  -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ -I. -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release -fPIC  -DTRACING_ON=1 -DCUDART_VERSION=11060 -std=c++0x ptx_loader.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_loader.o
In file included from ptx_ir.h:32,
                 from cuda_device_printf.cc:30:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
cuda-sim.cc: In member function ‘void ptx_thread_info::ptx_exec_inst(warp_inst_t&, unsigned int)’:
cuda-sim.cc:1879:11: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 12 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
 1879 |           "%u [thd=%u][i=%u] : ctaid=(%u,%u,%u) tid=(%u,%u,%u) icount=%u "
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 1880 |           "[pc=%u] (%s:%u - %s)  [0x%llx]\n",
      |           ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 1881 |           m_gpu->gpgpu_ctx->func_sim->g_ptx_sim_num_insn, get_uid(), pI->uid(),
 1882 |           ctaid.x, ctaid.y, ctaid.z, tid.x, tid.y, tid.z, get_icount(), pc,
      |                                                                         ~~
      |                                                                         |
      |                                                                         addr_t {aka long long unsigned int}
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptx.y:33:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ptx_ir.h:32,
                 from ptx_ir.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ptx_sim.h:32,
                 from ptx_sim.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ptx_ir.h:32,
                 from ptx_ir.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
cuda-sim.cc: In member function ‘void cuda_sim::read_sim_environment_variables()’:
cuda-sim.cc:2379:20: warning: format ‘%d’ expects argument of type ‘int*’, but argument 3 has type ‘addr_t*’ {aka ‘long long unsigned int*’} [-Wformat=]
 2379 |     sscanf(dbg_pc, "%d", &g_debug_pc);
      |                    ^~~~  ~~~~~~~~~~~
      |                          |
      |                          addr_t* {aka long long unsigned int*}
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptx.y:33:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ptx_sim.h:32,
                 from ptx_sim.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ./ptx_ir.h:32,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h:2,
                 from instructions.cc:32:
./../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
./../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptxinfo.l:42:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ./ptx_ir.h:32,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.h:2,
                 from instructions.cc:32:
./../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
./../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ../ISA_Def/ampere_opcode.h:9,
                 from trace_driven.cc:40:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptxinfo.l:42:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
./../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ./ptx_parser.h:32,
                 from ptx.l:43:
./../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
./../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ../ISA_Def/ampere_opcode.h:9,
                 from trace_driven.cc:40:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ./ptx_parser.h:32,
                 from ptx.l:43:
./../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
./../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from cuda_device_runtime.cc:12:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from cuda_device_runtime.cc:12:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptx.y:33:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from cuda_device_runtime.cc:12:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptxinfo.l:42:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/cuda-sim.h:37,
                 from trace_driven.cc:48:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from ptx_parser.h:32,
                 from ptx_parser.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
In file included from ptx_parser.h:32,
                 from ptx_parser.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:17: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                 ^~~~~~~~~~~~~~~~~~~~~~  ~~
      |                                         |
      |                                         address_type {aka long long unsigned int}
cuda-sim.cc: In member function ‘void function_info::ptx_jit_config(std::map<long long unsigned int, long unsigned int>, memory_space*, gpgpu_t*, dim3, dim3)’:
cuda-sim.cc:1536:8: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
 1536 |   fgets(buff, 1024, fp);
      |   ~~~~~^~~~~~~~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_ir.cc:44:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_sim.cc:34:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_loader.cc:34:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:36,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_loader.cc:34:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../../libcuda/../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../gpgpu-sim/gpu-sim.h:44,
                 from instructions.cc:51:
../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ./../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ./../../libcuda/gpgpu_context.h:3,
                 from ptx.l:46:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
./../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
trace_driven.cc: In member function ‘bool trace_warp_inst_t::parse_from_trace_struct(const inst_trace_t&, const std::unordered_map<std::__cxx11::basic_string<char>, OpcodeChar>*, const trace_config*, const kernel_trace_t*)’:
trace_driven.cc:199:5: warning: this ‘if’ clause does not guard... [-Wmisleading-indentation]
  199 |     if(it2 != OpcPowerMap->end())
      |     ^~
trace_driven.cc:201:7: note: ...this statement, but the latter is misleadingly indented as if it were guarded by the ‘if’
  201 |       oprnd_type = get_oprnd_type(op, sp_op);
      |       ^~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_loader.cc:34:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
ptx_sim.cc: In function ‘void print_reg(FILE*, std::string, ptx_reg_t, symbol_table*)’:
ptx_sim.cc:372:19: warning: format ‘%f’ expects argument of type ‘double’, but argument 3 has type ‘half_float::half’ [-Wformat=]
  372 |       fprintf(fp, ".f16 %f [0x%04x]\n", value.f16, (unsigned)value.u16);
      |                   ^~~~~~~~~~~~~~~~~~~~  ~~~~~~~~~
      |                                               |
      |                                               half_float::half
ptx_ir.cc: In member function ‘std::string ptx_instruction::to_string() const’:
ptx_ir.cc:1473:59: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
 1473 |         snprintf(buf + used_bytes, STR_SIZE - used_bytes, " PC=0x%03x ", m_PC);
      |                                                           ^~~~~~~~~~~~~  ~~~~
      |                                                                          |
      |                                                                          addr_t {aka long long unsigned int}
instructions.cc: In member function ‘void ptx_thread_info::print_reg_thread(char*)’:
instructions.cc:202:19: warning: format ‘%llu’ expects argument of type ‘long long unsigned int’, but argument 4 has type ‘const ptx_reg_t’ [-Wformat=]
  202 |       fprintf(fp, "%s %llu %s %d\n", name.c_str(), it->second, dec.c_str(),
      |                   ^~~~~~~~~~~~~~~~~                ~~~~~~~~~~
      |                                                        |
      |                                                        const ptx_reg_t
instructions.cc: In function ‘void mma_impl(const ptx_instruction*, core_t*, warp_inst_t)’:
instructions.cc:1951:27: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
 1951 |           nw_v[k].f16 = *((half *)&hex_val);
      |                          ~^~~~~~~~~~~~~~~~~
instructions.cc:1951:27: warning: dereferencing type-punned pointer will break strict-aliasing rules [-Wstrict-aliasing]
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from ptx_parser.cc:30:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
ptx_loader.cc: In member function ‘void gpgpu_context::print_ptx_file(const char*, unsigned int, const char*)’:
ptx_loader.cc:98:27: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 4 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
   98 |       snprintf(pc, 64, "%4u", pI->get_PC());
      |                         ~~^   ~~~~~~~~~~~~
      |                           |             |
      |                           unsigned int  addr_t {aka long long unsigned int}
      |                         %4llu
ptx_parser.cc: In member function ‘void ptx_recognizer::end_function()’:
ptx_parser.cc:209:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  209 |   PTX_PARSE_DPRINTF("function %s, PC = %d\n", g_func_info->get_name().c_str(),
      |                     ^~~~~~~~~~~~~~~~~~~~~~~~
  210 |                     g_func_info->get_start_PC());
      |                     ~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                                              |
      |                                              addr_t {aka long long unsigned int}
ptx_parser.cc:55:12: note: in definition of macro ‘PTX_PARSE_DPRINTF’
   55 |     printf(__VA_ARGS__);                                                  \
      |            ^~~~~~~~~~~
ptx_parser.cc: In member function ‘void ptx_recognizer::add_identifier(const char*, int, unsigned int)’:
ptx_parser.cc:488:11: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  488 |           "GPGPU-Sim PTX: allocating stack frame region for .param \"%s\" from "
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  489 |           "0x%x to 0x%lx\n",
      |           ~~~~~~~~~~~~~~~~~
  490 |           identifier, g_current_symbol_table->get_local_next(),
      |                       ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      |                                                             |
      |                                                             addr_t {aka long long unsigned int}
ptx_parser.cc:488:11: warning: format ‘%lx’ expects argument of type ‘long unsigned int’, but argument 4 has type ‘long long unsigned int’ [-Wformat=]
ptx_parser.cc: In member function ‘void ptx_recognizer::add_constptr(const char*, const char*, int)’:
ptx_parser.cc:524:10: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘addr_t’ {aka ‘long long unsigned int’} [-Wformat=]
  524 |   printf("GPGPU-Sim PTX: moving \"%s\" from 0x%x to 0x%x (%s+%x)\n",
      |          ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  525 |          identifier1, s1->get_address(), addr + offset, identifier2, offset);
      |                       ~~~~~~~~~~~~~~~~~
      |                                      |
      |                                      addr_t {aka long long unsigned int}
make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-parser'
In file included from /usr/include/string.h:495,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.c:243:
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:87:24:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ specified bound 1024 equals destination size [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:85:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:84:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:83:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:82:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:81:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:80:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:79:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:78:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:77:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:75:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:74:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:72:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:71:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:70:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:69:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:68:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:67:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:66:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:65:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:64:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:63:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:62:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:61:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:60:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:59:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:58:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:57:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:56:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptxinfo_lex(YYSTYPE*, yyscan_t, ptxinfo_data*)’ at ptxinfo.l:55:1:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ output truncated before terminating nul copying as many bytes from a string as its length [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
cuda_device_runtime.cc: In member function ‘void cuda_device_runtime::gpgpusim_cuda_getParameterBufferV2(const ptx_instruction*, ptx_thread_info*, const function_info*)’:
cuda_device_runtime.cc:93:48: warning: ‘child_kernel_entry’ may be used uninitialized in this function [-Wmaybe-uninitialized]
   93 |   g_cuda_device_launch_param_map[param_buffer] = device_launch_config;
cuda_device_runtime.cc: In member function ‘void cuda_device_runtime::gpgpusim_cuda_launchDeviceV2(const ptx_instruction*, ptx_thread_info*, const function_info*)’:
cuda_device_runtime.cc:126:29: warning: ‘device_launch_op’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  126 |   device_launch_operation_t device_launch_op;
      |                             ^~~~~~~~~~~~~~~~
In file included from /usr/include/string.h:495,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.c:243:
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘int ptx_lex(YYSTYPE*, yyscan_t, ptx_recognizer*)’ at ptx.l:436:27:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: warning: ‘char* __builtin_strncpy(char*, const char*, long unsigned int)’ specified bound 4096 equals destination size [-Wstringop-truncation]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc: In member function ‘void gpgpu_context::gpgpu_ptx_info_load_from_filename(const char*, unsigned int)’:
ptx_loader.cc:357:37: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  357 |       "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file  /dev/null 2> %s",
      |                                     ^~
  358 |       extra_flags, filename, ptxas_filename.c_str());
      |       ~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 63 or more bytes (assuming 1086) into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc: In function ‘char* get_app_binary_name()’:
ptx_loader.cc:341:25: warning: ‘self_exe_path’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  341 |   self_exe_path = strtok(self_exe_path, ".");
      |                   ~~~~~~^~~~~~~~~~~~~~~~~~~~
ptx_loader.cc: In member function ‘char* ptxinfo_data::gpgpu_ptx_sim_convert_ptx_and_sass_to_ptxplus(std::string, std::string, std::string)’:
ptx_loader.cc:125:43: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 941 [-Wformat-truncation=]
  125 |            "cuobjdump_to_ptxplus %s %s %s %s",
      |                                           ^~
  126 |            ptxfilename.c_str(), sassfilename.c_str(), elffilename.c_str(),
  127 |            fname_ptxplus);
      |            ~~~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 84 or more bytes (assuming 1107) into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:152:43: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  152 |     snprintf(rm_commandline, 1024, "rm -f %s", fname_ptxplus);
      |                                           ^~   ~~~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 7 and 1030 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc: In function ‘void fix_duplicate_errors(char*)’:
ptx_loader.cc:217:38: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1020 [-Wformat-truncation=]
  217 |   snprintf(commandline, 1024, "mv %s %s", fname2, tempfile);
      |                                      ^~           ~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 5 or more bytes (assuming 1028) into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:311:38: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  311 |   snprintf(commandline, 1024, "rm -f %s", tempfile);
      |                                      ^~   ~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 7 and 1030 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:269:55: warning: ‘funcptr’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  269 |       fwrite(startptr, sizeof(char), funcptr - offset + 1 - startptr, ptxdest);
      |                                      ~~~~~~~~~~~~~~~~~^~~
ptx_loader.cc: In member function ‘void gpgpu_context::gpgpu_ptxinfo_load_from_string(const char*, unsigned int, unsigned int, int)’:
ptx_loader.cc:421:41: warning: ‘info’ directive output may be truncated writing 4 bytes into a region of size between 1 and 1024 [-Wformat-truncation=]
  421 |     snprintf(tempfile_ptxinfo, 1024, "%sinfo", fname);
      |                                         ^~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 5 and 1028 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:444:50: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 989 [-Wformat-truncation=]
  444 |              "$PTXAS_CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file  "
      |                                                  ^~
  445 |              "/dev/null 2> %s",
  446 |              extra_flags, fname2, tempfile_ptxinfo);
      |              ~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 69 and 3138 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:463:48: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  463 |                  "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file  "
      |                                                ^~
  464 |                  "/dev/null 2> %s",
  465 |                  extra_flags, fname2, tempfile_ptxinfo);
      |                  ~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 63 and 3132 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:512:41: warning: ‘info’ directive output may be truncated writing 4 bytes into a region of size between 1 and 1024 [-Wformat-truncation=]
  512 |     snprintf(tempfile_ptxinfo, 1024, "%sinfo", fname);
      |                                         ^~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 5 and 1028 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:527:39: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 995 [-Wformat-truncation=]
  527 |         "$CUDA_INSTALL_PATH/bin/ptxas %s -v %s --output-file  /dev/null 2> %s",
      |                                       ^~
  528 |         extra_flags, fname2, tempfile_ptxinfo);
      |         ~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 63 and 3132 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:575:42: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  575 |       snprintf(commandline, 1024, "rm -f %s %s %s", fname, fname2,
      |                                          ^~         ~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 9 and 3078 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ptx_loader.cc:572:42: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 1018 [-Wformat-truncation=]
  572 |       snprintf(commandline, 1024, "rm -f %s %s %s", fname, fname2,
      |                                          ^~         ~~~~~
In file included from /usr/include/stdio.h:867,
                 from /usr/include/c++/9/cstdio:42,
                 from /usr/include/c++/9/ext/string_conversions.h:43,
                 from /usr/include/c++/9/bits/basic_string.h:6496,
                 from /usr/include/c++/9/string:55,
                 from ptx_loader.h:31,
                 from ptx_loader.cc:29:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output between 9 and 3078 bytes into a destination of size 1024
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/trace-driven'
ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/libgpgpu_ptx_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_loader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_printf.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/instructions.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_ir.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx_sim.o  /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/memory.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx-stats.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/decuda_pred_table.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptx_.o 
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/ptxinfo.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/lex.ptxinfo_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/cuda_device_runtime.o
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/cuda-sim'
make -C ./src/gpgpu-sim/ depend
make -C ./libcuda/ depend
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
make -C ./cuobjdump_to_ptxplus/ depend
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/Makefile.makedepend cuobjdumpInst.cc cuobjdumpInstList.cc cuobjdump_to_ptxplus.cc 2> /dev/null
make[2]: 'depend' is up to date.
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
make -C ./cuobjdump_to_ptxplus/
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInst.o cuobjdumpInst.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInstList.o cuobjdumpInstList.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus.o cuobjdump_to_ptxplus.cc
bison -t -d --report=all --verbose --name-prefix=ptx_ -v ptx.y  --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx
flex -B  -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.c ptx.l
bison -t -d --report=all --verbose -p sass_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc sass.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc
bison -t -d --report=all --verbose -p elf_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc elf.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc
bison -t -d --report=all --verbose -p header_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc header.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus//home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc
ptx.l:162: undeclared start condition IN_INST
ptx.y: warning: 1 nonterminal useless in grammar [-Wother]
ptx.y: warning: 2 rules useless in grammar [-Wother]
ptx.y:544.1-7: warning: nonterminal useless in grammar: vp_spec [-Wother]
  544 | vp_spec: WMMA_DIRECTIVE LAYOUT CONFIGURATION{add_space_spec(global_space,0);add_ptr_spec(global_space);add_wmma_option($1);add_wmma_option($2);add_wmma_option($3);}
      | ^~~~~~~
ptx.l:215: warning, rule cannot be matched
ptx.y: warning: 57 reduce/reduce conflicts [-Wconflicts-rr]
ptx.y:266.11-88: warning: rule useless in parser due to conflicts [-Wother]
  266 |         | WEAK_DIRECTIVE FUNC_DIRECTIVE { $$ = 0; g_func_decl=1; func_header(".func"); }
      |           ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
:
flex -B  -P elf_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.cc elf.l
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.cc
:
flex -B  -P header_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.cc header.l
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.cc
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/ cuda_runtime_api.cc 2> /dev/null
make[2]: 'depend' is up to date.
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
make[2]: 'depend' is up to date.
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
make -C ./libcuda/
make -C ./src/gpgpu-sim/
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/ cuda_runtime_api.cc 2> /dev/null
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
g++  -std=c++0x -O3 -g -Wall -Wno-unused-function -Wno-sign-compare -fPIC  -DCUDART_VERSION=11060 -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include  -c cuda_runtime_api.cc -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o
bison -t -d -v --report=all -p cuobjdump_  -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.c cuobjdump.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o -c addrdec.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o -c dram.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o -c dram_sched.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o -c gpu-cache.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o -c gpu-misc.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o -c gpu-sim.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o -c hashing.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o -c histogram.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o -c icnt_wrapper.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o -c l2cache.cc
:
flex -B  -P cuobjdump_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.c cuobjdump.l
g++  -Wall -Wno-unused-function -Wno-sign-compare -fPIC -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include  -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
:
flex -B  -P sass_ -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.cc sass.l
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o -c local_interconnect.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o -c mem_fetch.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o -c mem_latency_stat.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.o
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.o
g++  -Wall -Wno-unused-function -Wno-sign-compare -fPIC -I./ -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda -I/usr/local/cuda-11.6/include  -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o -c power_interface.cc
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus -I /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/../cuda-sim/ -I . -I ../src/cuda-sim/ -c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o -c power_stat.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o -c scoreboard.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o -c shader.cc
In file included from cuda_runtime_api.cc:127:
/usr/local/cuda-11.6/include/host_defines.h:54:2: warning: #warning "host_defines.h is an internal header file and must not be used directly.  This file will be removed in a future CUDA release.  Please use cuda_runtime_api.h or cuda_runtime.h instead." [-Wcpp]
   54 | #warning "host_defines.h is an internal header file and must not be used directly.  This file will be removed in a future CUDA release.  Please use cuda_runtime_api.h or cuda_runtime.h instead."
      |  ^~~~~~~
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o -c stack.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o -c stat-tool.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o -c traffic_breakdown.cc
g++  -O3 -g3 -fPIC -DCUDART_VERSION=11060 -Wall -DTRACING_ON=1 -std=c++0x -I/usr/local/cuda-11.6/include -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/ -DGPGPUSIM_POWER_MODEL -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o -c visualizer.cc
cuobjdumpInstList.cc: In member function ‘std::string cuobjdumpInstList::parseCuobjdumpRegister(std::string, bool, int)’:
cuobjdumpInstList.cc:508:21: warning: format not a string literal and no format arguments [-Wformat-security]
  508 |   printf(reg.c_str());
      |                     ^
gpu-sim.cc:83: warning: "MAX" redefined
   83 | #define MAX(a, b) (((a) > (b)) ? (a) : (b))
      |
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/cacti_interface.h:42,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/area.h:37,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/parameter.h:37,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/basic_components.h:37,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/array.h:37,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/processor.h:44,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/gpgpu_sim_wrapper.h:41,
                 from power_interface.h:38,
                 from gpu-sim.cc:72:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch/cacti/const.h:105: note: this is the location of the previous definition
  105 | #define MAX(a,b) (((a)>(b))?(a):(b))
      |
In file included from ../abstract_hardware_model.h:217,
                 from gpu-cache.h:36,
                 from gpu-cache.cc:32:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
cuobjdump_to_ptxplus.cc: In function ‘void output(const char*)’:
cuobjdump_to_ptxplus.cc:57:27: warning: format not a string literal and no format arguments [-Wformat-security]
   57 |  fprintf(ptxplus_out, text);
      |                           ^
In file included from ../abstract_hardware_model.h:217,
                 from l2cache.cc:38:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-cache.h:36,
                 from gpu-cache.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-cache.h:36,
                 from gpu-cache.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from l2cache.cc:38:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram_sched.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from l2cache.cc:38:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from mem_latency_stat.cc:31:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram_sched.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram_sched.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_latency_stat.cc:31:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_latency_stat.cc:31:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from hashing.cc:6:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from addrdec.h:37,
                 from addrdec.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from dram_sched.h:36,
                 from dram_sched.cc:29:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from mem_fetch.h:33,
                 from mem_fetch.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from gpu-cache.cc:34:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from addrdec.h:37,
                 from addrdec.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:44,
                 from l2cache.cc:43:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from addrdec.h:37,
                 from addrdec.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_fetch.h:33,
                 from mem_fetch.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_fetch.h:33,
                 from mem_fetch.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from stack.h:32,
                 from stack.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from mem_fetch.h:33,
                 from traffic_breakdown.cc:2:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from hashing.cc:6:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from hashing.cc:6:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from shader.h:50,
                 from shader.cc:32:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from gpu-sim.cc:32:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from stat-tool.h:32,
                 from stat-tool.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from scoreboard.h:38,
                 from scoreboard.cc:29:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from stack.h:32,
                 from stack.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_fetch.h:33,
                 from traffic_breakdown.cc:2:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from mem_fetch.h:33,
                 from traffic_breakdown.cc:2:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stack.h:32,
                 from stack.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram.cc:33:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from mem_latency_stat.cc:36:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../src/cuda-sim/cuda-sim.h:36,
                 from gpgpu_context.h:3,
                 from cuda_runtime_api.cc:136:
../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../src/cuda-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../src/cuda-sim/cuda-sim.h:36,
                 from gpgpu_context.h:3,
                 from cuda_runtime_api.cc:136:
../src/cuda-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../src/cuda-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from mem_fetch.h:33,
                 from local_interconnect.cc:38:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from power_interface.h:34,
                 from power_interface.cc:32:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from power_stat.h:36,
                 from power_stat.cc:31:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram.cc:33:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from scoreboard.h:38,
                 from scoreboard.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stat-tool.h:32,
                 from stat-tool.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from dram_sched.h:36,
                 from dram.cc:33:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from shader.h:50,
                 from shader.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from gpu-sim.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from scoreboard.h:38,
                 from scoreboard.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stat-tool.h:32,
                 from stat-tool.cc:29:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from shader.h:50,
                 from shader.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from gpu-sim.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from ../abstract_hardware_model.h:217,
                 from gpu-sim.h:39,
                 from visualizer.cc:32:
../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from mem_fetch.cc:30:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from addrdec.cc:33:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from ../src/cuda-sim/cuda-sim.h:37,
                 from gpgpu_context.h:3,
                 from cuda_runtime_api.cc:136:
../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from mem_fetch.h:33,
                 from local_interconnect.cc:38:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from power_interface.h:34,
                 from power_interface.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from mem_fetch.h:33,
                 from local_interconnect.cc:38:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from power_interface.h:34,
                 from power_interface.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
local_interconnect.cc: In member function ‘void xbar_router::RR_Advance()’:
local_interconnect.cc:152:16: warning: operation on ‘((xbar_router*)this)->xbar_router::next_node_id’ may be undefined [-Wsequence-point]
  152 |   next_node_id = (++next_node_id % total_nodes);
      |   ~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
local_interconnect.cc:162:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  162 |     printf("%d : cycle %d : conflicts = %d\n", m_id, cycles, conflict_sub);
      |                        ~^                            ~~~~~~
      |                         |                            |
      |                         int                          long long unsigned int
      |                        %lld
local_interconnect.cc:163:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  163 |     printf("%d : cycle %d : passing reqs = %d\n", m_id, cycles, reqs);
      |                        ~^                               ~~~~~~
      |                         |                               |
      |                         int                             long long unsigned int
      |                        %lld
local_interconnect.cc: In member function ‘void xbar_router::iSLIP_Advance()’:
local_interconnect.cc:220:35: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  220 |               printf("%d : cycle %d : send req from %d to %d\n", m_id, cycles,
      |                                  ~^                                    ~~~~~~
      |                                   |                                    |
      |                                   int                                  long long unsigned int
      |                                  %lld
local_interconnect.cc:231:41: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  231 |                     printf("%d : cycle %d : cannot send req from %d to %d\n",
      |                                        ~^
      |                                         |
      |                                         int
      |                                        %lld
  232 |                            m_id, cycles, node_id2, i - _n_shader);
      |                                  ~~~~~~
      |                                  |
      |                                  long long unsigned int
local_interconnect.cc:251:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  251 |     printf("%d : cycle %d : grant_cycles = %d\n", m_id, cycles, grant_cycles);
      |                        ~^                               ~~~~~~
      |                         |                               |
      |                         int                             long long unsigned int
      |                        %lld
local_interconnect.cc:259:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  259 |     printf("%d : cycle %d : conflicts = %d\n", m_id, cycles, conflict_sub);
      |                        ~^                            ~~~~~~
      |                         |                            |
      |                         int                          long long unsigned int
      |                        %lld
local_interconnect.cc:260:25: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  260 |     printf("%d : cycle %d : passing reqs = %d\n", m_id, cycles, reqs);
      |                        ~^                               ~~~~~~
      |                         |                               |
      |                         int                             long long unsigned int
      |                        %lld
In file included from gpu-sim.h:39,
                 from power_stat.h:36,
                 from power_stat.cc:31:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:44,
                 from dram_sched.h:36,
                 from dram.cc:33:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from gpu-sim.h:39,
                 from power_stat.h:36,
                 from power_stat.cc:31:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpu-sim.h:39,
                 from visualizer.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from hashing.h:4,
                 from addrdec.cc:34:
addrdec.cc: In member function ‘void linear_to_raw_address_translation::sweep_test() const’:
addrdec.cc:522:28: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
  522 |       assert((int)tlx.chip < m_n_channel);
      |              ~~~~~~~~~~~~~~^~~~~~~~~~~~~
addrdec.cc: In function ‘unsigned int next_powerOf2(unsigned int)’:
addrdec.cc:587:16: warning: suggest parentheses around ‘-’ in operand of ‘&’ [-Wparentheses]
  587 |   while (n & n - 1) n = n & (n - 1);  // unset rightmost bit
      |              ~~^~~
g++ -ggdb -fPIC -Wall -Wno-unused-function -Wno-sign-compare  -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInst.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdumpInstList.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/cuobjdump_to_ptxplus.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/ptx.tab.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/lex.ptx_.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/sass_parser.o  /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/elf_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuobjdump_to_ptxplus/header_lexer.o
In file included from gpu-sim.h:39,
                 from visualizer.cc:32:
../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from gpu-cache.cc:32:
gpu-cache.h: In member function ‘unsigned int sector_cache_block::get_sector_index(mem_access_sector_mask_t)’:
gpu-cache.h:502:3: warning: control reaches end of non-void function [-Wreturn-type]
  502 |   }
      |   ^
In file included from gpu-sim.h:44,
                 from gpu-sim.cc:32:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../../libcuda/gpgpu_context.h:3,
                 from stat-tool.cc:40:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/cuobjdump_to_ptxplus'
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
../../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from power_interface.h:34,
                 from power_interface.cc:32:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from shader.cc:32:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from power_stat.h:36,
                 from power_stat.cc:31:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from gpu-sim.h:44,
                 from visualizer.cc:32:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
cuda_runtime_api.cc: In function ‘cudaError_t cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlagsInternal(int*, const char*, int, size_t, unsigned int, gpgpu_context*)’:
cuda_runtime_api.cc:1413:18: warning: format ‘%d’ expects argument of type ‘int’, but argument 4 has type ‘size_t’ {aka ‘long unsigned int’} [-Wformat=]
 1413 |       "SMemSize=%d\n",
      |                 ~^
      |                  |
      |                  int
      |                 %ld
 1414 |       hostFunc, blockSize, dynamicSMemSize);
      |                            ~~~~~~~~~~~~~~~
      |                            |
      |                            size_t {aka long unsigned int}
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
dram.cc: In member function ‘unsigned int dram_t::get_bankgrp_number(unsigned int)’:
dram.cc:884:1: warning: control reaches end of non-void function [-Wreturn-type]
  884 | }
      | ^
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
cuda_runtime_api.cc: In function ‘unsigned int __cudaPushCallConfiguration(dim3, dim3, size_t, CUstream_st*)’:
cuda_runtime_api.cc:3599:1: warning: no return statement in function returning non-void [-Wreturn-type]
 3599 | }
      | ^
mem_fetch.cc: In member function ‘void mem_fetch::print(FILE*, bool) const’:
mem_fetch.cc:90:3: warning: nonnull argument ‘this’ compared to NULL [-Wnonnull-compare]
   90 |   if (this == NULL) {
      |   ^~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from scoreboard.cc:31:
shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
power_interface.cc: In function ‘void calculate_hw_mcpat(const gpgpu_sim_config&, const shader_core_config*, gpgpu_sim_wrapper*, power_stat_t*, unsigned int, unsigned int, unsigned int, unsigned int, unsigned int, int, bool, char*, char*, std::string, const bool*, bool)’:
power_interface.cc:269:3: warning: this ‘if’ clause does not guard... [-Wmisleading-indentation]
  269 |   if((power_simulation_mode == 2) && (accelwattch_hybrid_configuration[HW_L1_WM]))
      |   ^~
power_interface.cc:272:5: note: ...this statement, but the latter is misleadingly indented as if it were guarded by the ‘if’
  272 |     if(aggregate_power_stats){
      |     ^~
shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
stat-tool.cc: In member function ‘void thread_insn_span::print_span(FILE*) const’:
stat-tool.cc:522:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘long long unsigned int’ [-Wformat=]
  522 |     fprintf(fout, "%d ", i_sc->first);
      |                    ~^    ~~~~~~~~~~~
      |                     |          |
      |                     int        long long unsigned int
      |                    %lld
shader.cc: In member function ‘void shader_core_ctx::create_front_pipeline()’:
shader.cc:118:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  118 |   for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
      |                   ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:126:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  126 |   for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
      |                   ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:151:23: warning: comparison of integer expressions of different signedness: ‘int’ and ‘std::vector<specialized_unit_params>::size_type’ {aka ‘long unsigned int’} [-Wsign-compare]
  151 |     for (int j = 0; j < m_config->m_specialized_unit.size(); j++) {
      |                     ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc: In member function ‘void shader_core_ctx::create_exec_pipeline()’:
shader.cc:419:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  419 |   for (unsigned k = 0; k < m_config->gpgpu_num_sp_units; k++) {
      |                        ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:425:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  425 |   for (unsigned k = 0; k < m_config->gpgpu_num_dp_units; k++) {
      |                        ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:430:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  430 |   for (unsigned k = 0; k < m_config->gpgpu_num_int_units; k++) {
      |                        ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:436:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  436 |   for (unsigned k = 0; k < m_config->gpgpu_num_sfu_units; k++) {
      |                        ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc:442:26: warning: comparison of integer expressions of different signedness: ‘unsigned int’ and ‘const int’ [-Wsign-compare]
  442 |   for (unsigned k = 0; k < m_config->gpgpu_num_tensor_core_units; k++) {
      |                        ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from gpu-sim.cc:43:
gpu-sim.cc: In member function ‘bool shader_core_ctx::occupy_shader_resource_1block(kernel_info_t&, bool)’:
shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
   41 |    (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
      |     ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
   47 |     if (SHADER_DTRACE(x)) {                                   \
      |         ^~~~~~~~~~~~~
gpu-sim.cc:1680:5: note: in expansion of macro ‘SHADER_DPRINTF’
 1680 |     SHADER_DPRINTF(LIVENESS,
      |     ^~~~~~~~~~~~~~
shader.cc: In constructor ‘shader_core_ctx::shader_core_ctx(gpgpu_sim*, simt_core_cluster*, unsigned int, unsigned int, const shader_core_config*, const memory_config*, shader_core_stats*)’:
shader.cc:492:12: warning: unused variable ‘warp_size’ [-Wunused-variable]
  492 |   unsigned warp_size = config->warp_size;
      |            ^~~~~~~~~
gpu-sim.cc: In member function ‘void shader_core_ctx::issue_block2core(kernel_info_t&)’:
shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
   41 |    (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
      |     ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
   47 |     if (SHADER_DTRACE(x)) {                                   \
      |         ^~~~~~~~~~~~~
gpu-sim.cc:1848:3: note: in expansion of macro ‘SHADER_DPRINTF’
 1848 |   SHADER_DPRINTF(LIVENESS,
      |   ^~~~~~~~~~~~~~
In file included from gpu-sim.h:41,
                 from gpu-sim.cc:32:
gpu-sim.cc: In member function ‘void gpgpu_sim::cycle()’:
gpu-sim.cc:2203:18: warning: unknown conversion type character ‘[’ in format [-Wformat=]
 2203 |                  "uArch: inst.: %lld (ipc=%4.1f, occ=%0.4f\% [%llu / %llu]) "
      |                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 2204 |                  "sim_rate=%u (inst/sec) elapsed = %u:%u:%02u:%02u / %s",
      |                  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../trace.h:76:14: note: in definition of macro ‘DPRINTFG’
   76 |       printf(__VA_ARGS__);                                     \
      |              ^~~~~~~~~~~
gpu-sim.cc:2203:62: note: format string is defined here
 2203 |                  "uArch: inst.: %lld (ipc=%4.1f, occ=%0.4f\% [%llu / %llu]) "
      |                                                              ^
shader.cc: In member function ‘void shader_core_ctx::read_operands()’:
shader.cc:1656:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
 1656 |   for (int i = 0; i < m_config->reg_file_port_throughput; ++i)
      |                   ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc: In member function ‘mem_stage_stall_type ldst_unit::process_memory_access_queue_l1cache(l1_cache*, warp_inst_t&)’:
shader.cc:1959:23: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1959 |     for (int j = 0; j < m_config->m_L1D_config.l1_banks;
      |                     ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc: In member function ‘void ldst_unit::L1_latency_queue_cycle()’:
shader.cc:2012:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 2012 |   for (int j = 0; j < m_config->m_L1D_config.l1_banks; j++) {
      |                   ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from shader.cc:48:
shader.cc: In member function ‘void shader_core_ctx::register_cta_thread_exit(unsigned int, kernel_info_t*)’:
shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
   41 |    (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
      |     ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
   47 |     if (SHADER_DTRACE(x)) {                                   \
      |         ^~~~~~~~~~~~~
shader.cc:2828:5: note: in expansion of macro ‘SHADER_DPRINTF’
 2828 |     SHADER_DPRINTF(
      |     ^~~~~~~~~~~~~~
shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
   41 |    (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
      |     ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
   47 |     if (SHADER_DTRACE(x)) {                                   \
      |         ^~~~~~~~~~~~~
shader.cc:2835:7: note: in expansion of macro ‘SHADER_DPRINTF’
 2835 |       SHADER_DPRINTF(
      |       ^~~~~~~~~~~~~~
shader_trace.h:41:26: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
   41 |    (Trace::sampling_core == get_sid() || Trace::sampling_core == -1))
      |     ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~
shader_trace.h:47:9: note: in expansion of macro ‘SHADER_DTRACE’
   47 |     if (SHADER_DTRACE(x)) {                                   \
      |         ^~~~~~~~~~~~~
shader.cc:2853:9: note: in expansion of macro ‘SHADER_DPRINTF’
 2853 |         SHADER_DPRINTF(LIVENESS,
      |         ^~~~~~~~~~~~~~
shader.cc: In member function ‘void warp_inst_t::print(FILE*) const’:
shader.cc:3091:25: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 3091 |     fprintf(fout, "0x%04x ", pc);
      |                      ~~~^    ~~
      |                         |    |
      |                         |    address_type {aka long long unsigned int}
      |                         unsigned int
      |                      %04llx
shader.cc: In member function ‘void shader_core_ctx::display_pipeline(FILE*, int, int) const’:
shader.cc:3277:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 3277 |     fprintf(fout, "w%2u : pc = 0x%x, nbytes = %u\n",
      |                                  ~^
      |                                   |
      |                                   unsigned int
      |                                  %llx
 3278 |             m_inst_fetch_buffer.m_warp_id, m_inst_fetch_buffer.m_pc,
      |                                            ~~~~~~~~~~~~~~~~~~~~~~~~
      |                                                                |
      |                                                                address_type {aka long long unsigned int}
shader.cc: In member function ‘void shader_core_ctx::cycle()’:
shader.cc:3510:21: warning: comparison of integer expressions of different signedness: ‘int’ and ‘const unsigned int’ [-Wsign-compare]
 3510 |   for (int i = 0; i < m_config->inst_fetch_throughput; ++i) {
      |                   ~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc: In member function ‘void shd_warp_t::print(FILE*) const’:
shader.cc:3943:36: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 4 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 3943 |     fprintf(fout, "w%02u npc: 0x%04x, done:%c%c%c%c:%2u i:%u s:%u a:%u (done: ",
      |                                 ~~~^
      |                                    |
      |                                    unsigned int
      |                                 %04llx
 3944 |             m_warp_id, m_next_pc, (functional_done() ? 'f' : ' '),
      |                        ~~~~~~~~~
      |                        |
      |                        address_type {aka long long unsigned int}
shader.cc: In member function ‘bool simt_core_cluster::icnt_injection_buffer_full(unsigned int, bool)’:
shader.cc:4425:12: warning: unused variable ‘source’ [-Wunused-variable]
 4425 |   unsigned source = m_cluster_id / (m_config->n_simt_clusters/m_config->chiplet_num);
      |            ^~~~~~
cuda_runtime_api.cc: In function ‘int get_app_cuda_version()’:
cuda_runtime_api.cc:466:9: warning: ignoring return value of ‘int system(const char*)’, declared with attribute warn_unused_result [-Wunused-result]
  466 |   system(app_cuda_version_command.c_str());
      |   ~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
cuda_runtime_api.cc: In function ‘char* readfile(std::string)’:
cuda_runtime_api.cc:3237:8: warning: ignoring return value of ‘size_t fread(void*, size_t, size_t, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
 3237 |   fread(ret, 1, filesize, fp);
      |   ~~~~~^~~~~~~~~~~~~~~~~~~~~~
cuda_runtime_api.cc: In function ‘char* get_app_binary_name(std::string)’:
cuda_runtime_api.cc:451:25: warning: ‘self_exe_path’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  451 |   self_exe_path = strtok(self_exe_path, ".");
      |                   ~~~~~~^~~~~~~~~~~~~~~~~~~~
cuda_runtime_api.cc: In member function ‘void cuda_runtime_api::extract_ptx_files_using_cuobjdump(CUctx_st*)’:
cuda_runtime_api.cc:3006:30: warning: ‘%s’ directive output may be truncated writing up to 1023 bytes into a region of size 922 [-Wformat-truncation=]
 3006 |            "awk '{$1=$1}1' > %s",
      |                              ^~
 3007 |            app_binary.c_str(), ptx_list_file_name);
      |                                ~~~~~~~~~~~~~~~~~~
In file included from /usr/include/stdio.h:867,
                 from cuda_runtime_api.cc:107:
/usr/include/x86_64-linux-gnu/bits/stdio2.h:67:35: note: ‘__builtin___snprintf_chk’ output 79 or more bytes (assuming 1102) into a destination of size 1000
   67 |   return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1,
      |          ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   68 |        __bos (__s), __fmt, __va_arg_pack ());
      |        ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
cuda_runtime_api.cc: In member function ‘void gpgpu_context::cuobjdumpParseBinary(unsigned int)’:
cuda_runtime_api.cc:3499:22: warning: ‘symtab’ may be used uninitialized in this function [-Wmaybe-uninitialized]
 3499 |   api->load_constants(symtab, STATIC_ALLOC_LIMIT,
      |   ~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~
 3500 |                       context->get_device()->get_gpgpu());
      |                       ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
shader.cc: In member function ‘void opndcoll_rfu_t::init(unsigned int, shader_core_ctx*)’:
shader.cc:4033:18: warning: ‘reg_id’ may be used uninitialized in this function [-Wmaybe-uninitialized]
 4033 |     m_cu[j]->init(j, num_banks, m_bank_warp_shift, shader->get_config(), this,
      |     ~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 4034 |                   sub_core_model, reg_id, m_num_banks_per_sched);
      |                   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from shader.h:50,
                 from shader.cc:32:
../abstract_hardware_model.h: In member function ‘void opndcoll_rfu_t::allocate_cu(unsigned int)’:
../abstract_hardware_model.h:1403:27: warning: ‘reg_id’ may be used uninitialized in this function [-Wmaybe-uninitialized]
 1403 |     assert(not regs[reg_id]->empty());
      |                           ^
../abstract_hardware_model.h:1389:14: note: ‘reg_id’ was declared here
 1389 |     unsigned reg_id;
      |              ^~~~~~
echo /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
ar rcs /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/libcuda.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuda_runtime_api.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_lexer.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/cuobjdump_parser.o
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/libcuda'
g++ -m64 /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/Ucache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/XML_Parse.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/arbiter.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/area.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/array.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/bank.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_circuit.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/basic_components.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/cacti_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/component.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/core.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/crossbar.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/decoder.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/htree2.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/io.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/iocontrollers.o 
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/logic.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/main.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/memoryctrl.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/noc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/nuca.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/parameter.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/processor.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/router.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/sharedcache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/subarray.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/technology.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/uca.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/wire.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/xmlParser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/gpgpu_sim_wrapper.o -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/mcpat -lm -Wno-unknown-pragmas  -O3 -fPIC -msse2 -mfpmath=sse -DNTHREADS=4 -Icacti 
-lz  -I/usr/lib/ -I/usr/lib64/ -pthread
make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/accelwattch'
ar rcs  /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/libgpu_uarch_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o 
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
make "CREATE_LIBRARY=1" "DEBUG=0" -C ./src/intersim2
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/intersim2'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/Makefile.makedepend -I-I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/ -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/ config_utils.cpp booksim_config.cpp module.cpp buffer.cpp vc.cpp routefunc.cpp traffic.cpp flitchannel.cpp trafficmanager.cpp batchtrafficmanager.cpp packet_reply_info.cpp buffer_state.cpp stats.cpp credit.cpp outputset.cpp flit.cpp injection.cpp misc_utils.cpp rng_wrapper.cpp rng_double_wrapper.cpp power_module.cpp switch_monitor.cpp buffer_monitor.cpp main.cpp gputrafficmanager.cpp intersim_config.cpp interconnect_interface.cpp allocators/allocator.cpp allocators/islip.cpp allocators/loa.cpp allocators/maxsize.cpp allocators/pim.cpp allocators/selalloc.cpp allocators/separable.cpp allocators/separable_input_first.cpp allocators/separable_output_first.cpp allocators/wavefront.cpp arbiters/arbiter.cpp arbiters/matrix_arb.cpp arbiters/prio_arb.cpp arbiters/roundrobin_arb.cpp arbiters/tree_arb.cpp networks/anynet.cpp networks/cmesh.cpp networks/dragonfly.cpp networks/fattree.cpp networks/flatfly_onchip.cpp networks/fly.cpp networks/kncube.cpp networks/network.cpp networks/qtree.cpp networks/tree4.cpp power/buffer_monitor.cpp power/power_module.cpp power/switch_monitor.cpp routers/chaos_router.cpp routers/event_router.cpp routers/iq_router.cpp routers/router.cpp 2> /dev/null
flex -o/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c config.l
bison -y -d config.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y
bison -y -d config.y --file-prefix=/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c config_utils.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/config_utils.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c booksim_config.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/booksim_config.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c module.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/module.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c buffer.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c vc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/vc.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routefunc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/routefunc.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c traffic.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/traffic.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c flitchannel.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flitchannel.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c trafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/trafficmanager.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c batchtrafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/batchtrafficmanager.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c packet_reply_info.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/packet_reply_info.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c buffer_state.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer_state.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c stats.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/stats.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c credit.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/credit.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c outputset.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/outputset.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c flit.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flit.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c injection.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/injection.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c misc_utils.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/misc_utils.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c rng_wrapper.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/rng_wrapper.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c rng_double_wrapper.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/rng_double_wrapper.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/power_module.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/power_module.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/switch_monitor.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/switch_monitor.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c power/buffer_monitor.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/buffer_monitor.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c main.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/main.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c gputrafficmanager.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/gputrafficmanager.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c intersim_config.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/intersim_config.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c interconnect_interface.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/interconnect_interface.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/fattree.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/fattree.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/cmesh.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/cmesh.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/flatfly_onchip.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/flatfly_onchip.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/qtree.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/qtree.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/tree4.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/tree4.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/network.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/network.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/anynet.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/anynet.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/fly.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/fly.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/dragonfly.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/dragonfly.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c networks/kncube.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/kncube.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/wavefront.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/wavefront.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/islip.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/islip.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/selalloc.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/selalloc.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable_output_first.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable_output_first.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/allocator.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/allocator.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/separable_input_first.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/separable_input_first.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/pim.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/pim.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/loa.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/loa.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c allocators/maxsize.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/maxsize.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/prio_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/prio_arb.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/matrix_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/matrix_arb.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/tree_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/tree_arb.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/roundrobin_arb.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/roundrobin_arb.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c arbiters/arbiter.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/arbiter.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/router.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/event_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/event_router.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/iq_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/iq_router.o
g++ -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c routers/chaos_router.cpp -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/chaos_router.o
gcc -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.o
gcc -Wall -I. -Iarbiters -Iallocators -Irouters -Inetworks -Ipower -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src -I/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/  -DCREATE_LIBRARY -O3 -g -fPIC -I/usr/local/cuda-11.6/include -c /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y.tab.c -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/y.tab.o
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c:1194:16: warning: ‘input’ defined but not used [-Wunused-function]
 1194 |     static int input  (void)
      |                ^~~~~
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/lex.yy.c:1151:17: warning: ‘yyunput’ defined but not used [-Wunused-function]
 1151 |     static void yyunput (int c, char * yy_bp )
      |                 ^~~~~~~
vc.cpp: In member function ‘void VC::AddFlit(Flit*)’:
vc.cpp:86:15: warning: comparison of integer expressions of different signedness: ‘long long unsigned int’ and ‘int’ [-Wsign-compare]
   86 |     if(f->pid != _expected_pid) {
      |        ~~~~~~~^~~~~~~~~~~~~~~~
networks/qtree.cpp: In member function ‘virtual void QTree::_BuildNet(const Configuration&, int)’:
networks/qtree.cpp:143:12: warning: ‘r’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  143 |  _routers[r]->AddInputChannel( _chan[c],
      |            ^
networks/dragonfly.cpp: In function ‘int dragonfly_port(int, int, int)’:
networks/dragonfly.cpp:114:7: warning: variable ‘group_dest’ set but not used [-Wunused-but-set-variable]
  114 |   int group_dest=-1;
      |       ^~~~~~~~~~
networks/dragonfly.cpp: In member function ‘virtual void DragonFlyNew::_BuildNet(const Configuration&, int)’:
networks/dragonfly.cpp:362:9: warning: variable ‘_grp_num_routers’ set but not used [-Wunused-but-set-variable]
  362 |     int _grp_num_routers;
      |         ^~~~~~~~~~~~~~~~
networks/dragonfly.cpp:364:9: warning: variable ‘grp_ID2’ set but not used [-Wunused-but-set-variable]
  364 |     int grp_ID2;
      |         ^~~~~~~
networks/dragonfly.cpp:227:7: warning: variable ‘_dim_size’ set but not used [-Wunused-but-set-variable]
  227 |   int _dim_size=-1;
      |       ^~~~~~~~~
networks/dragonfly.cpp: In function ‘void ugal_dragonflynew(const Router*, const Flit*, int, OutputSet*, int, bool)’:
networks/dragonfly.cpp:501:23: warning: variable ‘min_hopcnt’ set but not used [-Wunused-but-set-variable]
  501 |   int min_queue_size, min_hopcnt;
      |                       ^~~~~~~~~~
networks/dragonfly.cpp:502:26: warning: variable ‘nonmin_hopcnt’ set but not used [-Wunused-but-set-variable]
  502 |   int nonmin_queue_size, nonmin_hopcnt;
      |                          ^~~~~~~~~~~~~
networks/kncube.cpp: In member function ‘virtual void KNCube::InsertRandomFaults(const Configuration&)’:
networks/kncube.cpp:305:22: warning: ‘chan’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  305 |       OutChannelFault( node, chan );
      |       ~~~~~~~~~~~~~~~^~~~~~~~~~~~~~
networks/flatfly_onchip.cpp: In function ‘int find_distance(int, int, int)’:
networks/flatfly_onchip.cpp:1212:7: warning: variable ‘_dim_size’ set but not used [-Wunused-but-set-variable]
 1212 |   int _dim_size;
      |       ^~~~~~~~~
networks/anynet.cpp: In member function ‘void AnyNet::readFile()’:
networks/anynet.cpp:495:22: warning: comparison of integer expressions of different signedness: ‘__gnu_cxx::__alloc_traits<std::allocator<int>, int>::value_type’ {aka ‘int’} and ‘size_t’ {aka ‘long unsigned int’} [-Wsign-compare]
  495 |     if(node_check[i] != i){
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:217,
                 from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
                 from interconnect_interface.cpp:41:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../stream_manager.h: In member function ‘bool CUevent_st::done() const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
main.cpp: In function ‘int GetSimTime(int)’:
main.cpp:92:1: warning: control reaches end of non-void function [-Wreturn-type]
   92 | }
      | ^
main.cpp: In function ‘Stats* GetStats(const string&, int)’:
main.cpp:111:1: warning: control reaches end of non-void function [-Wreturn-type]
  111 | }
      | ^
networks/tree4.cpp: In member function ‘int Tree4::_WireLatency(int, int, int, int)’:
networks/tree4.cpp:290:10: warning: ‘L’ may be used uninitialized in this function [-Wmaybe-uninitialized]
  290 |   return L;
      |          ^
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
                 from interconnect_interface.cpp:41:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/mem_fetch.h:33,
                 from interconnect_interface.cpp:41:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim/../abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/intersim2'
make -C ./src/ depend
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/ abstract_hardware_model.cc debug.cc gpgpusim_entrypoint.cc option_parser.cc statwrapper.cc stream_manager.cc trace.cc 2> /dev/null
make[2]: 'depend' is up to date.
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
make -C ./src/
make[2]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/ abstract_hardware_model.cc debug.cc gpgpusim_entrypoint.cc option_parser.cc statwrapper.cc stream_manager.cc trace.cc 2> /dev/null
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/abstract_hardware_model.o -c abstract_hardware_model.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/debug.o -c debug.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpusim_entrypoint.o -c gpgpusim_entrypoint.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/option_parser.o -c option_parser.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/statwrapper.o -c statwrapper.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/stream_manager.o -c stream_manager.cc
g++  -O3 -g3 -fPIC -Wall -DDEBUG -DCUDART_VERSION=11060 -std=c++0x -DTRACING_ON=1 -I/usr/local/cuda-11.6/include -o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/trace.o -c trace.cc
make   -C ./gpgpu-sim
make[3]: Entering directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
touch /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend
makedepend -f/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/Makefile.makedepend -p/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/ addrdec.cc dram.cc dram_sched.cc gpu-cache.cc gpu-misc.cc gpu-sim.cc hashing.cc histogram.cc icnt_wrapper.cc l2cache.cc local_interconnect.cc mem_fetch.cc mem_latency_stat.cc power_interface.cc power_stat.cc scoreboard.cc shader.cc stack.cc stat-tool.cc traffic_breakdown.cc visualizer.cc 2> /dev/null
ar rcs  /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/libgpu_uarch_sim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/addrdec.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/dram_sched.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-misc.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/gpu-sim.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/hashing.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/histogram.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/icnt_wrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/l2cache.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/local_interconnect.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_fetch.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/mem_latency_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_interface.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/power_stat.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/scoreboard.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/shader.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stack.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/stat-tool.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/traffic_breakdown.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/visualizer.o
make[3]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src/gpgpu-sim'
In file included from abstract_hardware_model.h:217,
                 from abstract_hardware_model.cc:32:
stream_manager.h: In member function ‘bool CUevent_st::done() const’:
stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from abstract_hardware_model.h:217,
                 from debug.h:32,
                 from debug.cc:29:
stream_manager.h: In member function ‘bool CUevent_st::done() const’:
stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from abstract_hardware_model.cc:32:
abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from debug.h:32,
                 from debug.cc:29:
abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from abstract_hardware_model.cc:32:
abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from abstract_hardware_model.h:217,
                 from gpgpusim_entrypoint.h:35,
                 from gpgpusim_entrypoint.cc:29:
stream_manager.h: In member function ‘bool CUevent_st::done() const’:
stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from debug.h:32,
                 from debug.cc:29:
abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpgpusim_entrypoint.h:35,
                 from gpgpusim_entrypoint.cc:29:
abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from gpgpusim_entrypoint.h:35,
                 from gpgpusim_entrypoint.cc:29:
abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stream_manager.h:35,
                 from stream_manager.cc:29:
abstract_hardware_model.h: In member function ‘virtual void inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:966:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
  966 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stream_manager.h:35,
                 from stream_manager.cc:29:
abstract_hardware_model.h: In member function ‘virtual void warp_inst_t::print_insn(FILE*) const’:
abstract_hardware_model.h:1160:35: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1160 |     fprintf(fp, " [inst @ pc=0x%04x] ", pc);
      |                                ~~~^     ~~
      |                                   |     |
      |                                   |     address_type {aka long long unsigned int}
      |                                   unsigned int
      |                                %04llx
In file included from stream_manager.cc:29:
stream_manager.h: In member function ‘bool CUevent_st::done() const’:
stream_manager.h:67:40: warning: comparison of integer expressions of different signedness: ‘const int’ and ‘const unsigned int’ [-Wsign-compare]
   67 |   bool done() const { return m_updates == m_issued; }
      |                              ~~~~~~~~~~^~~~~~~~~~~
In file included from cuda-sim/cuda-sim.h:37,
                 from debug.cc:30:
cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../libcuda/gpgpu_context.h:3,
                 from gpgpusim_entrypoint.cc:32:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../libcuda/gpgpu_context.h:3,
                 from abstract_hardware_model.cc:37:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
In file included from ../libcuda/../src/cuda-sim/cuda-sim.h:37,
                 from ../libcuda/gpgpu_context.h:3,
                 from stream_manager.cc:30:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In constructor ‘scheduler_unit::scheduler_unit(shader_core_stats*, shader_core_ctx*, Scoreboard*, simt_stack**, std::vector<shd_warp_t*>*, register_set*, register_set*, register_set*, register_set*, register_set*, std::vector<register_set*>&, register_set*, int)’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:450:32: warning: ‘scheduler_unit::m_spec_cores_out’ will be initialized after [-Wreorder]
  450 |   std::vector<register_set *> &m_spec_cores_out;
      |                                ^~~~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:449:17: warning:   ‘register_set* scheduler_unit::m_mem_out’ [-Wreorder]
  449 |   register_set *m_mem_out;
      |                 ^~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:344:3: warning:   when initialized here [-Wreorder]
  344 |   scheduler_unit(shader_core_stats *stats, shader_core_ctx *shader,
      |   ^~~~~~~~~~~~~~
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h: In member function ‘virtual bool specialized_unit::can_issue(const warp_inst_t&) const’:
../libcuda/../src/cuda-sim/../gpgpu-sim/shader.h:1298:17: warning: comparison of integer expressions of different signedness: ‘int’ and ‘unsigned int’ [-Wsign-compare]
 1298 |     if (inst.op != m_supported_op) {
      |         ~~~~~~~~^~~~~~~~~~~~~~~~~
abstract_hardware_model.cc: In member function ‘void simt_stack::print(FILE*) const’:
abstract_hardware_model.cc:1009:30: warning: format ‘%x’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1009 |     fprintf(fout, " pc: 0x%03x", stack_entry.m_pc);
      |                           ~~~^   ~~~~~~~~~~~~~~~~
      |                              |               |
      |                              unsigned int    address_type {aka long long unsigned int}
      |                           %03llx
abstract_hardware_model.cc:1015:29: warning: format ‘%u’ expects argument of type ‘unsigned int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1015 |       fprintf(fout, " rp: %4u tp: %s cd: %2u ", stack_entry.m_recvg_pc,
      |                           ~~^                   ~~~~~~~~~~~~~~~~~~~~~~
      |                             |                               |
      |                             unsigned int                    address_type {aka long long unsigned int}
      |                           %4llu
abstract_hardware_model.cc: In member function ‘void simt_stack::print_checkpoint(FILE*) const’:
abstract_hardware_model.cc:1035:21: warning: format ‘%d’ expects argument of type ‘int’, but argument 3 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1035 |     fprintf(fout, "%d %d %d %lld %d ", stack_entry.m_pc,
      |                    ~^                  ~~~~~~~~~~~~~~~~
      |                     |                              |
      |                     int                            address_type {aka long long unsigned int}
      |                    %lld
abstract_hardware_model.cc:1035:27: warning: format ‘%d’ expects argument of type ‘int’, but argument 5 has type ‘address_type’ {aka ‘long long unsigned int’} [-Wformat=]
 1035 |     fprintf(fout, "%d %d %d %lld %d ", stack_entry.m_pc,
      |                          ~^
      |                           |
      |                           int
      |                          %lld
 1036 |             stack_entry.m_calldepth, stack_entry.m_recvg_pc,
      |                                      ~~~~~~~~~~~~~~~~~~~~~~
      |                                                  |
      |                                                  address_type {aka long long unsigned int}
stream_manager.cc: In member function ‘void stream_operation::print(FILE*) const’:
stream_manager.cc:205:10: warning: enumeration value ‘stream_wait_event’ not handled in switch [-Wswitch]
  205 |   switch (m_type) {
      |          ^
debug.cc: In member function ‘void gpgpu_sim::gpgpu_debug()’:
debug.cc:127:10: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  127 |     fgets(line, 1024, stdin);
      |     ~~~~~^~~~~~~~~~~~~~~~~~~
debug.cc:139:12: warning: ignoring return value of ‘char* fgets(char*, int, FILE*)’, declared with attribute warn_unused_result [-Wunused-result]
  139 |       fgets(line, 1024, stdin);
      |       ~~~~~^~~~~~~~~~~~~~~~~~~
abstract_hardware_model.cc: In member function ‘void checkpoint::load_global_mem(memory_space*, char*)’:
abstract_hardware_model.cc:98:14: warning: ‘offset’ may be used uninitialized in this function [-Wmaybe-uninitialized]
   98 |       offset = offset + 4;
      |       ~~~~~~~^~~~~~~~~~~~
ar rcs  /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libgpgpusim.a /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/abstract_hardware_model.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/debug.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpusim_entrypoint.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/option_parser.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/statwrapper.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/stream_manager.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/trace.o /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/*.o
make[2]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/src'
g++ -shared -Wl,-soname,libcudart.so -Wl,--version-script=linux-so-version.txt\
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/libcuda/*.o \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/*.o \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/cuda-sim/decuda_pred_table/*.o \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/gpgpu-sim/*.o \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/intersim2/*.o \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/*.o -lm -lz -lGL -pthread \
                /home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/build/gcc-9.4.0/cuda-11060/release/accelwattch/*.o \
                -o lib/gcc-9.4.0/cuda-11060/release/libcudart.so
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.2 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.2; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.3 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.3; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.4 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.4; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.5.5; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.6.5; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.5 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.7.5; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.8.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.8.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.1 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.1; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.2 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.9.2; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.0; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.1 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.10.1; fi
if [ ! -f lib/gcc-9.4.0/cuda-11060/release/libcudart.so.11.0 ]; then ln -s libcudart.so lib/gcc-9.4.0/cuda-11060/release/libcudart.so.11.0; fi
make[1]: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim'
g++ -std=c++0x -o ./bin/release/accel-sim.out  -L/home/mnaderan/accelsim-chiplet/gpu-simulator/gpgpu-sim/lib/gcc-9.4.0/cuda-11060/release/ -lcudart -lm -lz -lGL -pthread ./build/release/*.o
make: Leaving directory '/home/mnaderan/accelsim-chiplet/gpu-simulator'
mnaderan@rtx3080:accelsim-chiplet$ cd test/
mnaderan@rtx3080:test$ sh runme.sh
Accel-Sim [build accelsim-commit-9e19b6621cc94d89d8f7b18e65d802ad6979c195_modified_0.0]

        *** GPGPU-Sim Simulator Version 4.2.0  [build gpgpu-sim_git-commit-60af80a7d140a30c781ca485707a1d4dbb8031fa_modified_0.0] ***


GPGPU-Sim: Configuration options:

-save_embedded_ptx                      0 # saves ptx files embedded in binary as <n>.ptx
-keep                                   0 # keep intermediate files created by GPGPU-Sim when interfacing with external programs
-gpgpu_ptx_save_converted_ptxplus                    0 # Saved converted ptxplus to a file
-gpgpu_occupancy_sm_number                   86 # The SM number to pass to ptxas when getting register usage for computing GPU occupancy. This parameter is required in the config.
-ptx_opcode_latency_int           4,4,4,4,21 # Opcode latencies for integers <ADD,MAX,MUL,MAD,DIV,SHFL>Default 1,1,19,25,145,32
-ptx_opcode_latency_fp           4,4,4,4,39 # Opcode latencies for single precision floating points <ADD,MAX,MUL,MAD,DIV>Default 1,1,1,1,30
-ptx_opcode_latency_dp      64,64,64,64,330 # Opcode latencies for double precision floating points <ADD,MAX,MUL,MAD,DIV>Default 8,8,8,8,335
-ptx_opcode_latency_sfu                   21 # Opcode latencies for SFU instructionsDefault 8
-ptx_opcode_latency_tesnor                   64 # Opcode latencies for Tensor instructionsDefault 64
-ptx_opcode_initiation_int            2,2,2,2,2 # Opcode initiation intervals for integers <ADD,MAX,MUL,MAD,DIV,SHFL>Default 1,1,4,4,32,4
-ptx_opcode_initiation_fp            1,1,1,1,2 # Opcode initiation intervals for single precision floating points <ADD,MAX,MUL,MAD,DIV>Default 1,1,1,1,5
-ptx_opcode_initiation_dp      64,64,64,64,130 # Opcode initiation intervals for double precision floating points <ADD,MAX,MUL,MAD,DIV>Default 8,8,8,8,130
-ptx_opcode_initiation_sfu                    8 # Opcode initiation intervals for sfu instructionsDefault 8
-ptx_opcode_initiation_tensor                   64 # Opcode initiation intervals for tensor instructionsDefault 64
-cdp_latency         7200,8000,100,12000,1600 # CDP API latency <cudaStreamCreateWithFlags, cudaGetParameterBufferV2_init_perWarp, cudaGetParameterBufferV2_perKernel, cudaLaunchDeviceV2_init_perWarp, cudaLaunchDevicV2_perKernel>Default 7200,8000,100,12000,1600
-network_mode                           1 # Interconnection network mode
-inter_config_file   config_ampere_islip.icnt # Interconnection network config file
-inter_config_file_chLet config_ampere_islip_ch.icnt # Interconnection network config file
-icnt_in_buffer_limit                  512 # in_buffer_limit
-icnt_out_buffer_limit                  512 # out_buffer_limit
-icnt_subnets                           2 # subnets
-icnt_arbiter_algo                      1 # arbiter_algo
-icnt_verbose                           0 # inct_verbose
-icnt_grant_cycles                      1 # grant_cycles
-gpgpu_ptx_use_cuobjdump                    1 # Use cuobjdump to extract ptx and sass from binaries
-gpgpu_experimental_lib_support                    0 # Try to extract code from cuda libraries [Broken because of unknown cudaGetExportTable]
-checkpoint_option                      0 #  checkpointing flag (0 = no checkpoint)
-checkpoint_kernel                      1 #  checkpointing during execution of which kernel (1- 1st kernel)
-checkpoint_CTA                         0 #  checkpointing after # of CTA (< less than total CTA)
-resume_option                          0 #  resume flag (0 = no resume)
-resume_kernel                          0 #  Resume from which kernel (1= 1st kernel)
-resume_CTA                             0 #  resume from which CTA
-checkpoint_CTA_t                       0 #  resume from which CTA
-checkpoint_insn_Y                      0 #  resume from which CTA
-gpgpu_ptx_convert_to_ptxplus                    0 # Convert SASS (native ISA) to ptxplus and run ptxplus
-gpgpu_ptx_force_max_capability                   86 # Force maximum compute capability
-gpgpu_ptx_inst_debug_to_file                    0 # Dump executed instructions' debug information to file
-gpgpu_ptx_inst_debug_file       inst_debug.txt # Executed instructions' debug output file
-gpgpu_ptx_inst_debug_thread_uid                    1 # Thread UID for executed instructions' debug output
-gpgpu_simd_model                       1 # 1 = post-dominator
-gpgpu_shader_core_pipeline              1536:32 # shader core pipeline config, i.e., {<nthread>:<warpsize>}
-gpgpu_tex_cache:l1  N:4:128:256,L:R:m:N:L,T:512:8,128:2 # per-shader L1 texture cache  (READ-ONLY) config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>:<rf>}
-gpgpu_const_cache:l1 N:128:64:8,L:R:f:N:L,S:2:64,4 # per-shader L1 constant memory cache  (READ-ONLY) config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
-gpgpu_cache:il1     N:64:128:16,L:R:f:N:L,S:2:48,4 # shader L1 instruction cache config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
-gpgpu_cache:dl1     S:4:128:256,L:T:m:L:L,A:384:48,16:0,32 # per-shader L1 data cache config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
-gpgpu_l1_cache_write_ratio                   25 # L1D write ratio
-gpgpu_l1_banks                         4 # The number of L1 cache banks
-gpgpu_l1_banks_byte_interleaving                   32 # l1 banks byte interleaving granularity
-gpgpu_l1_banks_hashing_function                    0 # l1 banks hashing function
-gpgpu_l1_latency                      39 # L1 Hit Latency
-gpgpu_smem_latency                    29 # smem Latency
-gpgpu_cache:dl1PrefL1                 none # per-shader L1 data cache config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
-gpgpu_cache:dl1PrefShared                 none # per-shader L1 data cache config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq> | none}
-gpgpu_gmem_skip_L1D                    0 # global memory access skip L1D cache (implements -Xptxas -dlcm=cg, default=no skip)
-gpgpu_perfect_mem                      0 # enable perfect memory mode (no cache miss)
-n_regfile_gating_group                    4 # group of lanes that should be read/written together)
-gpgpu_clock_gated_reg_file                    0 # enable clock gated reg file for power calculations
-gpgpu_clock_gated_lanes                    0 # enable clock gated lanes for power calculations
-gpgpu_shader_registers                65536 # Number of registers per shader core. Limits number of concurrent CTAs. (default 8192)
-gpgpu_registers_per_block                65536 # Maximum number of registers per CTA. (default 8192)
-gpgpu_ignore_resources_limitation                    0 # gpgpu_ignore_resources_limitation (default 0)
-gpgpu_shader_cta                      32 # Maximum number of concurrent CTAs in shader (default 32)
-gpgpu_num_cta_barriers                   16 # Maximum number of named barriers per CTA (default 16)
-gpgpu_n_chiplets                      16 # number of chiplets
-gpgpu_n_clusters                      32 # number of processing clusters
-gpgpu_n_cores_per_cluster                    1 # number of simd cores per cluster
-gpgpu_n_cluster_ejection_buffer_size                   32 # number of packets in ejection buffer
-gpgpu_n_ldst_response_buffer_size                    2 # number of response packets in ld/st unit ejection buffer
-gpgpu_shmem_per_block                49152 # Size of shared memory per thread block or CTA (default 48kB)
-gpgpu_shmem_size                  102400 # Size of shared memory per shader core (default 16kB)
-gpgpu_shmem_option      0,8,16,32,64,100 # Option list of shared memory sizes
-gpgpu_unified_l1d_size                  128 # Size of unified data cache(L1D + shared memory) in KB
-gpgpu_adaptive_cache_config                    1 # adaptive_cache_config
-gpgpu_shmem_sizeDefault               102400 # Size of shared memory per shader core (default 16kB)
-gpgpu_shmem_size_PrefL1                16384 # Size of shared memory per shader core (default 16kB)
-gpgpu_shmem_size_PrefShared                16384 # Size of shared memory per shader core (default 16kB)
-gpgpu_shmem_num_banks                   32 # Number of banks in the shared memory in each shader core (default 16)
-gpgpu_shmem_limited_broadcast                    0 # Limit shared memory to do one broadcast per cycle (default on)
-gpgpu_shmem_warp_parts                    1 # Number of portions a warp is divided into for shared memory bank conflict check
-gpgpu_mem_unit_ports                    1 # The number of memory transactions allowed per core cycle
-gpgpu_shmem_warp_parts                    1 # Number of portions a warp is divided into for shared memory bank conflict check
-gpgpu_warpdistro_shader                   -1 # Specify which shader core to collect the warp size distribution from
-gpgpu_warp_issue_shader                    0 # Specify which shader core to collect the warp issue distribution from
-gpgpu_local_mem_map                    1 # Mapping from local memory space address to simulated GPU physical address space (default = enabled)
-gpgpu_num_reg_banks                    8 # Number of register banks (default = 8)
-gpgpu_reg_bank_use_warp_id                    0 # Use warp ID in mapping registers to banks (default = off)
-gpgpu_sub_core_model                    1 # Sub Core Volta/Pascal model (default = off)
-gpgpu_enable_specialized_operand_collector                    0 # enable_specialized_operand_collector
-gpgpu_operand_collector_num_units_sp                    4 # number of collector units (default = 4)
-gpgpu_operand_collector_num_units_dp                    0 # number of collector units (default = 0)
-gpgpu_operand_collector_num_units_sfu                    4 # number of collector units (default = 4)
-gpgpu_operand_collector_num_units_int                    0 # number of collector units (default = 0)
-gpgpu_operand_collector_num_units_tensor_core                    4 # number of collector units (default = 4)
-gpgpu_operand_collector_num_units_mem                    2 # number of collector units (default = 2)
-gpgpu_operand_collector_num_units_gen                    8 # number of collector units (default = 0)
-gpgpu_operand_collector_num_in_ports_sp                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_in_ports_dp                    0 # number of collector unit in ports (default = 0)
-gpgpu_operand_collector_num_in_ports_sfu                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_in_ports_int                    0 # number of collector unit in ports (default = 0)
-gpgpu_operand_collector_num_in_ports_tensor_core                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_in_ports_mem                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_in_ports_gen                    8 # number of collector unit in ports (default = 0)
-gpgpu_operand_collector_num_out_ports_sp                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_out_ports_dp                    0 # number of collector unit in ports (default = 0)
-gpgpu_operand_collector_num_out_ports_sfu                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_out_ports_int                    0 # number of collector unit in ports (default = 0)
-gpgpu_operand_collector_num_out_ports_tensor_core                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_out_ports_mem                    1 # number of collector unit in ports (default = 1)
-gpgpu_operand_collector_num_out_ports_gen                    8 # number of collector unit in ports (default = 0)
-gpgpu_coalesce_arch                   86 # Coalescing arch (GT200 = 13, Fermi = 20)
-gpgpu_num_sched_per_core                    4 # Number of warp schedulers per core
-gpgpu_max_insn_issue_per_warp                    1 # Max number of instructions that can be issued per warp in one cycle by scheduler (either 1 or 2)
-gpgpu_dual_issue_diff_exec_units                    1 # should dual issue use two different execution unit resources (Default = 1)
-gpgpu_simt_core_sim_order                    1 # Select the simulation order of cores in a cluster (0=Fix, 1=Round-Robin)
-gpgpu_pipeline_widths 4,4,4,4,4,4,4,4,4,4,8,4,4 # Pipeline widths ID_OC_SP,ID_OC_DP,ID_OC_INT,ID_OC_SFU,ID_OC_MEM,OC_EX_SP,OC_EX_DP,OC_EX_INT,OC_EX_SFU,OC_EX_MEM,EX_WB,ID_OC_TENSOR_CORE,OC_EX_TENSOR_CORE
-gpgpu_tensor_core_avail                    1 # Tensor Core Available (default=0)
-gpgpu_num_sp_units                     4 # Number of SP units (default=1)
-gpgpu_num_dp_units                     4 # Number of DP units (default=0)
-gpgpu_num_int_units                    4 # Number of INT units (default=0)
-gpgpu_num_sfu_units                    4 # Number of SF units (default=1)
-gpgpu_num_tensor_core_units                    4 # Number of tensor_core units (default=1)
-gpgpu_num_mem_units                    1 # Number if ldst units (default=1) WARNING: not hooked up to anything
-gpgpu_scheduler                      lrr # Scheduler configuration: < lrr | gto | two_level_active > If two_level_active:<num_active_warps>:<inner_prioritization>:<outer_prioritization>For complete list of prioritization values see shader.h enum scheduler_prioritization_typeDefault: gto
-gpgpu_concurrent_kernel_sm                    0 # Support concurrent kernels on a SM (default = disabled)
-gpgpu_perfect_inst_const_cache                    1 # perfect inst and const cache mode, so all inst and const hits in the cache(default = disabled)
-gpgpu_inst_fetch_throughput                    4 # the number of fetched intruction per warp each cycle
-gpgpu_reg_file_port_throughput                    2 # the number ports of the register file
-specialized_unit_1         1,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_2       1,4,200,4,4,TEX # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_3     1,4,32,4,4,TENSOR # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_4         1,4,4,4,4,UDP # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_5         0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_6         0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_7         0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-specialized_unit_8         0,4,4,4,4,BRA # specialized unit config {<enabled>,<num_units>:<latency>:<initiation>,<ID_OC_SPEC>:<OC_EX_SPEC>,<NAME>}
-gpgpu_perf_sim_memcpy                    1 # Fill the L2 cache on memcpy
-gpgpu_simple_dram_model                    0 # simple_dram_model with fixed latency and BW
-gpgpu_dram_scheduler                    1 # 0 = fifo, 1 = FR-FCFS (defaul)
-gpgpu_dram_partition_queues          64:64:64:64 # i2$:$2d:d2$:$2i
-l2_ideal                               0 # Use a ideal L2 cache that always hit
-gpgpu_cache:dl2     S:64:128:16,L:B:m:L:P,A:192:4,32:0,32 # unified banked L2 data cache config  {<nsets>:<bsize>:<assoc>,<rep>:<wr>:<alloc>:<wr_alloc>,<mshr>:<N>:<merge>,<mq>}
-gpgpu_cache:dl2_texture_only                    0 # L2 cache used for texture only
-gpgpu_n_mem                           16 # number of memory modules (e.g. memory controllers) in gpu
-gpgpu_n_sub_partition_per_mchannel                    2 # number of memory subpartition in each memory module
-gpgpu_n_mem_per_ctrlr                    1 # number of memory chips per memory controller
-gpgpu_memlatency_stat                   14 # track and display latency statistics 0x2 enables MC, 0x4 enables queue logs
-gpgpu_frfcfs_dram_sched_queue_size                   64 # 0 = unlimited (default); # entries per chip
-gpgpu_dram_return_queue_size                  192 # 0 = unlimited (default); # entries per chip
-gpgpu_dram_buswidth                    2 # default = 4 bytes (8 bytes per cycle at DDR)
-gpgpu_dram_burst_length                   16 # Burst length of each DRAM request (default = 4 data bus cycle)
-dram_data_command_freq_ratio                    4 # Frequency ratio between DRAM data bus and command bus (default = 2 times, i.e. DDR)
-gpgpu_dram_timing_opt nbk=16:CCD=4:RRD=12:RCD=24:RAS=55:RP=24:RC=78:CL=24:WL=8:CDLR=10:WR=24:nbkgrp=4:CCDL=6:RTPL=4 # DRAM timing parameters = {nbk:tCCD:tRRD:tRCD:tRAS:tRP:tRC:CL:WL:tCDLR:tWR:nbkgrp:tCCDL:tRTPL}
-gpgpu_l2_rop_latency                  187 # ROP queue latency (default 85)
-dram_latency                         254 # DRAM latency (default 30)
-dram_dual_bus_interface                    0 # dual_bus_interface (default = 0)
-dram_bnk_indexing_policy                    0 # dram_bnk_indexing_policy (0 = normal indexing, 1 = Xoring with the higher bits) (Default = 0)
-dram_bnkgrp_indexing_policy                    1 # dram_bnkgrp_indexing_policy (0 = take higher bits, 1 = take lower bits) (Default = 0)
-dram_seperate_write_queue_enable                    0 # Seperate_Write_Queue_Enable
-dram_write_queue_size             32:28:16 # Write_Queue_Size
-dram_elimnate_rw_turnaround                    0 # elimnate_rw_turnaround i.e set tWTR and tRTW = 0
-icnt_flit_size                        40 # icnt_flit_size
-gpgpu_mem_addr_mapping dramid@8;00000000.00000000.00000000.00000000.0000RRRR.RRRRRRRR.RBBBCCCC.BCCSSSSS # mapping memory address to dram model {dramid@<start bit>;<memory address map>}
-gpgpu_mem_addr_test                    0 # run sweep test to check address mapping for aliased address
-gpgpu_mem_address_mask                    1 # 0 = old addressing mask, 1 = new addressing mask, 2 = new add. mask + flipped bank sel and chip sel bits
-gpgpu_memory_partition_indexing                    2 # 0 = no indexing, 1 = bitwise xoring, 2 = IPoly, 3 = custom indexing
-accelwattch_xml_file accelwattch_sass_sim.xml # AccelWattch XML file
-power_simulation_enabled                    0 # Turn on power simulator (1=On, 0=Off)
-power_per_cycle_dump                    0 # Dump detailed power output each cycle
-hw_perf_file_name            hw_perf.csv # Hardware Performance Statistics file
-hw_perf_bench_name                       # Kernel Name in Hardware Performance Statistics file
-power_simulation_mode                    0 # Switch performance counter input for power simulation (0=Sim, 1=HW, 2=HW-Sim Hybrid)
-dvfs_enabled                           0 # Turn on DVFS for power model
-aggregate_power_stats                    0 # Accumulate power across all kernels
-accelwattch_hybrid_perfsim_L1_RH                    0 # Get L1 Read Hits for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L1_RM                    0 # Get L1 Read Misses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L1_WH                    0 # Get L1 Write Hits for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L1_WM                    0 # Get L1 Write Misses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L2_RH                    0 # Get L2 Read Hits for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L2_RM                    0 # Get L2 Read Misses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L2_WH                    0 # Get L2 Write Hits for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_L2_WM                    0 # Get L2 Write Misses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_CC_ACC                    0 # Get Constant Cache Acesses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_SHARED_ACC                    0 # Get Shared Memory Acesses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_DRAM_RD                    0 # Get DRAM Reads for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_DRAM_WR                    0 # Get DRAM Writes for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_NOC                    0 # Get Interconnect Acesses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_PIPE_DUTY                    0 # Get Pipeline Duty Cycle Acesses for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_NUM_SM_IDLE                    0 # Get Number of Idle SMs for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_CYCLES                    0 # Get Executed Cycles for Accelwattch-Hybrid from Accel-Sim
-accelwattch_hybrid_perfsim_VOLTAGE                    0 # Get Chip Voltage for Accelwattch-Hybrid from Accel-Sim
-power_trace_enabled                    0 # produce a file for the power trace (1=On, 0=Off)
-power_trace_zlevel                     6 # Compression level of the power trace output log (0=no comp, 9=highest)
-steady_power_levels_enabled                    0 # produce a file for the steady power levels (1=On, 0=Off)
-steady_state_definition                  8:4 # allowed deviation:number of samples
-gpgpu_max_cycle                        0 # terminates gpu simulation early (0 = no limit)
-gpgpu_max_insn                         0 # terminates gpu simulation early (0 = no limit)
-gpgpu_max_cta                          0 # terminates gpu simulation early (0 = no limit)
-gpgpu_max_completed_cta                    0 # terminates gpu simulation early (0 = no limit)
-gpgpu_runtime_stat                   500 # display runtime statistics such as dram utilization {<freq>:<flag>}
-liveness_message_freq                    1 # Minimum number of seconds between simulation liveness messages (0 = always print)
-gpgpu_compute_capability_major                    8 # Major compute capability version number
-gpgpu_compute_capability_minor                    6 # Minor compute capability version number
-gpgpu_flush_l1_cache                    1 # Flush L1 cache at the end of each kernel call
-gpgpu_flush_l2_cache                    0 # Flush L2 cache at the end of each kernel call
-gpgpu_deadlock_detect                    1 # Stop the simulation at deadlock (1=on (default), 0=off)
-gpgpu_ptx_instruction_classification                    0 # if enabled will classify ptx instruction types per kernel (Max 255 kernels now)
-gpgpu_ptx_sim_mode                     0 # Select between Performance (default) or Functional simulation (1)
-gpgpu_clock_domains 1132:113200:1132:1132:3500.5 # Clock Domain Frequencies in MhZ {<Core Clock>:<ICNT Clock>:<Inter-chiplet ICNT Clock>:<L2 Clock>:<DRAM Clock>}
-gpgpu_max_concurrent_kernel                  128 # maximum kernels that can run concurrently on GPU, set this value according to max resident grids for your compute capability
-gpgpu_cflog_interval                    0 # Interval between each snapshot in control flow logger
-visualizer_enabled                     0 # Turn on visualizer output (1=On, 0=Off)
-visualizer_outputfile                 NULL # Specifies the output log file for visualizer
-visualizer_zlevel                      6 # Compression level of the visualizer output log (0=no comp, 9=highest)
-gpgpu_stack_size_limit                 1024 # GPU thread stack size
-gpgpu_heap_size_limit              8388608 # GPU malloc heap size
-gpgpu_runtime_sync_depth_limit                    2 # GPU device runtime synchronize depth
-gpgpu_runtime_pending_launch_count_limit                 2048 # GPU device runtime pending launch count
-trace_enabled                          0 # Turn on traces
-trace_components                    none # comma seperated list of traces to enable. Complete list found in trace_streams.tup. Default none
-trace_sampling_core                    0 # The core which is printed using CORE_DPRINTF. Default 0
-trace_sampling_memory_partition                   -1 # The memory partition which is printed using MEMPART_DPRINTF. Default -1 (i.e. all)
-enable_ptx_file_line_stats                    1 # Turn on PTX source line statistic profiling. (1 = On)
-ptx_line_stats_filename gpgpu_inst_stats.txt # Output file for PTX source line statistics.
-gpgpu_kernel_launch_latency                 5000 # Kernel launch latency in cycles. Default: 0
-gpgpu_cdp_enabled                      0 # Turn on CDP
-gpgpu_TB_launch_latency                    0 # thread block launch latency in cycles. Default: 0
-trace                    ./kernelslist.g # traces kernel filetraces kernel file directory
-trace_opcode_latency_initiation_int                  2,2 # Opcode latencies and initiation for integers in trace driven mode <latency,initiation>
-trace_opcode_latency_initiation_sp                  2,1 # Opcode latencies and initiation for sp in trace driven mode <latency,initiation>
-trace_opcode_latency_initiation_dp                64,64 # Opcode latencies and initiation for dp in trace driven mode <latency,initiation>
-trace_opcode_latency_initiation_sfu                 21,8 # Opcode latencies and initiation for sfu in trace driven mode <latency,initiation>
-trace_opcode_latency_initiation_tensor                32,32 # Opcode latencies and initiation for tensor in trace driven mode <latency,initiation>
-trace_opcode_latency_initiation_spec_op_1                  4,4 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_2                200,4 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_3                32,32 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_4                  4,1 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_5                  4,4 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_6                  4,4 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_7                  4,4 # specialized unit config <latency,initiation>
-trace_opcode_latency_initiation_spec_op_8                  4,4 # specialized unit config <latency,initiation>
DRAM Timing Options:
nbk                                    16 # number of banks
CCD                                     4 # column to column delay
RRD                                    12 # minimal delay between activation of rows in different banks
RCD                                    24 # row to column delay
RAS                                    55 # time needed to activate row
RP                                     24 # time needed to precharge (deactivate) row
RC                                     78 # row cycle time
CDLR                                   10 # switching from write to read (changes tWTR)
WR                                     24 # last data-in to row precharge
CL                                     24 # CAS latency
WL                                      8 # Write latency
nbkgrp                                  4 # number of bank groups
CCDL                                    6 # column to column delay between accesses to different bank groups
RTPL                                    4 # read to precharge delay between accesses to different bank groups
Total number of memory sub partition = 32
addr_dec_mask[CHIP]  = 0000000000000f00         high:12 low:8
addr_dec_mask[BK]    = 0000000000070080         high:19 low:7
addr_dec_mask[ROW]   = 00000000fff80000         high:32 low:19
addr_dec_mask[COL]   = 000000000000f07f         high:16 low:0
addr_dec_mask[BURST] = 000000000000001f         high:5 low:0
sub_partition_id_mask = 0000000000000080
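Note on the masks above: the addr_dec_mask values appear to follow from the -gpgpu_mem_addr_mapping string. The Python sketch below is an illustration only, not GPGPU-Sim's own code. It assumes that each letter marks the bits of one field (R = row, B = bank, C = column, S = burst), that dramid@8 inserts the chip bits at bit 8 and shifts all higher bits up, that the burst bits are also counted in the column mask, and that the chip field is 4 bits wide (a width inferred from the printed CHIP mask, not stated in the log). The name decode_masks is hypothetical.

```python
def decode_masks(option: str, chip_bits: int = 4) -> dict:
    """Derive per-field address masks from a 'dramid@<bit>;<map>' string."""
    head, mapping = option.split(";")
    start = int(head.split("@")[1])      # bit position where the chip bits are inserted
    bits = mapping.replace(".", "")      # MSB first, one character per address bit
    masks = {"R": 0, "B": 0, "C": 0, "S": 0}
    for i, ch in enumerate(bits):
        if ch in masks:
            masks[ch] |= 1 << (len(bits) - 1 - i)

    def spread(mask: int) -> int:
        # Keep the bits below `start`; shift the rest up to make room for the chip field.
        low = mask & ((1 << start) - 1)
        high = mask >> start
        return low | (high << (start + chip_bits))

    return {
        "CHIP": ((1 << chip_bits) - 1) << start,
        "BK": spread(masks["B"]),
        "ROW": spread(masks["R"]),
        "COL": spread(masks["C"] | masks["S"]),  # assumption: burst bits belong to the column
        "BURST": spread(masks["S"]),
    }


MAPPING = ("dramid@8;00000000.00000000.00000000.00000000."
           "0000RRRR.RRRRRRRR.RBBBCCCC.BCCSSSSS")

for name, mask in decode_masks(MAPPING).items():
    print(f"addr_dec_mask[{name}] = {mask:016x}")
```

Under these assumptions the sketch yields the same five mask values that the simulator prints above.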
GPGPU-Sim uArch: clock freqs: 1132000000.000000:113200000000.000000:1132000000.000000:1132000000.000000:3500500000.000000
GPGPU-Sim uArch: clock periods: 0.00000000088339222615:0.00000000088339222615:0.00000000000000000000:0.00000000088339222615:0.00000000028567347522
*** Initializing Memory Statistics ***
GPGPU-Sim uArch: interconnect node map (shaderID+MemID to icntID)
GPGPU-Sim uArch: Memory nodes ID start from index: 32
GPGPU-Sim uArch:    0   1   2   3   4   5   6   7
GPGPU-Sim uArch:    8   9  10  11  12  13  14  15
GPGPU-Sim uArch:   16  17  18  19  20  21  22  23
GPGPU-Sim uArch:   24  25  26  27  28  29  30  31
GPGPU-Sim uArch:   32  33  34  35  36  37  38  39
GPGPU-Sim uArch:   40  41  42  43  44  45  46  47
GPGPU-Sim uArch:   48  49  50  51  52  53  54  55
GPGPU-Sim uArch:   56  57  58  59  60  61  62  63
GPGPU-Sim uArch: interconnect node reverse map (icntID to shaderID+MemID)
GPGPU-Sim uArch: Memory nodes start from ID: 32
GPGPU-Sim uArch:    0   1   2   3   4   5   6   7
GPGPU-Sim uArch:    8   9  10  11  12  13  14  15
GPGPU-Sim uArch:   16  17  18  19  20  21  22  23
GPGPU-Sim uArch:   24  25  26  27  28  29  30  31
GPGPU-Sim uArch:   32  33  34  35  36  37  38  39
GPGPU-Sim uArch:   40  41  42  43  44  45  46  47
GPGPU-Sim uArch:   48  49  50  51  52  53  54  55
GPGPU-Sim uArch:   56  57  58  59  60  61  62  63
GPGPU-Sim uArch: interconnect node map (shaderID+MemID to icntID)
GPGPU-Sim uArch: Memory nodes ID start from index: 8
GPGPU-Sim uArch:    0   1   2   3
GPGPU-Sim uArch:    4   5   6   7
GPGPU-Sim uArch:    8   9  10  11
GPGPU-Sim uArch:   12  13  14  15
GPGPU-Sim uArch: interconnect node reverse map (icntID to shaderID+MemID)
GPGPU-Sim uArch: Memory nodes start from ID: 8
GPGPU-Sim uArch:    0   1   2   3
GPGPU-Sim uArch:    4   5   6   7
GPGPU-Sim uArch:    8   9  10  11
GPGPU-Sim uArch:   12  13  14  15
GPGPU-Sim uArch: performance model initialization complete.
launching memcpy command : MemcpyHtoD,0x00007f9d94000000,85191092
launching memcpy command : MemcpyHtoD,0x00007f9d14000000,2120204360
launching memcpy command : MemcpyHtoD,0x00007f9d0e000000,85191092
launching memcpy command : MemcpyHtoD,0x00007f9b00000000,2120204360
launching memcpy command : MemcpyHtoD,0x00007f9b7e600000,21297772
launching memcpy command : MemcpyHtoD,0x00007f9cee000000,530051090
launching memcpy command : MemcpyHtoD,0x00007f9dd6101000,28
launching memcpy command : MemcpyHtoD,0x00007f9dd6101600,491520
launching memcpy command : MemcpyHtoD,0x00007f9dd6100000,688
launching memcpy command : MemcpyHtoD,0x00007f9dd6100600,4
launching memcpy command : MemcpyHtoD,0x00007f9dd6100800,4
launching memcpy command : MemcpyHtoD,0x00007f9dd6101000,28
Processing kernel ./kernel-10.traceg
-kernel name = _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_
-kernel id = 10
-grid dim = (544,1,1)
-block dim = (256,1,1)
-shmem = 12160
-nregs = 34
-binary version = 86
-cuda stream id = -685638256
-shmem base_addr = 0x00007f9e58000000
-local mem base_addr = 0x00007f9e56000000
-nvbit version = 1.5.3
-accelsim tracer version = 3
Header info loaded for kernel command : ./kernel-10.traceg
launching kernel name: _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_ uid: 1
GPGPU-Sim uArch: Shader 0 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
GPGPU-Sim uArch: CTA/core = 6, limited by: threads
GPGPU-Sim: Reconfigure L1 cache to 28KB
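Note on the two lines above: "CTA/core = 6, limited by: threads" and "Reconfigure L1 cache to 28KB" can be sanity-checked from the kernel header (block dim 256, nregs 34, shmem 12160). The sketch below is a simplified occupancy estimate, not the simulator's code. The per-SM limits (1536 threads, 65536 registers, 16 CTAs, a 128 KB unified L1/shared-memory array with shared-memory carve-outs of 0, 8, 16, 32, 64 and 100 KB) are assumptions typical of a compute capability 8.6 configuration; they are not printed in this part of the log. Register allocation granularity is ignored.

```python
# Assumed per-SM limits (not shown in this part of the log).
MAX_THREADS = 1536
MAX_REGS = 65536
MAX_CTAS = 16
UNIFIED_KB = 128
SHMEM_CARVEOUTS_KB = [0, 8, 16, 32, 64, 100]


def cta_per_core(block_threads, nregs, shmem_bytes):
    """Return (resident CTAs per core, name of the limiting resource)."""
    limits = {
        "threads": MAX_THREADS // block_threads,
        "regs": MAX_REGS // (nregs * block_threads),
        "shmem": (max(SHMEM_CARVEOUTS_KB) * 1024) // shmem_bytes
        if shmem_bytes else MAX_CTAS,
        "cta_limit": MAX_CTAS,
    }
    limiter = min(limits, key=limits.get)
    return limits[limiter], limiter


def l1_size_kb(ctas, shmem_bytes):
    """Pick the smallest carve-out that fits all resident CTAs; the rest is L1."""
    needed = ctas * shmem_bytes
    carveout = next(kb for kb in SHMEM_CARVEOUTS_KB if kb * 1024 >= needed)
    return UNIFIED_KB - carveout


ctas, limiter = cta_per_core(block_threads=256, nregs=34, shmem_bytes=12160)
print(ctas, limiter, l1_size_kb(ctas, 12160))
```

With the assumed limits, 6 CTAs of 12160 bytes need 72960 bytes of shared memory, so the 100 KB carve-out is selected and 28 KB remain for L1, which agrees with the log.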
thread block = 0,0,0
GPGPU-Sim uArch: Shader 1 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 1,0,0
GPGPU-Sim uArch: Shader 2 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 2,0,0
GPGPU-Sim uArch: Shader 3 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 3,0,0
GPGPU-Sim uArch: Shader 4 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 4,0,0
GPGPU-Sim uArch: Shader 5 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 5,0,0
GPGPU-Sim uArch: Shader 6 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 6,0,0
GPGPU-Sim uArch: Shader 7 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 7,0,0
GPGPU-Sim uArch: Shader 8 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 8,0,0
GPGPU-Sim uArch: Shader 9 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 9,0,0
GPGPU-Sim uArch: Shader 10 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 10,0,0
GPGPU-Sim uArch: Shader 11 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 11,0,0
GPGPU-Sim uArch: Shader 12 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 12,0,0
GPGPU-Sim uArch: Shader 13 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 13,0,0
GPGPU-Sim uArch: Shader 14 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 14,0,0
GPGPU-Sim uArch: Shader 15 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 15,0,0
GPGPU-Sim uArch: Shader 16 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 16,0,0
GPGPU-Sim uArch: Shader 17 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 17,0,0
GPGPU-Sim uArch: Shader 18 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 18,0,0
GPGPU-Sim uArch: Shader 19 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 19,0,0
GPGPU-Sim uArch: Shader 20 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 20,0,0
GPGPU-Sim uArch: Shader 21 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 21,0,0
GPGPU-Sim uArch: Shader 22 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 22,0,0
GPGPU-Sim uArch: Shader 23 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 23,0,0
GPGPU-Sim uArch: Shader 24 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 24,0,0
GPGPU-Sim uArch: Shader 25 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 25,0,0
GPGPU-Sim uArch: Shader 26 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 26,0,0
GPGPU-Sim uArch: Shader 27 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 27,0,0
GPGPU-Sim uArch: Shader 28 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 28,0,0
GPGPU-Sim uArch: Shader 29 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 29,0,0
GPGPU-Sim uArch: Shader 30 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 30,0,0
GPGPU-Sim uArch: Shader 31 bind to kernel 1 '_ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_'
thread block = 31,0,0
thread block = 32,0,0
thread block = 33,0,0
thread block = 34,0,0
thread block = 35,0,0
thread block = 36,0,0
thread block = 37,0,0
thread block = 38,0,0
thread block = 39,0,0
thread block = 40,0,0
thread block = 41,0,0
thread block = 42,0,0
thread block = 43,0,0
thread block = 44,0,0
thread block = 45,0,0
thread block = 46,0,0
thread block = 47,0,0
thread block = 48,0,0
thread block = 49,0,0
thread block = 50,0,0
thread block = 51,0,0
thread block = 52,0,0
thread block = 53,0,0
thread block = 54,0,0
thread block = 55,0,0
thread block = 56,0,0
thread block = 57,0,0
thread block = 58,0,0
thread block = 59,0,0
thread block = 60,0,0
thread block = 61,0,0
thread block = 62,0,0
thread block = 63,0,0
thread block = 64,0,0
thread block = 65,0,0
thread block = 66,0,0
thread block = 67,0,0
thread block = 68,0,0
thread block = 69,0,0
thread block = 70,0,0
thread block = 71,0,0
thread block = 72,0,0
thread block = 73,0,0
thread block = 74,0,0
thread block = 75,0,0
thread block = 76,0,0
thread block = 77,0,0
thread block = 78,0,0
thread block = 79,0,0
thread block = 80,0,0
thread block = 81,0,0
thread block = 82,0,0
thread block = 83,0,0
thread block = 84,0,0
thread block = 85,0,0
thread block = 86,0,0
thread block = 87,0,0
thread block = 88,0,0
thread block = 89,0,0
thread block = 90,0,0
thread block = 91,0,0
thread block = 92,0,0
thread block = 93,0,0
thread block = 94,0,0
thread block = 95,0,0
thread block = 96,0,0
thread block = 97,0,0
thread block = 98,0,0
thread block = 99,0,0
thread block = 100,0,0
thread block = 101,0,0
thread block = 102,0,0
thread block = 103,0,0
thread block = 104,0,0
thread block = 105,0,0
thread block = 106,0,0
thread block = 107,0,0
thread block = 108,0,0
thread block = 109,0,0
thread block = 110,0,0
thread block = 111,0,0
thread block = 112,0,0
thread block = 113,0,0
thread block = 114,0,0
thread block = 115,0,0
thread block = 116,0,0
thread block = 117,0,0
thread block = 118,0,0
thread block = 119,0,0
thread block = 120,0,0
thread block = 121,0,0
thread block = 122,0,0
thread block = 123,0,0
thread block = 124,0,0
thread block = 125,0,0
thread block = 126,0,0
thread block = 127,0,0
thread block = 128,0,0
thread block = 129,0,0
thread block = 130,0,0
thread block = 131,0,0
thread block = 132,0,0
thread block = 133,0,0
thread block = 134,0,0
thread block = 135,0,0
thread block = 136,0,0
thread block = 137,0,0
thread block = 138,0,0
thread block = 139,0,0
thread block = 140,0,0
thread block = 141,0,0
thread block = 142,0,0
thread block = 143,0,0
thread block = 144,0,0
thread block = 145,0,0
thread block = 146,0,0
thread block = 147,0,0
thread block = 148,0,0
thread block = 149,0,0
thread block = 150,0,0
thread block = 151,0,0
thread block = 152,0,0
thread block = 153,0,0
thread block = 154,0,0
thread block = 155,0,0
thread block = 156,0,0
thread block = 157,0,0
thread block = 158,0,0
thread block = 159,0,0
thread block = 160,0,0
thread block = 161,0,0
thread block = 162,0,0
thread block = 163,0,0
thread block = 164,0,0
thread block = 165,0,0
thread block = 166,0,0
thread block = 167,0,0
thread block = 168,0,0
thread block = 169,0,0
thread block = 170,0,0
thread block = 171,0,0
thread block = 172,0,0
thread block = 173,0,0
thread block = 174,0,0
thread block = 175,0,0
thread block = 176,0,0
thread block = 177,0,0
thread block = 178,0,0
thread block = 179,0,0
thread block = 180,0,0
thread block = 181,0,0
thread block = 182,0,0
thread block = 183,0,0
thread block = 184,0,0
thread block = 185,0,0
thread block = 186,0,0
thread block = 187,0,0
thread block = 188,0,0
thread block = 189,0,0
thread block = 190,0,0
thread block = 191,0,0
thread block = 192,0,0
thread block = 193,0,0
thread block = 194,0,0
thread block = 195,0,0
thread block = 196,0,0
thread block = 197,0,0
thread block = 198,0,0
thread block = 199,0,0
thread block = 200,0,0
thread block = 201,0,0
thread block = 202,0,0
thread block = 203,0,0
thread block = 204,0,0
thread block = 205,0,0
thread block = 206,0,0
thread block = 207,0,0
thread block = 208,0,0
thread block = 209,0,0
thread block = 210,0,0
thread block = 211,0,0
thread block = 212,0,0
thread block = 213,0,0
thread block = 214,0,0
thread block = 215,0,0
thread block = 216,0,0
thread block = 217,0,0
thread block = 218,0,0
thread block = 219,0,0
thread block = 220,0,0
thread block = 221,0,0
thread block = 222,0,0
thread block = 223,0,0
thread block = 224,0,0
thread block = 225,0,0
thread block = 226,0,0
thread block = 227,0,0
thread block = 228,0,0
thread block = 229,0,0
thread block = 230,0,0
thread block = 231,0,0
thread block = 232,0,0
thread block = 233,0,0
thread block = 234,0,0
thread block = 235,0,0
thread block = 236,0,0
thread block = 237,0,0
thread block = 238,0,0
thread block = 239,0,0
thread block = 240,0,0
thread block = 241,0,0
thread block = 242,0,0
thread block = 243,0,0
thread block = 244,0,0
thread block = 245,0,0
thread block = 246,0,0
thread block = 247,0,0
thread block = 248,0,0
thread block = 249,0,0
thread block = 250,0,0
thread block = 251,0,0
thread block = 252,0,0
thread block = 253,0,0
thread block = 254,0,0
thread block = 255,0,0
thread block = 256,0,0
thread block = 257,0,0
thread block = 258,0,0
thread block = 259,0,0
thread block = 260,0,0
thread block = 261,0,0
thread block = 262,0,0
thread block = 263,0,0
thread block = 264,0,0
thread block = 265,0,0
thread block = 266,0,0
thread block = 267,0,0
thread block = 268,0,0
thread block = 269,0,0
thread block = 270,0,0
thread block = 271,0,0
thread block = 272,0,0
thread block = 273,0,0
thread block = 274,0,0
thread block = 275,0,0
thread block = 276,0,0
thread block = 277,0,0
thread block = 278,0,0
thread block = 279,0,0
thread block = 280,0,0
thread block = 281,0,0
thread block = 282,0,0
thread block = 283,0,0
thread block = 284,0,0
thread block = 285,0,0
thread block = 286,0,0
thread block = 287,0,0
thread block = 288,0,0
thread block = 289,0,0
thread block = 290,0,0
thread block = 291,0,0
thread block = 292,0,0
thread block = 293,0,0
thread block = 294,0,0
thread block = 295,0,0
thread block = 296,0,0
thread block = 297,0,0
thread block = 298,0,0
thread block = 299,0,0
thread block = 300,0,0
thread block = 301,0,0
thread block = 302,0,0
thread block = 303,0,0
thread block = 304,0,0
thread block = 305,0,0
thread block = 306,0,0
thread block = 307,0,0
thread block = 308,0,0
thread block = 309,0,0
thread block = 310,0,0
thread block = 311,0,0
thread block = 312,0,0
thread block = 313,0,0
thread block = 314,0,0
thread block = 315,0,0
thread block = 316,0,0
thread block = 317,0,0
thread block = 318,0,0
thread block = 319,0,0
thread block = 320,0,0
thread block = 321,0,0
thread block = 322,0,0
thread block = 323,0,0
thread block = 324,0,0
thread block = 325,0,0
thread block = 326,0,0
thread block = 327,0,0
thread block = 328,0,0
thread block = 329,0,0
thread block = 330,0,0
thread block = 331,0,0
thread block = 332,0,0
thread block = 333,0,0
thread block = 334,0,0
thread block = 335,0,0
thread block = 336,0,0
thread block = 337,0,0
thread block = 338,0,0
thread block = 339,0,0
thread block = 340,0,0
thread block = 341,0,0
thread block = 342,0,0
thread block = 343,0,0
thread block = 344,0,0
thread block = 345,0,0
thread block = 346,0,0
thread block = 347,0,0
thread block = 348,0,0
thread block = 349,0,0
thread block = 350,0,0
thread block = 351,0,0
thread block = 352,0,0
thread block = 353,0,0
thread block = 354,0,0
thread block = 355,0,0
thread block = 356,0,0
thread block = 357,0,0
thread block = 358,0,0
thread block = 359,0,0
thread block = 360,0,0
thread block = 361,0,0
thread block = 362,0,0
thread block = 363,0,0
thread block = 364,0,0
thread block = 365,0,0
thread block = 366,0,0
thread block = 367,0,0
thread block = 368,0,0
thread block = 369,0,0
thread block = 370,0,0
thread block = 371,0,0
thread block = 372,0,0
thread block = 373,0,0
thread block = 374,0,0
thread block = 375,0,0
thread block = 376,0,0
thread block = 377,0,0
thread block = 378,0,0
thread block = 379,0,0
thread block = 380,0,0
thread block = 381,0,0
thread block = 382,0,0
thread block = 383,0,0
thread block = 384,0,0
thread block = 385,0,0
thread block = 386,0,0
thread block = 387,0,0
thread block = 388,0,0
thread block = 389,0,0
thread block = 390,0,0
thread block = 391,0,0
thread block = 392,0,0
thread block = 393,0,0
thread block = 394,0,0
thread block = 395,0,0
thread block = 396,0,0
thread block = 397,0,0
thread block = 398,0,0
thread block = 399,0,0
thread block = 400,0,0
thread block = 401,0,0
thread block = 402,0,0
thread block = 403,0,0
thread block = 404,0,0
thread block = 405,0,0
thread block = 406,0,0
thread block = 407,0,0
thread block = 408,0,0
thread block = 409,0,0
thread block = 410,0,0
thread block = 411,0,0
thread block = 412,0,0
thread block = 413,0,0
thread block = 414,0,0
thread block = 415,0,0
thread block = 416,0,0
thread block = 417,0,0
thread block = 418,0,0
thread block = 419,0,0
thread block = 420,0,0
thread block = 421,0,0
thread block = 422,0,0
thread block = 423,0,0
thread block = 424,0,0
thread block = 425,0,0
thread block = 426,0,0
thread block = 427,0,0
thread block = 428,0,0
thread block = 429,0,0
thread block = 430,0,0
thread block = 431,0,0
thread block = 432,0,0
thread block = 433,0,0
thread block = 434,0,0
thread block = 435,0,0
thread block = 436,0,0
thread block = 437,0,0
thread block = 438,0,0
thread block = 439,0,0
thread block = 440,0,0
thread block = 441,0,0
thread block = 442,0,0
thread block = 443,0,0
thread block = 444,0,0
thread block = 445,0,0
thread block = 446,0,0
thread block = 447,0,0
thread block = 448,0,0
thread block = 449,0,0
thread block = 450,0,0
thread block = 451,0,0
thread block = 452,0,0
thread block = 453,0,0
thread block = 454,0,0
thread block = 455,0,0
thread block = 456,0,0
thread block = 457,0,0
thread block = 458,0,0
thread block = 459,0,0
thread block = 460,0,0
thread block = 461,0,0
thread block = 462,0,0
thread block = 463,0,0
thread block = 464,0,0
thread block = 465,0,0
thread block = 466,0,0
thread block = 467,0,0
thread block = 468,0,0
thread block = 469,0,0
thread block = 470,0,0
thread block = 471,0,0
thread block = 472,0,0
thread block = 473,0,0
thread block = 474,0,0
thread block = 475,0,0
thread block = 476,0,0
thread block = 477,0,0
thread block = 478,0,0
thread block = 479,0,0
thread block = 480,0,0
thread block = 481,0,0
thread block = 482,0,0
thread block = 483,0,0
thread block = 484,0,0
thread block = 485,0,0
thread block = 486,0,0
thread block = 487,0,0
thread block = 488,0,0
thread block = 489,0,0
thread block = 490,0,0
thread block = 491,0,0
thread block = 492,0,0
thread block = 493,0,0
thread block = 494,0,0
thread block = 495,0,0
thread block = 496,0,0
thread block = 497,0,0
thread block = 498,0,0
thread block = 499,0,0
thread block = 500,0,0
thread block = 501,0,0
thread block = 502,0,0
thread block = 503,0,0
thread block = 504,0,0
thread block = 505,0,0
thread block = 506,0,0
thread block = 507,0,0
thread block = 508,0,0
thread block = 509,0,0
thread block = 510,0,0
thread block = 511,0,0
thread block = 512,0,0
thread block = 513,0,0
thread block = 514,0,0
thread block = 515,0,0
thread block = 516,0,0
thread block = 517,0,0
thread block = 518,0,0
thread block = 519,0,0
thread block = 520,0,0
thread block = 521,0,0
thread block = 522,0,0
thread block = 523,0,0
thread block = 524,0,0
thread block = 525,0,0
thread block = 526,0,0
thread block = 527,0,0
thread block = 528,0,0
thread block = 529,0,0
thread block = 530,0,0
thread block = 531,0,0
thread block = 532,0,0
thread block = 533,0,0
thread block = 534,0,0
thread block = 535,0,0
thread block = 536,0,0
thread block = 537,0,0
thread block = 538,0,0
thread block = 539,0,0
thread block = 540,0,0
thread block = 541,0,0
thread block = 542,0,0
thread block = 543,0,0
Destroy streams for kernel 1: size 0
kernel_name = _ZN7gunrock5oprtr4CULL6KernelILj1EjjjijZNS_3app3bfs16BFSIterationLoopINS4_7EnactorINS4_7ProblemINS3_9TestGraphIjjiLj768ELj0EEEjiLj0EEELj0ELj0EEEE4CoreEiEUlRKjRjSE_SE_SE_SF_E0_EEvbT2_PKT0_PKT3_SH_T4_PSO_PhPT1_NS_4util15CtaWorkProgressISH_EET5_
kernel_launch_uid = 1
Hossein: Number of Local Requests: 35
Hossein: Number of Remote Requests: 513
gpu_sim_cycle = 8748
gpu_sim_insn = 4086966
gpu_ipc =     467.1886
gpu_tot_sim_cycle = 8748
gpu_tot_sim_insn = 4086966
gpu_tot_ipc =     467.1886
gpu_tot_issued_cta = 544
gpu_occupancy = 87.7916%
gpu_tot_occupancy = 87.7916%
max_total_param_size = 0
gpu_stall_dramfull = 336
gpu_stall_icnt2sh    = 0
partiton_level_parallism =       0.0626
partiton_level_parallism_total  =       0.0626
partiton_level_parallism_util =       1.0000
partiton_level_parallism_util_total  =       1.0000
L2_BW  =       2.2692 GB/Sec
L2_BW_total  =       2.2692 GB/Sec
gpu_total_sim_rate=85145

========= Core cache stats =========
L1I_cache:
        L1I_total_cache_accesses = 0
        L1I_total_cache_misses = 0
        L1I_total_cache_pending_hits = 0
        L1I_total_cache_reservation_fails = 0
L1D_cache:
        L1D_cache_core[0]: Access = 2, Miss = 2, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[1]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[2]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[3]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[4]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[5]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[6]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[7]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[8]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[9]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[10]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[11]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[12]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[13]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[14]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[15]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[16]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[17]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[18]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[19]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[20]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[21]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[22]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[23]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[24]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[25]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[26]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[27]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[28]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[29]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[30]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_cache_core[31]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
        L1D_total_cache_accesses = 2
        L1D_total_cache_misses = 2
        L1D_total_cache_miss_rate = 1.0000
        L1D_total_cache_pending_hits = 0
        L1D_total_cache_reservation_fails = 0
        L1D_cache_data_port_util = 0.000
        L1D_cache_fill_port_util = 0.000
L1C_cache:
        L1C_total_cache_accesses = 0
        L1C_total_cache_misses = 0
        L1C_total_cache_pending_hits = 0
        L1C_total_cache_reservation_fails = 0
L1T_cache:
        L1T_total_cache_accesses = 0
        L1T_total_cache_misses = 0
        L1T_total_cache_pending_hits = 0
        L1T_total_cache_reservation_fails = 0

Total_core_cache_stats:
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][HIT] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][MISS] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][HIT] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][MISS] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][HIT] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][MISS] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[CONST_ACC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][HIT] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][MISS] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[TEXTURE_ACC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][HIT] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][MISS] = 2
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][HIT] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][MISS] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[LOCAL_ACC_W][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][HIT] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][MISS] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[L1_WRBK_ACC][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][HIT] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][MISS] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[L2_WRBK_ACC][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][HIT] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][MISS] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[INST_ACC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][HIT] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][MISS] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[L1_WR_ALLOC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][HIT] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][HIT_RESERVED] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][MISS] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][RESERVATION_FAIL] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][SECTOR_MISS] = 0
        Total_core_cache_stats_breakdown[L2_WR_ALLOC_R][MSHR_HIT] = 0
        Total_core_cache_stats_breakdown[GLOBAL_ACC_W][TOTAL_ACCESS] = 2

Total_core_cache_fail_stats:
ctas_completed 544, Shader 0 warp_id issue ditsribution:
warp_id:
0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47,
distro:
277, 159, 159, 159, 188, 174, 174, 174, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 240, 96, 96, 96, 96, 96, 96, 96, 160, 64, 64, 64, 64, 64, 64, 64,
gpgpu_n_tot_thrd_icount = 5329152
gpgpu_n_tot_w_icount = 166536
gpgpu_n_stall_shd_mem = 0
gpgpu_n_mem_read_local = 0
gpgpu_n_mem_write_local = 0
gpgpu_n_mem_read_global = 546
gpgpu_n_mem_write_global = 2
gpgpu_n_mem_texture = 0
gpgpu_n_mem_const = 0
gpgpu_n_load_insn  = 546
gpgpu_n_store_insn = 2
gpgpu_n_shmem_insn = 424700
gpgpu_n_sstarr_insn = 0
gpgpu_n_tex_insn = 0
gpgpu_n_const_mem_insn = 0
gpgpu_n_param_mem_insn = 0
gpgpu_n_shmem_bkconflict = 0
gpgpu_n_cache_bkconflict = 0
gpgpu_n_intrawarp_mshr_merge = 0
gpgpu_n_cmem_portconflict = 0
gpgpu_stall_shd_mem[c_mem][resource_stall] = 0
gpgpu_stall_shd_mem[s_mem][bk_conf] = 0
gpgpu_stall_shd_mem[gl_mem][resource_stall] = 0
gpgpu_stall_shd_mem[gl_mem][coal_stall] = 0
gpgpu_stall_shd_mem[gl_mem][data_port_stall] = 0
gpu_reg_bank_conflict_stalls = 0
Warp Occupancy Distribution:
Stall:91835     W0_Idle:89655   W0_Scoreboard:65779     W1:20697        W2:0    W3:0    W4:0    W5:34   W6:0    W7:0    W8:0    W9:0    W10:0   W11:0   W12:0   W13:0   W14:0   W15:0       W16:0   W17:0   W18:0   W19:0   W20:0   W21:0   W22:0   W23:0   W24:0   W25:0   W26:0   W27:15  W28:0   W29:0   W30:0   W31:2178        W32:124943
single_issue_nums: WS0:61281    WS1:35085       WS2:35085       WS3:35085
dual_issue_nums: WS0:0  WS1:0   WS2:0   WS3:0
traffic_breakdown_coretomem[GLOBAL_ACC_R] = 4360 {8:545,}
traffic_breakdown_coretomem[GLOBAL_ACC_W] = 80 {40:2,}
traffic_breakdown_coretomem[GLOBAL_ATOMIC] = 40 {40:1,}
traffic_breakdown_memtocore[GLOBAL_ACC_R] = 21800 {40:545,}
traffic_breakdown_memtocore[GLOBAL_ACC_W] = 16 {8:2,}
traffic_breakdown_memtocore[GLOBAL_ATOMIC] = 40 {40:1,}
maxmflatency = 549
max_icnt2mem_latency = 85
maxmrqlatency = 0
max_icnt2sh_latency = 7
averagemflatency = 314
avg_icnt2mem_latency = 23
avg_mrq_latency = 0
avg_icnt2sh_latency = 7
mrq_lat_table:2         0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0  0        0       0       0       0       0       0       0       0       0       0
dq_lat_table:0  0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0  0        0       0       0       0       0       0       0       0       0
mf_lat_table:0  0       0       0       0       0       0       355     65      128     0       0       0       0       0       0       0       0       0       0       0       0  0        0       0       0       0       0       0       0       0       0
icnt2mem_lat_table:0    0       267     104     14      95      68      0       0       0       0       0       0       0       0       0       0       0       0       0       0  0        0       0
icnt2sh_lat_table:0     0       548     0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0       0  0        0       0
mf_lat_pw_table:0       0       0       0       0       0       0       5       0       1       0       0       0       0       0       0       0       0       0       0       0  0        0       0       0       0       0       0       0       0       0       0
maximum concurrent accesses to same row:
dram[0]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
maximum service time to same row:
dram[0]:      7506         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:      5680         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
average row accesses per activate:
dram[0]:  1.000000      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[1]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[2]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[3]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[4]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[5]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[6]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[7]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[8]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[9]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[10]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[11]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[12]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[13]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[14]:      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
dram[15]:  1.000000      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan      -nan
average row locality = 2/2 = 1.000000
number of total memory accesses made:
dram[0]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
total accesses: 0
min_bank_accesses = 0!
min_chip_accesses = 0!
number of total read accesses:
dram[0]:         1         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:         1         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
total dram reads = 2
min_bank_accesses = 0!
min_chip_accesses = 0!
number of total write accesses:
dram[0]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
total dram writes = 0
min_bank_accesses = 0!
min_chip_accesses = 0!
average mf latency per bank:
dram[0]:        476    none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[1]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[2]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[3]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[4]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[5]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[6]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[7]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[8]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[9]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[10]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[11]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[12]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[13]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[14]:     none      none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
dram[15]:     171358    none      none      none      none      none      none      none      none      none      none      none      none      none      none      none
maximum mf latency per bank:
dram[0]:        476         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[1]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[2]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[3]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[4]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[5]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[6]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[7]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[8]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[9]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[10]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[11]:        243         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[12]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[13]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[14]:          0         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
dram[15]:        549         0         0         0         0         0         0         0         0         0         0         0         0         0         0         0
Memory Partition 0:
Cache L2_bank_000:
MSHR contents

Cache L2_bank_001:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[0]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27047 n_act=1 n_pre=0 n_ref_event=0 n_req=1 n_rd=1 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0.0001479
n_activity=78 dram_eff=0.05128
bk0: 1a 27025i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = 0.000000
Row_Buffer_Locality_read = 0.000000
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = 1.000000
Bank_Level_Parallism_Col = 1.000000
Bank_Level_Parallism_Ready = 1.000000
write_to_read_ratio_blp_rw_average = 0.000000
GrpLevelPara = 1.000000

BW Util details:
bwutil = 0.000148
total_CMD = 27049
util_bw = 4
Wasted_Col = 24
Wasted_Row = 0
Idle = 27021

BW Util Bottlenecks:
RCDc_limit = 24
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27047
Read = 1
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 1
n_pre = 0
n_ref = 0
n_req = 1
total_req = 1

Dual Bus Interface Util:
issued_total_row = 1
issued_total_col = 1
Row_Bus_Util =  0.000037
CoL_Bus_Util = 0.000037
Either_Row_CoL_Bus_Util = 0.000074
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = 0.000000
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 1:
Cache L2_bank_002:
MSHR contents

Cache L2_bank_003:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[1]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 2:
Cache L2_bank_004:
MSHR contents

Cache L2_bank_005:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[2]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 3:
Cache L2_bank_006:
MSHR contents

Cache L2_bank_007:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[3]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 4:
Cache L2_bank_008:
MSHR contents

Cache L2_bank_009:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[4]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 5:
Cache L2_bank_010:
MSHR contents

Cache L2_bank_011:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[5]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 6:
Cache L2_bank_012:
MSHR contents

Cache L2_bank_013:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[6]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 7:
Cache L2_bank_014:
MSHR contents

Cache L2_bank_015:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[7]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 8:
Cache L2_bank_016:
MSHR contents

Cache L2_bank_017:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[8]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 9:
Cache L2_bank_018:
MSHR contents

Cache L2_bank_019:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[9]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 10:
Cache L2_bank_020:
MSHR contents

Cache L2_bank_021:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[10]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 11:
Cache L2_bank_022:
MSHR contents

Cache L2_bank_023:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[11]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 12:
Cache L2_bank_024:
MSHR contents

Cache L2_bank_025:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[12]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 13:
Cache L2_bank_026:
MSHR contents

Cache L2_bank_027:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[13]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 14:
Cache L2_bank_028:
MSHR contents

Cache L2_bank_029:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[14]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27049 n_act=0 n_pre=0 n_ref_event=0 n_req=0 n_rd=0 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0
n_activity=0 dram_eff=-nan
bk0: 0a 27049i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = -nan
Row_Buffer_Locality_read = -nan
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = -nan
Bank_Level_Parallism_Col = -nan
Bank_Level_Parallism_Ready = -nan
write_to_read_ratio_blp_rw_average = -nan
GrpLevelPara = -nan

BW Util details:
bwutil = 0.000000
total_CMD = 27049
util_bw = 0
Wasted_Col = 0
Wasted_Row = 0
Idle = 27049

BW Util Bottlenecks:
RCDc_limit = 0
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27049
Read = 0
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 0
n_pre = 0
n_ref = 0
n_req = 0
total_req = 0

Dual Bus Interface Util:
issued_total_row = 0
issued_total_col = 0
Row_Bus_Util =  0.000000
CoL_Bus_Util = 0.000000
Either_Row_CoL_Bus_Util = 0.000000
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = -nan
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0
Memory Partition 15:
Cache L2_bank_030:
MSHR contents

Cache L2_bank_031:
MSHR contents

In Dram Latency Queue (total = 0):
DRAM[15]: 16 bks, busW=2 BL=16 CL=24, tRRD=12 tCCD=4, tRCD=24 tRAS=55 tRP=24 tRC=78
n_cmd=27049 n_nop=27047 n_act=1 n_pre=0 n_ref_event=0 n_req=1 n_rd=1 n_rd_L2_A=0 n_write=0 n_wr_bk=0 bw_util=0.0001479
n_activity=78 dram_eff=0.05128
bk0: 1a 27025i bk1: 0a 27049i bk2: 0a 27049i bk3: 0a 27049i bk4: 0a 27049i bk5: 0a 27049i bk6: 0a 27049i bk7: 0a 27049i bk8: 0a 27049i bk9: 0a 27049i bk10: 0a 27049i bk11: 0a 27049i bk12: 0a 27049i bk13: 0a 27049i bk14: 0a 27049i bk15: 0a 27049i

------------------------------------------------------------------------

Row_Buffer_Locality = 0.000000
Row_Buffer_Locality_read = 0.000000
Row_Buffer_Locality_write = -nan
Bank_Level_Parallism = 1.000000
Bank_Level_Parallism_Col = 1.000000
Bank_Level_Parallism_Ready = 1.000000
write_to_read_ratio_blp_rw_average = 0.000000
GrpLevelPara = 1.000000

BW Util details:
bwutil = 0.000148
total_CMD = 27049
util_bw = 4
Wasted_Col = 24
Wasted_Row = 0
Idle = 27021

BW Util Bottlenecks:
RCDc_limit = 24
RCDWRc_limit = 0
WTRc_limit = 0
RTWc_limit = 0
CCDLc_limit = 0
rwq = 0
CCDLc_limit_alone = 0
WTRc_limit_alone = 0
RTWc_limit_alone = 0

Commands details:
total_CMD = 27049
n_nop = 27047
Read = 1
Write = 0
L2_Alloc = 0
L2_WB = 0
n_act = 1
n_pre = 0
n_ref = 0
n_req = 1
total_req = 1

Dual Bus Interface Util:
issued_total_row = 1
issued_total_col = 1
Row_Bus_Util =  0.000037
CoL_Bus_Util = 0.000037
Either_Row_CoL_Bus_Util = 0.000074
Issued_on_Two_Bus_Simul_Util = 0.000000
issued_two_Eff = 0.000000
queue_avg = 0.000000


dram_util_bins: 0 0 0 0 0 0 0 0 0 0
dram_eff_bins: 0 0 0 0 0 0 0 0 0 0
mrqq: max=0 avg=0

========= L2 cache stats =========
L2_cache_bank[0]: Access = 1, Miss = 1, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[1]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[2]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[3]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[4]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[5]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[6]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[7]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[8]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[9]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[10]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[11]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[12]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[13]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[14]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[15]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[16]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[17]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[18]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[19]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[20]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[21]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[22]: Access = 1, Miss = 1, Miss_rate = 1.000, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[23]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[24]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[25]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[26]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[27]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[28]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[29]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_cache_bank[30]: Access = 546, Miss = 1, Miss_rate = 0.002, Pending_hits = 3, Reservation_fails = 268
L2_cache_bank[31]: Access = 0, Miss = 0, Miss_rate = -nan, Pending_hits = 0, Reservation_fails = 0
L2_total_cache_accesses = 548
L2_total_cache_misses = 3
L2_total_cache_miss_rate = 0.0055
L2_total_cache_pending_hits = 3
L2_total_cache_reservation_fails = 268
L2_total_cache_breakdown:
        L2_cache_stats_breakdown[GLOBAL_ACC_R][HIT] = 541
        L2_cache_stats_breakdown[GLOBAL_ACC_R][HIT_RESERVED] = 3
        L2_cache_stats_breakdown[GLOBAL_ACC_R][MISS] = 2
        L2_cache_stats_breakdown[GLOBAL_ACC_R][RESERVATION_FAIL] = 268
        L2_cache_stats_breakdown[GLOBAL_ACC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_R][MSHR_HIT] = 3
        L2_cache_stats_breakdown[LOCAL_ACC_R][HIT] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_R][MISS] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][HIT] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][MISS] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[CONST_ACC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][HIT] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][MISS] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[TEXTURE_ACC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_W][HIT] = 1
        L2_cache_stats_breakdown[GLOBAL_ACC_W][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_W][MISS] = 1
        L2_cache_stats_breakdown[GLOBAL_ACC_W][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_W][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_W][MSHR_HIT] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][HIT] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][MISS] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[LOCAL_ACC_W][MSHR_HIT] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][HIT] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][MISS] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[L1_WRBK_ACC][MSHR_HIT] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][HIT] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][MISS] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[L2_WRBK_ACC][MSHR_HIT] = 0
        L2_cache_stats_breakdown[INST_ACC_R][HIT] = 0
        L2_cache_stats_breakdown[INST_ACC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[INST_ACC_R][MISS] = 0
        L2_cache_stats_breakdown[INST_ACC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[INST_ACC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[INST_ACC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][HIT] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][MISS] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[L1_WR_ALLOC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][HIT] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][HIT_RESERVED] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][MISS] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][RESERVATION_FAIL] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][SECTOR_MISS] = 0
        L2_cache_stats_breakdown[L2_WR_ALLOC_R][MSHR_HIT] = 0
        L2_cache_stats_breakdown[GLOBAL_ACC_R][TOTAL_ACCESS] = 546
        L2_cache_stats_breakdown[GLOBAL_ACC_W][TOTAL_ACCESS] = 2
L2_total_cache_reservation_fail_breakdown:
        L2_cache_stats_fail_breakdown[GLOBAL_ACC_R][MSHR_MERGE_ENRTY_FAIL] = 268
L2_cache_data_port_util = 0.002
L2_cache_fill_port_util = 0.000

icnt_total_pkts_mem_to_simt=548
icnt_total_pkts_simt_to_mem=548
LD_mem_lat_dist  0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
ST_mem_lat_dist  0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
----------------------------Interconnect-DETAILS--------------------------------
----------------------------NOC-DETAILS--------------------------------
Class 0:
Packet latency average = 5.11429
        minimum = 5
        maximum = 7
Network latency average = 5.11429
        minimum = 5
        maximum = 7
Slowest packet = 3
Flit latency average = 5.11429
        minimum = 5
        maximum = 7
Slowest flit = 3
Fragmentation average = 0
        minimum = 0
        maximum = 0
Injected packet rate average = 0.000125029
        minimum = 0 (at node 1)
        maximum = 0.0038866 (at node 62)
Accepted packet rate average = 0.000125029
        minimum = 0 (at node 1)
        maximum = 0.0038866 (at node 62)
Injected flit rate average = 0.000125029
        minimum = 0 (at node 1)
        maximum = 0.0038866 (at node 62)
Accepted flit rate average= 0.000125029
        minimum = 0 (at node 1)
        maximum = 0.0038866 (at node 62)
Injected packet length average = 1
Accepted packet length average = 1
Total in-flight flits = 0 (0 measured)
====== Overall Traffic Statistics ======
====== Traffic class 0 ======
Packet latency average = 5.11429 (1 samples)
        minimum = 5 (1 samples)
        maximum = 7 (1 samples)
Network latency average = 5.11429 (1 samples)
        minimum = 5 (1 samples)
        maximum = 7 (1 samples)
Flit latency average = 5.11429 (1 samples)
        minimum = 5 (1 samples)
        maximum = 7 (1 samples)
Fragmentation average = 0 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0 (1 samples)
Injected packet rate average = 0.000125029 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0.0038866 (1 samples)
Accepted packet rate average = 0.000125029 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0.0038866 (1 samples)
Injected flit rate average = 0.000125029 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0.0038866 (1 samples)
Accepted flit rate average = 0.000125029 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0.0038866 (1 samples)
Injected packet size average = 1 (1 samples)
Accepted packet size average = 1 (1 samples)
Hops average = 1 (1 samples)
----------------------------chLet-DETAILS--------------------------------
Class 0:
Packet latency average = 12.1462
        minimum = 5
        maximum = 75
Network latency average = 12.0273
        minimum = 5
        maximum = 74
Slowest packet = 28
Flit latency average = 12.0273
        minimum = 5
        maximum = 74
Slowest flit = 89
Fragmentation average = 0
        minimum = 0
        maximum = 0
Injected packet rate average = 0.00733025
        minimum = 0.00377229 (at node 4)
        maximum = 0.0585277 (at node 15)
Accepted packet rate average = 0.00733025
        minimum = 0.00377229 (at node 4)
        maximum = 0.0585277 (at node 15)
Injected flit rate average = 0.00733025
        minimum = 0.00377229 (at node 4)
        maximum = 0.0585277 (at node 15)
Accepted flit rate average= 0.00733025
        minimum = 0.00377229 (at node 4)
        maximum = 0.0585277 (at node 15)
Injected packet length average = 1
Accepted packet length average = 1
Total in-flight flits = 0 (0 measured)
====== Overall Traffic Statistics ======
====== Traffic class 0 ======
Packet latency average = 12.1462 (1 samples)
        minimum = 5 (1 samples)
        maximum = 75 (1 samples)
Network latency average = 12.0273 (1 samples)
        minimum = 5 (1 samples)
        maximum = 74 (1 samples)
Flit latency average = 12.0273 (1 samples)
        minimum = 5 (1 samples)
        maximum = 74 (1 samples)
Fragmentation average = 0 (1 samples)
        minimum = 0 (1 samples)
        maximum = 0 (1 samples)
Injected packet rate average = 0.00733025 (1 samples)
        minimum = 0.00377229 (1 samples)
        maximum = 0.0585277 (1 samples)
Accepted packet rate average = 0.00733025 (1 samples)
        minimum = 0.00377229 (1 samples)
        maximum = 0.0585277 (1 samples)
Injected flit rate average = 0.00733025 (1 samples)
        minimum = 0.00377229 (1 samples)
        maximum = 0.0585277 (1 samples)
Accepted flit rate average = 0.00733025 (1 samples)
        minimum = 0.00377229 (1 samples)
        maximum = 0.0585277 (1 samples)
Injected packet size average = 1 (1 samples)
Accepted packet size average = 1 (1 samples)
Hops average = 1 (1 samples)
----------------------------END-of-Interconnect-DETAILS-------------------------


gpgpu_simulation_time = 0 days, 0 hrs, 0 min, 48 sec (48 sec)
gpgpu_simulation_rate = 85145 (inst/sec)
gpgpu_simulation_rate = 182 (cycle/sec)
gpgpu_silicon_slowdown = 6219780x
GPGPU-Sim: *** simulation thread exiting ***
GPGPU-Sim: *** exit detected ***
mnaderan@rtx3080:test$