Not a member of Pastebin yet?
Sign Up,
it unlocks many cool features!
- Profile Statistics.
- Note that counter items are counter values and not time units.
- Device Storage
- =================
- Name Total Count Time (ms) Min Time (ms) Max Time (ms) Avg Time (ms)
- ---- ----------- --------- ------------- ------------- -------------
- Memory: cpu/0 132 0.0000 0.0000 401.6640 200.8320
- Memory: gpu/0 49 5443.5840 200.7040 6347.7759 3073.5359
- MXNET_C_API
- =================
- Name Total Count Time (ms) Min Time (ms) Max Time (ms) Avg Time (ms)
- ---- ----------- --------- ------------- ------------- -------------
- MXNDArrayReshape64 1 0.0290 0.0290 0.0290 0.0290
- MXAutogradBackwardEx 1 0.7010 0.7010 0.7010 0.7010
- MXAutogradSetIsTraining 2 0.0020 0.0010 0.0010 0.0010
- MXAutogradSetIsRecording 2 0.0010 0.0000 0.0010 0.0005
- MXNDArraySyncCopyFromCPU 1 0.0410 0.0410 0.0410 0.0410
- MXNDArrayCreateEx 3 0.0080 0.0010 0.0050 0.0027
- MXNDArrayGetDType 3 0.0010 0.0000 0.0010 0.0003
- MXNDArrayFree 146 0.4990 0.0000 0.0260 0.0034
- MXNDArraySetGradState 8 0.0050 0.0000 0.0010 0.0006
- MXNDArrayGetGradState 8 0.0070 0.0000 0.0010 0.0009
- MXNDArrayGetContext 78 0.0440 0.0000 0.0010 0.0006
- MXNet C API Calls 474 0.4740 0.0010 0.4740 0.2365
- MXNet C API Concurrency 948 0.0000 0.0000 0.0010 0.0005
- MXNDArrayGetShape 69 0.0290 0.0000 0.0010 0.0004
- MXNDArrayAt 64 0.1030 0.0010 0.0030 0.0016
- MXImperativeInvokeEx 88 2.3040 0.0160 0.1300 0.0262
- operator
- =================
- Name Total Count Time (ms) Min Time (ms) Max Time (ms) Avg Time (ms)
- ---- ----------- --------- ------------- ------------- -------------
- sgd_update 8 0.5530 0.0410 0.1330 0.0691
- _backward_Pooling 2 0.3090 0.1540 0.1550 0.1545
- _backward_copy 2 0.1600 0.0800 0.0800 0.0800
- _backward_mul_scalar 2 0.0970 0.0480 0.0490 0.0485
- SetValueOp 2 0.1460 0.0720 0.0740 0.0730
- Convolution 4 1.5370 0.3100 0.4580 0.3842
- stack 2 0.1220 0.0600 0.0620 0.0610
- _backward_Activation 4 0.1930 0.0410 0.0550 0.0483
- pick 2 0.0560 0.0270 0.0290 0.0280
- WaitForVar 2 0.0120 0.0050 0.0070 0.0060
- Activation 6 0.2890 0.0290 0.0800 0.0482
- mean 2 0.0830 0.0410 0.0420 0.0415
- _image_to_tensor 128 0.6190 0.0030 0.0170 0.0048
- _backward_pick 2 0.0850 0.0420 0.0430 0.0425
- Pooling 4 0.2300 0.0380 0.0770 0.0575
- _backward_FullyConnected 4 0.4690 0.1020 0.1320 0.1173
- DeleteVariable 176 0.3850 0.0010 0.0220 0.0022
- _mul_scalar 2 0.0490 0.0240 0.0250 0.0245
- CopyCPU2GPU 4 0.3500 0.0480 0.1260 0.0875
- FullyConnected 4 0.6890 0.1320 0.2130 0.1723
- log_softmax 2 0.0690 0.0340 0.0350 0.0345
- _backward_mean 2 0.0880 0.0430 0.0450 0.0440
- _backward_log_softmax 2 0.0580 0.0280 0.0300 0.0290
Add Comment
Please, Sign In to add comment